首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
The Bombyx mori genome: analysis by DNA reassociation kinetics   总被引:3,自引:0,他引:3  
The size and nucleotide sequence complexity of the Bombyx mori genome has been determined from the kinetics of reassociation of its DNA. Nonrepeated DNA comprises 55% of the genome, and the remainder is divided equally between sequences repeated roughly 500 and 50000 times. Non-repeated sequence DNA virtually free of repeated sequences was prepared by partial reassociation and subsequent fractionation on hydroxyapatite. The nucleotide sequence complexity of this component was determined relative to DNA from B. subtilis and E. coli. After correction for the size of the repeated sequence fraction, the DNA content of the Bombyx mori genome was calculated to be 0.53±0.02×10?12 g. This value compares favorably with the DNA content of haploid B. mori spermatids and mature sperm determined cytophotometrically by Rasch (1973).  相似文献   

3.
4.
5.
6.

Background

Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions.

Results

We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association.

Conclusions

Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies.  相似文献   

7.
Halalkalicoccus jeotgali B3T, isolated from salt-fermented seafood from South Korea, is an extremely halophilic archaeon belonging to the family Halobacteriaceae. Here, we present the complete genome sequence of the type strain H. jeotgali B3T (3,698,650 bp, with a G+C content of 62.5%), which consists of one chromosome and six plasmids. This is the first complete genome sequence of the Halalkalicoccus species.Extremely halophilic archaea (haloarchaea) are adapted to hypersaline environments and grow optimally in NaCl solutions of 2.6 M or higher (12). These haloarchaea are classified within the family Halobacteriaceae in the order Halobacteriales; currently, this family comprises 28 genera (3), and only 11 complete genome sequences in Halobacteriaceae have been reported. In a study of archaeal diversity in salt-fermented small shrimp or shellfish from South Korea, our laboratory isolated and characterized 5 novel, extremely halophilic archaeal strains of Halobacteriaceae. These strains included Natronococcus jeotgali (9), Halalkalicoccus jeotgali (11), Halorubrum cibi (7), Haloterrigena jeotgali (10) and Haladaptatus cibarius (8). We have now sequenced the genome of Halalkalicoccus jeotgali B3T; genome sequencing had not been completed or initiated for any strain in this genus when our sequencing project was begun. The genus Halalkalicoccus currently contains only two species, Halalkalicoccus tibetensis (13) and H. jeotgali, and these species exhibit 98.6% gene sequence similarity in their 16S rRNA. The genome of H. jeotgali B3T is the first of this genus to be sequenced.The complete genome sequence of H. jeotgali B3T was determined by a whole-genome shotgun strategy using Roche 454 GS (FLX Titanium) pyrosequencing (898,168 reads totaling ∼348 Mb; ∼94-fold coverage of the genome) and a fosmid library (514 reads totaling ∼680 kb) at the Genome Resource Center, KRIBB (Korea Research Institute of Bioscience and Biotechnology). Genome sequences from pyrosequencing were processed by Roche''s software according to the manufacturer''s instructions, and sequences from the fosmid library were processed by PESTAS (6). A total of 898,196 reads were assembled using Newbler Assembler 2.3 (454 Life Science), which generated 54 large contigs (>100 bp in size) with bases having quality scores of 40 and above. The gaps between contigs were closed by primer walking and sequencing of PCR products across the gaps. The annotation was done by merging results obtained from the RAST (Rapid Annotation using Subsystem Technology) pipeline (1), Glimmer 3.02 (2), tRNAscan-SE 1.21 (5), and RNAmmer 1.2 (4).The H. jeotgali B3T genome is 3,698,650 bases long with a 62.5% G+C content. The chromosome consists of a single circular chromosome (2,809,118 bp, with a G+C content of 65.0%) and six plasmids (406,285 bp, 55.3%; 363,534 bp, 54.2%; 44,576 bp, 58.9%; 44,459 bp, 54.9%; 23,727 bp, 47.6%; 6,951 bp, 60.6%). The genome contains 3,860 predicted coding sequences and 52 RNA genes (determined using RAST). The chromosome is predicted to contain 3,101 coding sequences with a coding intensity of 90.0%, including 47 tRNA genes, 1 5S rRNA gene, 1 16S rRNA gene, and 1 23S rRNA gene. The largest plasmid contains 466 coding sequences with a coding intensity of 81.2% and 2 tRNA genes, while the other five plasmids contain 425, 44, 48, 29, and 5 coding sequences with coding intensities of 80.2%, 84.2%, 83.0%, 69.6%, and 22.8%, respectively (determined using Glimmer3). More detailed analysis of this genome and comparative analysis with other haloarchaea will provide further insight into the genomic differences and metabolism of the extremely halophilic archaea.  相似文献   

8.
Single-nucleotide polymorphisms (SNPs)are molecular markers based on nucleotide variation and can be used for genotyping assays across populations and to track genomic inheritance. SNPs offer a comprehensive genotyping alternative to whole-genome sequencing for both agricultural and research purposes including molecular breeding and diagnostics, genome evolution and genetic diversity analyses, genetic mapping, and trait association studies. Here genomic SNPs were discovered between four cultivars of the important amphidiploid oilseed species Brassica napus and used to develop a B. napus Infinium? array containing 5,306 SNPs randomly dispersed across the genome. Assay success was high, with >94 % of these producing a reproducible, polymorphic genotype in the 1,070 samples screened. Although the assay was designed to B. napus, successful SNP amplification was achieved in the B. napus progenitor species, Brassica rapa and Brassica oleracea, and to a lesser extent in the related species Brassica nigra. Phylogenetic analysis was consistent with the expected relationships between B. napus individuals. This study presents an efficient custom SNP assay development pipeline in the complex polyploid Brassica genome and demonstrates the utility of the array for high-throughput genotyping in a number of related Brassica species. It also demonstrates the utility of this assay in genotyping resistance genes on chromosome A7, which segregate amongst the 1,070 samples.  相似文献   

9.
Ralstonia paucula strain RA4T, a gram negative, non-spore forming, motile bacterium having positive catalase and oxidase test, was isolated from surface soil. Twin arginine translocation protein type D (TatD) is shown to be located in cytoplasm and exhibits magnesium-dependent DNase. A tatD DNase gene was isolated and cloned from Ralstonia paucula RA4T genome. Nucleotide sequence analysis of the gene revealed 813 nucleotides encoding a protein of 270 amino acid residues. The tatD gene showed a high similarity to homolog gene from Ralstonia pickettii strain 12D. The deduced polypeptide sequence of TatD DNase from R. paucula RA4T had a typical catalytic site, HHPLDEHRHDP, and its calculated molecular mass and predicted isoelectric point were 29616 Da and 5.33, respectively. The deduced amino acid sequence showed a high degree of similarity to TatD DNase isoforms from Ralstonia genus and other sources. Predicted three-dimensional structure of TatD confirmed the presence of active site and theoretical function as DNase.  相似文献   

10.
11.
12.
We offer a guide to de novo genome assembly1 using sequence data generated by the Illumina platform for biologists working with fungi or other organisms whose genomes are less than 100 Mb in size. The guide requires no familiarity with sequencing assembly technology or associated computer programs. It defines commonly used terms in genome sequencing and assembly; provides examples of assembling short-read genome sequence data for four strains of the fungus Grosmannia clavigera using four assembly programs; gives examples of protocols and software; and presents a commented flowchart that extends from DNA preparation for submission to a sequencing center, through to processing and assembly of the raw sequence reads using freely available operating systems and software.  相似文献   

13.
Channel catfish, Ictalurus punctatus, T cell receptors (TCR) γ and δ were identified by mining of expressed sequence tag databases, and full-length sequences were obtained by 5′-RACE and RT-PCR protocols. cDNAs for each of these TCR chains encode typical variable (V), diversity (D), joining (J), and constant (C) regions. Three TCRγ V families, seven TCRγ J sequences, and three TCRγ C sequences were identified from sequencing of cDNA. Primer walking on bacterial artificial chromosomes (BACs) confirmed that the TRG locus contained seven TRGJ segments and indicated that the locus consists of (Vγ3-Jγ6-Cγ2)–(Vγ1n-Jγ7-Cγ3)–(Vγ2-Jγ5-Jγ4-Jγ3-Jγ2-Jγ1-Cγ1). In comparison for TCRδ, two V families, four TCRδ D sequences, one TCRδ J sequence, and one TCRδ C sequence were identified by cDNA sequencing. Importantly, the finding that some catfish TCRδ cDNAs contain TCR Vα-D-Jδ rearrangements and some TCRα cDNAs contain Vδ-Jα rearrangements strongly implies that the catfish TRA and TRD loci are linked. Finally, primer walking on BACs and Southern blotting suggest that catfish have four TRDD gene segments and a single TRDJ and TRDC gene. As in most vertebrates, all three reading frames of each of the catfish TRDD segments can be used in functional rearrangements, and more than one TRDD segment can be used in a single rearrangement. As expected, catfish TCRδ CDR3 regions are longer and more diverse than TCRγ CDR3 regions, and as a group they utilize more nucleotide additions and contain more nucleotide deletions than catfish TCRγ rearrangements.  相似文献   

14.
15.
16.
17.
Prunus ×yedoensis Matsum. ‘Somei-yoshino’ is the most common and widespread cultivar of the ornamental flowering cherries. We hereby report its complete chloroplast (cp) genome sequences generated by whole-genome next-generation sequencing approach. The cp genome size was 157,792 bp in length consisting of four regions; large single-copy region (85,914 bp), small single-copy region (19,120 bp), and a pair of inverted repeat regions (26,379 bp). The genome contained a total of 131 genes, including 86 coding genes, 8 rRNA genes, and 37 tRNA genes. A total of 92 simple sequence repeats (SSRs) were detected within the cp genome. Its molecular features were compared with the complete cp genome of wild P. yedoensis, which occurs rarely in natural habitats of Mt. Halla in Jeju Island, Korea, displaying nearly indistinguishable morphology as P. ×yedoensis ‘Somei-yoshino’. Although both cp genomes were structured highly alike, the sequence variations between them were revealed in several single-nucleotide polymorphisms (SNPs). Using additional individuals of wild and cultivated flowering cherries, PCR amplification confirmed that those SNPs were phylogenetically informative, providing distinction between wild and cultivated flowering cherries. In future study, the SNPs and SSRs reported in this study could be used to identify wild individuals from morphologically identical cultivars of flowering cherries and also to conserve the genetic diversity of wild flowering cherries in Jeju Island.  相似文献   

18.
Sixty years after Watson and Crick published the double helix model of DNA''s structure, thirteen members of Genome Biology''s Editorial Board select key advances in the field of genome biology subsequent to that discovery.April 25th 2013 is the sixtieth anniversary of the infamous Watson and Crick Nature paper describing a model for the structure of DNA, published 25 April 1953: the now infamous ''double helix'' [1]. Two accompanying papers from Rosalind Franklin, Maurice Wilkins and colleagues leant experimental support to the proposed structure in the form of X-ray diffraction data [2,3], as described elsewhere in this issue of Genome Biology [4]. The model was a landmark discovery in the history of modern science, and was notable for its cross-disciplinary importance: the question addressed was of immense biological importance, but it was physicists and chemists whose expertise and techniques were needed in order to arrive at an answer. One of these physicists, Ray Gosling, describes the unveiling of Watson and Crick''s double helix structure as a ''eureka'' moment [4]: its simplicity and elegance were striking, and not only explained the X-ray diffraction data but also the mode of replication of life itself. It is rare for a scientific discovery to achieve such an iconic status, to pervade popular culture and the public consciousness, as well as to become an emblem of scientific inquiry - as exemplified by Genome Biology''s double helix-inspired logo. Although Avery had already shown DNA to be the genetic material [5], it took the convincing simplicity of Watson and Crick''s double helix for this notion to widely take hold, in place of theories favoring proteins. The discovery, therefore, had many important implications, and set the scene for future breakthroughs in the field of genome biology.To celebrate sixty years of such discoveries, we asked a jury composed of Genome Biology Editorial Board members to select key advances in the field since 25 April 1953. The brief was to choose a development that was either the most important or the most surprising, or that had the most personal impact, and to briefly summarize why. A number of selections focused on technological advances - from restriction mapping through microarrays and high-throughput sequencing. These technologies have clearly done much to inform our understanding of the biology of genomes. The most popular choice, however, was the discovery of introns. Much like the double helix, this discovery had something of the ''X factor'' to it: biologists trained in the post-intron era may take the concept of gene fragmentation for granted, but at the time it was a truly radical and paradigm-shifting idea. The sense of surprise made a strong impression on those old enough to remember the discovery, and one of the groups involved went so far as to describe it as ''amazing'' in the title of their paper [6].  相似文献   

19.
Prokaryotes in the superficial sediments are ecologically important microorganisms that are responsible for the decomposition, mineralization and subsequent recycling of organic matter. The aim of this study was to explore the phylogenetic and functional diversity of halophilic archaea and bacteria isolated from the superficial sediments of solar salterns at Sfax, Tunisia. Sixty four strains were isolated from crystallizer (TS18) and non-crystallizer (M1) ponds and submitted to genotypic characterization and evaluation by amplified ribosomal RNA restriction analysis (ARDRA) techniques. Our findings revealed that the archaeal diversity observed for 29 isolates generated five distinct patterns from the non-crystallizer M1 pond, with Halorubrum chaoviator as the most prevalent cultivable species. However, in the TS18 crystallizer pond, ten restriction patterns were observed, with the prevalence of haloarchaea EB27K, a not yet identified genotype. The construction of a neighbour-joining tree of 16S rRNA gene sequences resulted in the division of the potential new species into two major groups, with four strains closely related to the sequence of the unculturable haloarchaeon EB27K and one strain to the recently described Halovenus aranensis strain. The 35 bacterial strains observed in this work were present only in the non-crystallizer pond (M1) and presented two distinct ARDRA patterns. These strains belonged to the γ-proteobacteria subdivision, with members of Salicola marasensis (83 %) being the most predominant species among the isolates. 16S rRNA gene sequencing revealed that Salicola strains displayed different degrees of homogeneity. The results from pulsed field gel electrophoresis assays showed that the Salicola isolates could be clustered in two distinct groups with different genome sizes.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号