首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Comparative genome analyses of close relatives have yielded exciting insight into the sources of microbial genome variability with respect to gene content, gene order and evolution of genes with unknown functions. The genomes of free-living bacteria often carry phages and repetitive sequences that mediate genomic rearrangements in contrast to the small genomes of obligate host-associated bacteria. This suggests that genomic stability correlates with the genomic content of repeated sequences and movable genetic elements, and thereby with bacterial lifestyle. Genes with unknown functions present in a single species tend to be shorter than conserved, functional genes, indicating that the fraction of unique genes in microbial genomes has been overestimated.  相似文献   

2.
Venkatesh B  Dandona N  Brenner S 《Genomics》2006,87(2):307-310
Contrary to previous observations that fish genomes are devoid of nuclear mitochondrial pseudogenes, a genome-wide survey identified a large number of "recent" and "ancient" nuclear mitochondrial DNA fragments (Numts) in the whole-genome sequences of the fugu (Takifugu rubripes), Tetraodon nigroviridis, and zebrafish (Danio rerio). We have analyzed the latest assembly (v4.0) of the fugu genome and show that, like the Anopheles genome, the fugu nuclear genome does not contain mitochondrial pseudogenes. Fugu assembly v4.0 contains a single scaffold representing the near complete sequence of the fugu mitochondria. The "recent" Numts identified by the previous study in fugu assembly v2.0 are in fact shotgun sequences of mitochondrial DNA that were misassembled with the nuclear sequences, whereas the "ancient" Numts appear to be the result of spurious matches. It is likely that the Numts identified in the genomes of Tetraodon and zebrafish are also similar artifacts. Shotgun sequences of whole genomes often include some mitochondrial sequences. Therefore, any Numts identified in shotgun-sequence assemblies should be verified by Southern hybridization or PCR amplification.  相似文献   

3.
Plants contain large mitochondrial genomes, which are several times as complex as those in animals, fungi or algae. However, genome size is not correlated with information content. The mitochondrial genome (mtDNA) of Arabidopsis specifies only 58 genes in 367 kb, whereas the 184 kb mtDNA in the liverwort Marchantia polymorpha codes for 66 genes, and the 58 kb genome in the green alga Prototheca wickerhamii encodes 63 genes. In Arabidopsis’ mtDNA, genes for subunits of complex II, for several ribosomal proteins and for 16 tRNAs are missing, some of which have been transferred recently to the nuclear genome. Numerous integrated fragments originate from alien genomes, including 16 sequence stretches of plastid origin, 41 fragments of nuclear (retro)transposons and two fragments of fungal viruses. These immigrant sequences suggest that the large size of plant mitochondrial genomes is caused by secondary expansion as a result of integration and propagation, and is thus a derived trait established during the evolution of land plants.  相似文献   

4.
Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20 × coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.  相似文献   

5.
Genetic information of human is encoded in two genomes: nuclear and mitochondrial. Both of them reflect molecular evolution of human starting from the beginning of life (about 4.5 billion years ago) until the origin of Homo sapiens species about 100,000 years ago. From this reason human genome contains some features that are common for different groups of organisms and some features that are unique for Homo sapiens. 3.2 x 10(9) base pairs of human nuclear genome are packed into 23 chromosomes of different size. The smallest chromosome - 21st contains 5 x 10(7) base pairs while the biggest one -1st contains 2.63 x 10(8) base pairs. Despite the fact that the nucleotide sequence of all chromosomes is established, the organisation of nuclear genome put still questions: for example: the exact number of genes encoded by the human genome is still unknown giving estimations from 30 to 150 thousand genes. Coding sequences represent a few percent of human nuclear genome. The majority of the genome is represented by repetitiVe sequences (about 50%) and noncoding unique sequences. This part of the genome is frequently wrongly called "junk DNA". The distribution of genes on chromosomes is irregular, DNA fragments containing low percentage of GC pairs code lower number of genes than the fragments of high percentage of GC pairs.  相似文献   

6.
Trifonov EN 《Genetika》2002,38(6):793-798
Numerous observations, measurements and calculations strongly indicate that both eukaryotic and prokaryotic genomes are built as linear arrays of units of rather uniform size, about 400 base pairs. The units are likely to correspond to early individual genes that existed, presumably, in form of DNA circles. Their combinatorial fusion resulted eventually in formation of the early segmented genomes. The segmented structure of the genomes is, apparently, still maintained by some structural selection pressures. Some of the units can be recognized in the sequences by characteristic sequence motifs at the borders of the units. Identification and characterization of the units, their mapping on the genomes should become an important prerequisite of genome comparisons and genome evolution studies.  相似文献   

7.
Horizontal gene transfer, a process through which genomes acquire sequences from distantly related organisms, is believed to be a major source of genetic diversity in bacteria. A central question concerning the impact of gene transfer on bacterial genome evolution is the proportion of horizontally transferred sequences within genomes. Through BLAST search, we found that the genomes of two phytopathogens, Xanthomonas campestris pv. campestris and Xanthomonas axonopodis pv. citri, have close to 40% of the genes with the highest similarity to genes from phylogenetically distant organisms (non-gamma-proteobacteria). Most of these genes are found to be contiguous in the genome, forming genome islands, which may have been transferred from other organisms. Overall, the total number of genes within genome islands corresponds to almost one quarter of the entire xanthomonad genomes. Interestingly, many of the genes in these islands are functionally related to plant pathogenesis and virulence. Thus, these results suggest that horizontally transferred genes are clustered in the genome, and may facilitate fitness in new environments, as in the case of plant-bacteria interaction.  相似文献   

8.
9.
Ma PF  Guo ZH  Li DZ 《PloS one》2012,7(1):e30297

Background

Compared to their counterparts in animals, the mitochondrial (mt) genomes of angiosperms exhibit a number of unique features. However, unravelling their evolution is hindered by the few completed genomes, of which are essentially Sanger sequenced. While next-generation sequencing technologies have revolutionized chloroplast genome sequencing, they are just beginning to be applied to angiosperm mt genomes. Chloroplast genomes of grasses (Poaceae) have undergone episodic evolution and the evolutionary rate was suggested to be correlated between chloroplast and mt genomes in Poaceae. It is interesting to investigate whether correlated rate change also occurred in grass mt genomes as expected under lineage effects. A time-calibrated phylogenetic tree is needed to examine rate change.

Methodology/Principal Findings

We determined a largely completed mt genome from a bamboo, Ferrocalamus rimosivaginus (Poaceae), through Illumina sequencing of total DNA. With combination of de novo and reference-guided assembly, 39.5-fold coverage Illumina reads were finally assembled into scaffolds totalling 432,839 bp. The assembled genome contains nearly the same genes as the completed mt genomes in Poaceae. For examining evolutionary rate in grass mt genomes, we reconstructed a phylogenetic tree including 22 taxa based on 31 mt genes. The topology of the well-resolved tree was almost identical to that inferred from chloroplast genome with only minor difference. The inconsistency possibly derived from long branch attraction in mtDNA tree. By calculating absolute substitution rates, we found significant rate change (∼4-fold) in mt genome before and after the diversification of Poaceae both in synonymous and nonsynonymous terms. Furthermore, the rate change was correlated with that of chloroplast genomes in grasses.

Conclusions/Significance

Our result demonstrates that it is a rapid and efficient approach to obtain angiosperm mt genome sequences using Illumina sequencing technology. The parallel episodic evolution of mt and chloroplast genomes in grasses is consistent with lineage effects.  相似文献   

10.
Analysis of evolution of paralogous genes in a genome is central to our understanding of genome evolution. Comparison of closely related bacterial genomes, which has provided clues as to how genome sequences evolve under natural conditions, would help in such an analysis. With species Staphylococcus aureus, whole-genome sequences have been decoded for seven strains. We compared their DNA sequences to detect large genome polymorphisms and to deduce mechanisms of genome rearrangements that have formed each of them. We first compared strains N315 and Mu50, which make one of the most closely related strain pairs, at the single-nucleotide resolution to catalogue all the middle-sized (more than 10 bp) to large genome polymorphisms such as indels and substitutions. These polymorphisms include two paralogous gene sets, one in a tandem paralogue gene cluster for toxins in a genomic island and the other in a ribosomal RNA operon. We also focused on two other tandem paralogue gene clusters and type I restriction-modification (RM) genes on the genomic islands. Then we reconstructed rearrangement events responsible for these polymorphisms, in the paralogous genes and the others, with reference to the other five genomes. For the tandem paralogue gene clusters, we were able to infer sequences for homologous recombination generating the change in the repeat number. These sequences were conserved among the repeated paralogous units likely because of their functional importance. The sequence specificity (S) subunit of type I RM systems showed recombination, likely at the homology of a conserved region, between the two variable regions for sequence specificity. We also noticed novel alleles in the ribosomal RNA operons and suggested a role for illegitimate recombination in their formation. These results revealed importance of recombination involving long conserved sequence in the evolution of paralogous genes in the genome.  相似文献   

11.
Through their enabling of simultaneous identification of multiple non-essential genes in a genome, large-segment genome deletion methods are an increasingly popular approach to minimize and tailor microbial genomes for specific functions. At present, difficulties in identifying target regions for deletion are a result of inadequate knowledge to define gene essentiality. Furthermore, with the majority of predicted open reading frames of completely sequenced genomes still annotated as putative genes, essential or important genes are found scattered throughout the genomes, limiting the size of non-essential segments that can be safely deleted in a single sweep. Recently described large-segment random genome deletion methods that utilize transposons enable the generation of random deletion strains, analysis of which makes identification of non-essential genes less tedious. Such and other efforts to determine the minimum genome content necessary for cell survival continue to accumulate important information that should help improve our understanding of genome function and evolution. This review presents an assessment of technological advancements of random genome deletion methods in prokaryotes to date.  相似文献   

12.
13.
Chloroplast genomes have been widely used in studying plant phylogeny and evolution. Several chloroplast genome visualization tools have been developed to display the distribution of genes on the genome. However, these tools do not draw features, such as exons, introns, repetitive elements, and variable sites, disallowing in-depth examination of the genome structures. Here, we developed and validated a software package called Chloroplast Genome Viewers (CPGView). CPGView can draw three maps showing (i) the distributions of genes, variable sites, and repetitive sequences, including microsatellites, tandem and dispersed repeats; (ii) the structure of the cis-splicing genes after adjusting the exon-intron boundary positions using a coordinate scaling algorithm, and (iii) the structure of the trans-splicing gene rps12. To test the accuracy of CPGView, we sequenced, assembled, and annotated 31 chloroplast genomes from 31 genera of 22 families. CPGView drew maps correctly for all the 31 chloroplast genomes. Lastly, we used CPGView to examine 5998 publicly released chloroplast genomes from 2513 genera of 553 families. CPGView succeeded in plotting maps for 5882 but failed to plot maps for 116 chloroplast genomes. Further examination showed that the annotations of these 116 genomes had various errors needing manual correction. The test on newly generated data and publicly available data demonstrated the ability of CPGView to identify errors in the annotations of chloroplast genomes. CPGView will become a widely used tool to study the detailed structure of chloroplast genomes. The web version of CPGView can be accessed from http://www.1kmpg.cn/cpgview .  相似文献   

14.
Charles Darwin believed that all traits of organisms have been honed to near perfection by natural selection. The empirical basis underlying Darwin's conclusions consisted of numerous observations made by him and other naturalists on the exquisite adaptations of animals and plants to their natural habitats and on the impressive results of artificial selection. Darwin fully appreciated the importance of heredity but was unaware of the nature and, in fact, the very existence of genomes. A century and a half after the publication of the "Origin", we have the opportunity to draw conclusions from the comparisons of hundreds of genome sequences from all walks of life. These comparisons suggest that the dominant mode of genome evolution is quite different from that of the phenotypic evolution. The genomes of vertebrates, those purported paragons of biological perfection, turned out to be veritable junkyards of selfish genetic elements where only a small fraction of the genetic material is dedicated to encoding biologically relevant information. In sharp contrast, genomes of microbes and viruses are incomparably more compact, with most of the genetic material assigned to distinct biological functions. However, even in these genomes, the specific genome organization (gene order) is poorly conserved. The results of comparative genomics lead to the conclusion that the genome architecture is not a straightforward result of continuous adaptation but rather is determined by the balance between the selection pressure, that is itself dependent on the effective population size and mutation rate, the level of recombination, and the activity of selfish elements. Although genes and, in many cases, multigene regions of genomes possess elaborate architectures that ensure regulation of expression, these arrangements are evolutionarily volatile and typically change substantially even on short evolutionary scales when gene sequences diverge minimally. Thus, the observed genome architectures are, mostly, products of neutral processes or epiphenomena of more general selective processes, such as selection for genome streamlining in successful lineages with large populations. Selection for specific gene arrangements (elements of genome architecture) seems only to modulate the results of these processes.  相似文献   

15.
Sequence organization of the mitochondrial genome of yeast--a review   总被引:3,自引:0,他引:3  
M de Zamaroczy  G Bernardi 《Gene》1985,37(1-3):1-17
We have compiled the available primary structural data for the mitochondrial genome of Saccharomyces cerevisiae and have estimated the size of the remaining gaps, which represent 12-13% of the genome. The lengths of sequenced regions and of gaps lead to a new assessment of genome sizes; these range (in round figures) from 85 000 bp for the long genomes, to 78 000 bp for the short genomes, to 74 000 bp for the supershort genome of Saccharomyces carlsbergensis. These values are 8-11% higher than those previously estimated from restriction fragments. Interstrain differences concern not only facultative intervening sequences (introns) and mini-inserts, but also insertions/deletions in intergenic sequences. The primary structure appears to be extremely conserved in genes and ori sequences, and highly conserved in intergenic sequences. Since coding sequences represent at most 33-35% of the genome, at least two thirds of the genome are formed by noncoding and yet highly conserved sequences. The G + C level of genes or exon is 25%, and that of intronic open reading frames (ORFs) 22%; increasingly lower values are shown by intronic closed reading frames (CRFs), 20%, ori sequences, 19%, intergenic ORFs, 17.5% and intergenic sequences, 15%.  相似文献   

16.

Background

Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes?

Results

A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped with high sensitivity and low specificity onto the frameworks of the human, dog, and cow genomes. To maximize genome coverage, the coordinates of all BAC end sequence hits to the cow and dog genomes were also converted to the equivalent human genome coordinates. The 84,624 sheep BACs (about 5.4-fold genome coverage) with paired ends in the correct orientation (tail-to-tail) and spacing, combined with information from sheep BAC comparative genome contigs (CGCs) built separately on the dog and cow genomes, were used to construct 1,172 sheep BAC-CGCs, covering 91.2% of the human genome. Clustered non-tail-to-tail and outsize BACs located close to the ends of many BAC-CGCs linked BAC-CGCs covering about 70% of the genome to at least one other BAC-CGC on the same chromosome. Using the BAC-CGCs, the intrachromosomal and interchromosomal BAC-CGC linkage information, human/cow and vertebrate synteny, and the sheep marker map, a virtual sheep genome was constructed. To identify BACs potentially located in gaps between BAC-CGCs, an additional set of 55,668 sheep BACs were positioned on the sheep genome with lower confidence. A coordinate conversion process allowed us to transfer human genes and other genome features to the virtual sheep genome to display on a sheep genome browser.

Conclusion

We demonstrate that limited sequencing of BACs combined with positioning on a well assembled genome and integrating locations from other less well assembled genomes can yield extensive, detailed subgene-level maps of mammalian genomes, for which genomic resources are currently limited.  相似文献   

17.
There are many ways to group completed genome sequences in hierarchical patterns (trees) reflecting relationships between their genes. Such groupings help us organize biological information and bear crucially on underlying processes of genome and organismal evolution. Genome trees make use of all comparable genes but can variously weight the contributions of these genes according to similarity, congruent patterns of similarity, or prevalence among genomes. Here we explore such possible weighting strategies, in an analysis of 142 prokaryotic and 5 eukaryotic genomes. We demonstrate that alternate weighting strategies have different advantages, and we propose that each may have its specific uses in systematic or evolutionary biology. Comparisons of results obtained with different methods can provide further clues to major events and processes in genome evolution.  相似文献   

18.
The red bayberry genome and genetic basis of sex determination   总被引:2,自引:0,他引:2  
Morella rubra, red bayberry, is an economically important fruit tree in south China. Here, we assembled the first high‐quality genome for both a female and a male individual of red bayberry. The genome size was 313‐Mb, and 90% sequences were assembled into eight pseudo chromosome molecules, with 32 493 predicted genes. By whole‐genome comparison between the female and male and association analysis with sequences of bulked and individual DNA samples from female and male, a 59‐Kb region determining female was identified and located on distal end of pseudochromosome 8, which contains abundant transposable element and seven putative genes, four of them are related to sex floral development. This 59‐Kb female‐specific region was likely to be derived from duplication and rearrangement of paralogous genes and retained non‐recombinant in the female‐specific region. Sex‐specific molecular markers developed from candidate genes co‐segregated with sex in a genetically diverse female and male germplasm. We propose sex determination follow the ZW model of female heterogamety. The genome sequence of red bayberry provides a valuable resource for plant sex chromosome evolution and also provides important insights for molecular biology, genetics and modern breeding in Myricaceae family.  相似文献   

19.
Extant genomes are the result of repeated duplications and subsequent divergence of primordial genes that assembled the genomes of the first living beings. Increased information on genome maps of different species is revealing conserved syntenies among different vertebrate taxa, which allow to trace back the history of current chromosomes. However, inferring neighboring relationships between genes of more primitive genomes has proven to be very difficult. Most often, the ancestral arrangements of genes have been lost by multiple histories of internal duplications, chromosomal breaks, and large-scale genomic rearrangements. Here we describe a gene arrangement of nonrelated genes that seems to have endured evolution, at least from the separation of the two major clades of bilateria: deuterostomia and protostomia, approximately 1 billion years ago. In its simplest conception, this gene cluster, named EVG, groups the genes for a glucose transporter, an enolase, and a vesicle-associated membrane protein (VAMP). EVG might represent the evolutionary remnants of the gene organization of an ancient bilaterian genome.  相似文献   

20.
Butterflyfish are among the most iconic of the coral reef fishes and represent a model system to study general questions of biogeography, evolution and population genetics. We assembled and annotated the genome sequence of the blacktail butterflyfish (Chaetodon austriacus), an Arabian region endemic species that is reliant on coral reefs for food and shelter. Using available bony fish (superclass Osteichthyes) genomes as a reference, a total of 28 926 high‐quality protein‐coding genes were predicted from 13 967 assembled scaffolds. The quality and completeness of the draft genome of C. austriacus suggest that it has the potential to serve as a resource for studies on the co‐evolution of reef fish adaptations to the unique Red Sea environment, as well as a comparison of gene sequences between closely related congeneric species of butterflyfish distributed more broadly across the tropical Indo‐Pacific.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号