首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The complete sequenced genomes of chloroplast have provided much information on the origin and evolution of this organelle. In this paper we attempt to use these sequences to test a novel approach for phylogenetic analysis of complete genomes based on correlation analysis of compositional vectors. All protein sequences from 21 complete chloroplast genomes are analyzed in comparison with selected archaea, eubacteria, and eukaryotes. The distance-based analysis shows that the chloroplast genomes are most closely related to cyanobacteria, consistent with the endosymbiotic origin of chloroplasts. The chloroplast genomes are separated to two major clades corresponding to chlorophytes (green plants) s.l. and rhodophytes (red algae) s.l. The interrelationships among the chloroplasts are largely in agreement with the current understanding on chloroplast evolution. For instance, the analysis places the chloroplasts of two chromophytes (Guillardia and Odontella) within the rhodophyte lineage, supporting secondary endosymbiosis as the source of these chloroplasts. The relationships among the green algae and land plants in our tree also agree with results from traditional phylogenetic analyses. Thus, this study establishes the value of our simple correlation analysis in elucidating the evolutionary relationships among genomes. It is hoped that this approach will provide insights on comparative genome analysis.  相似文献   

2.
There is currently no consensus on the evolutionary origin of eukaryotes. In the search of the ancestors of eukaryotes, we analyzed the phylogeny of 46 genomes, including those of 2 eukaryotes, 8 archaea, and 36 eubacteria. To avoid the effects of gene duplications, we used inparalog pairs of genes with orthologous relationships. First, we grouped these inparalogs into the functional categories of the nucleus, cytoplasm, and mitochondria. Next, we counted the sister groups of eukaryotes in prokaryotic phyla and plotted them on a standard phylogenetic tree. Finally, we used Pearson's chi-square test to estimate the origin of the genomes from specific prokaryotic ancestors. The results suggest the eukaryotic nuclear genome descends from an archaea that was neither euryarchaeota nor crenarchaeota and that the mitochondrial genome descends from alpha-proteobacteria. In contrast, genes related to the cytoplasm do not appear to originate from a specific group of prokaryotes.  相似文献   

3.
The genomic peculiarities among microbial eukaryotes challenge the conventional wisdom of genome evolution. Currently, many studies and textbooks explore principles of genome evolution from a limited number of eukaryotic lineages, focusing often on only a few representative species of plants, animals and fungi. Increasing emphasis on studies of genomes in microbial eukaryotes has and will continue to uncover features that are either not present in the representative species (e.g. hypervariable karyotypes or highly fragmented mitochondrial genomes) or are exaggerated in microbial groups (e.g. chromosomal processing between germline and somatic nuclei). Data for microbial eukaryotes have emerged from recent genome sequencing projects, enabling comparisons of the genomes from diverse lineages across the eukaryotic phylogenetic tree. Some of these features, including amplified rDNAs, subtelomeric rDNAs and reduced genomes, appear to have evolved multiple times within eukaryotes, whereas other features, such as absolute strand polarity, are found only within single lineages.  相似文献   

4.
The origin of eukaryotic cells is one of the most fascinating challenges in biology, and has inspired decades of controversy and debate. Recent work has led to major upheavals in our understanding of eukaryotic origins and has catalysed new debates about the roles of endosymbiosis and gene flow across the tree of life. Improved methods of phylogenetic analysis support scenarios in which the host cell for the mitochondrial endosymbiont was a member of the Archaea, and new technologies for sampling the genomes of environmental prokaryotes have allowed investigators to home in on closer relatives of founding symbiotic partners. The inference and interpretation of phylogenetic trees from genomic data remains at the centre of many of these debates, and there is increasing recognition that trees built using inadequate methods can prove misleading, whether describing the relationship of eukaryotes to other cells or the root of the universal tree. New statistical approaches show promise for addressing these questions but they come with their own computational challenges. The papers in this theme issue discuss recent progress on the origin of eukaryotic cells and genomes, highlight some of the ongoing debates, and suggest possible routes to future progress.  相似文献   

5.
Bornaviruses are the only animal RNA viruses that establish a persistent infection in their host cell nucleus. Studies of bornaviruses have provided unique information about viral replication strategies and virus–host interactions. Although bornaviruses do not integrate into the host genome during their replication cycle, we and others have recently reported that there are DNA sequences derived from the mRNAs of ancient bornaviruses in the genomes of vertebrates, including humans, and these have been designated endogenous borna-like (EBL) elements. Therefore, bornaviruses have been interacting with their hosts as driving forces in the evolution of host genomes in a previously unexpected way. Studies of EBL elements have provided new models for virology, evolutionary biology and general cell biology. In this review, we summarize the data on EBL elements including what we have newly identified in eukaryotes genomes, and discuss the biological significance of EBL elements, with a focus on EBL nucleoprotein elements in mammalian genomes. Surprisingly, EBL elements were detected in the genomes of invertebrates, suggesting that the host range of bornaviruses may be much wider than previously thought. We also review our new data on non-retroviral integration of Borna disease virus.  相似文献   

6.
Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www. ncbi.nlm. nih.gov/COG). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56-83% of the gene products from each of the complete bacterial and archaeal genomes and approximately 35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes.  相似文献   

7.
Evolutionary innovation in eukaryotes and especially animals is at least partially driven by genome rearrangements and the resulting emergence of proteins with new domain combinations, and thus potentially novel functionality. Given the random nature of such rearrangements, one could expect that proteins with particularly useful multidomain combinations may have been rediscovered multiple times by parallel evolution. However, existing reports suggest a minimal role of this phenomenon in the overall evolution of eukaryotic proteomes. We assembled a collection of 172 complete eukaryotic genomes that is not only the largest, but also the most phylogenetically complete set of genomes analyzed so far. By employing a maximum parsimony approach to compare repertoires of Pfam domains and their combinations, we show that independent evolution of domain combinations is significantly more prevalent than previously thought. Our results indicate that about 25% of all currently observed domain combinations have evolved multiple times. Interestingly, this percentage is even higher for sets of domain combinations in individual species, with, for instance, 70% of the domain combinations found in the human genome having evolved independently at least once in other species. We also show that previous, much lower estimates of this rate are most likely due to the small number and biased phylogenetic distribution of the genomes analyzed. The process of independent emergence of identical domain combination is widespread, not limited to domains with specific functional categories. Besides data from large-scale analyses, we also present individual examples of independent domain combination evolution. The surprisingly large contribution of parallel evolution to the development of the domain combination repertoire in extant genomes has profound consequences for our understanding of the evolution of pathways and cellular processes in eukaryotes and for comparative functional genomics.  相似文献   

8.
Transfer RNA genes tend to be presented in multiple copies in the genomes of most organisms, from bacteria to eukaryotes. The evolution and genomic structure of tRNA genes has been a somewhat neglected area of molecular evolution. Escherichia coli, the first phylogenetic species for which more than two different strains have been sequenced, provides an invaluable framework to study the evolution of tRNA genes. In this work, a detailed analysis of the tRNA structure of the genomes of Escherichia coli strains K12, CFT073, and O157:H7, Shigella flexneri 2a 301, and Salmonella typhimurium LT2 was carried out. A phylogenetic analysis of these organisms was completed, and an archaeological map depicting the main events in the evolution of tRNA genes was drawn. It is shown that duplications, deletions, and horizontal gene transfers are the main factors driving tRNA evolution in these genomes. On average, 0.64 tRNA insertions/duplications occur every million years (Myr) per genome per lineage, while deletions occur at the slower rate of 0.30 per million years per genome per lineage. This work provides a first genomic glance at the problem of tRNA evolution as a repetitive process, and the relationship of this mechanism to genome evolution and codon usage is discussed.  相似文献   

9.
While there is compelling evidence for the impact of endosymbiotic gene transfer (EGT; transfer from either mitochondrion or chloroplast to the nucleus) on genome evolution in eukaryotes, the role of interdomain transfer from bacteria and/or archaea (i.e. prokaryotes) is less clear. Lateral gene transfers (LGTs) have been argued to be potential sources of phylogenetic information, particularly for reconstructing deep nodes that are difficult to recover with traditional phylogenetic methods. We sought to identify interdomain LGTs by using a phylogenomic pipeline that generated 13 465 single gene trees and included up to 487 eukaryotes, 303 bacteria and 118 archaea. Our goals include searching for LGTs that unite major eukaryotic clades, and describing the relative contributions of LGT and EGT across the eukaryotic tree of life. Given the difficulties in interpreting single gene trees that aim to capture the approximately 1.8 billion years of eukaryotic evolution, we focus on presence–absence data to identify interdomain transfer events. Specifically, we identify 1138 genes found only in prokaryotes and representatives of three or fewer major clades of eukaryotes (e.g. Amoebozoa, Archaeplastida, Excavata, Opisthokonta, SAR and orphan lineages). The majority of these genes have phylogenetic patterns that are consistent with recent interdomain LGTs and, with the notable exception of EGTs involving photosynthetic eukaryotes, we detect few ancient interdomain LGTs. These analyses suggest that LGTs have probably occurred throughout the history of eukaryotes, but that ancient events are not maintained unless they are associated with endosymbiotic gene transfer among photosynthetic lineages.  相似文献   

10.
Chaperonins are oligomeric protein-folding complexes which are divided into two distantly related structural classes. Group I chaperonins (called GroEL/cpn60/hsp60) are found in bacteria and eukaryotic organelles, while group II chaperonins are present in archaea and the cytoplasm of eukaryotes (called CCT/TriC). While archaea possess one to three chaperonin subunit-encoding genes, eight distinct CCT gene families (paralogs) have been characterized in eukaryotes. We are interested in determining when during eukaryotic evolution the multiple gene duplications producing the CCT subunits occurred. We describe the sequence and phylogenetic analysis of five CCT genes from TRICHOMONAS: vaginalis and seven from GIARDIA: lamblia, representatives of amitochondriate protist lineages thought to have diverged early from other eukaryotes. Our data show that the gene duplications producing the eight CCT paralogs took place prior to the organismal divergence of TRICHOMONAS: and GIARDIA: from other eukaryotes. Thus, these divergent protists likely possess completely hetero-oligomeric CCT complexes like those in yeast and mammalian cells. No close phylogenetic relationship between the archaeal chaperonins and specific CCT subunits was observed, suggesting that none of the CCT gene duplications predate the divergence of archaea and eukaryotes. The duplications producing the CCTdelta and CCTepsilon subunits, as well as CCTalpha, CCTbeta, and CCTeta, are the most recent in the CCT gene family. Our analyses show significant differences in the rates of evolution of archaeal chaperonins compared with the eukaryotic CCTs, as well as among the different CCT subunits themselves. We discuss these results in light of current views on the origin, evolution, and function of CCT complexes.  相似文献   

11.
Attempts were made to define the relationship among the three domains (eukaryotes, archaea, and eubacteria) using phylogenetic tree analyses of 16S rRNA sequences as well as of other protein sequences. Since the results are inconsistent, it is implied that the eukaryotic genome has a chimeric structure. In our previous studies, the origin of eukaryotes to be the symbiosis of archaea into eubacteria using the whole open reading frames (ORF) of many genomes was suggested. In these studies, the species participating in the symbiosis were not clarified, and the effect of gene duplication after speciation (in-paralog) was not addressed. To avoid the influence of the in-paralog, we developed a new method to calculate orthologous ORFs. Furthermore, we separated eukaryotic in-paralogs into three groups by sequence similarity to archaea, eubacteria (other than -proteobacteria), and -proteobacteria and treated them as individual organisms. The relationship between the three ORF groups and the functional classification was clarified by this analysis. The introduction of this new method into the phylogenetic tree analysis of 66 organisms (4 eukaryotes, 13 archaea, and 49 eubacteria) based on gene content suggests the symbiosis of pyrococcus into -proteobacteria as the origin of eukaryotes.  相似文献   

12.
The ABC superfamily of genes is one of the largest in the genomes of both bacteria and eukaryotes. The proteins encoded by these genes all carry a characteristic 200- to 250-amino-acid ATP-binding cassette that gives them their family name. In bacteria they are mostly involved in nutrient import, while in eukaryotes many are involved in export. Seven different families have been defined in eukaryotes based on sequence homology, domain topology, and function. While only 6 ABC genes in Dictyostelium discoideum have been studied in detail previously, sequences from the well-advanced Dictyostelium genome project have allowed us to recognize 68 members of this superfamily. They have been classified and compared to animal, plant, and fungal orthologs in order to gain some insight into the evolution of this superfamily. It appears that many of the genes inferred to have been present in the ancestor of the crown organisms duplicated extensively in some but not all phyla, while others were lost in one lineage or the other.  相似文献   

13.
Phytoplasmas are a group of bacteria that are associated with hundreds of plant diseases. Due to their economical importance and the difficulties involved in the experimental study of these obligate pathogens, genome sequencing and comparative analysis have been utilized as powerful tools to understand phytoplasma biology. To date four complete phytoplasma genome sequences have been published. However, these four strains represent limited phylogenetic diversity. In this study, we report the shotgun sequencing and evolutionary analysis of a peanut witches''-broom (PnWB) phytoplasma genome. The availability of this genome provides the first representative of the 16SrII group and substantially improves the taxon sampling to investigate genome evolution. The draft genome assembly contains 13 chromosomal contigs with a total size of 562,473 bp, covering ∼90% of the chromosome. Additionally, a complete plasmid sequence is included. Comparisons among the five available phytoplasma genomes reveal the differentiations in gene content and metabolic capacity. Notably, phylogenetic inferences of the potential mobile units (PMUs) in these genomes indicate that horizontal transfer may have occurred between divergent phytoplasma lineages. Because many effectors are associated with PMUs, the horizontal transfer of these transposon-like elements can contribute to the adaptation and diversification of these pathogens. In summary, the findings from this study highlight the importance of improving taxon sampling when investigating genome evolution. Moreover, the currently available sequences are inadequate to fully characterize the pan-genome of phytoplasmas. Future genome sequencing efforts to expand phylogenetic diversity are essential in improving our understanding of phytoplasma evolution.  相似文献   

14.
We have identified all homologues in the current databases of the ubiquitous protein constituents of the general secretory (Sec) pathway. These prokaryotic/eukaryotic proteins include (1) SecY/Sec61alpha, (2) SecE/Sec61gamma, (3) SecG/Sec61beta, (4) Ffh/SRP54 and (5) FtsY/SRP receptor subunit-alpha. Phylogenetic and sequence analyses lead to major conclusions concerning (1) the ubiquity of these proteins in living organisms, (2) the topological uniformity of some but not other Sec constituents, (3) the orthologous nature of almost all of them, (4) a total lack of paralogues in almost all organisms for which complete genome sequences are available, (5) the occurrence of two or even three paralogues in a few bacteria, plants, and yeast, depending on the Sec constituent, and (6) a tremendous degree of sequence divergence in bacteria compared with that in archaea or eukaryotes. The phylogenetic analyses lead to the conclusion that with a few possible exceptions, the five families of Sec constituents analyzed generally underwent sequence divergence in parallel but at different characteristic rates. The results provide evolutionary insights as well as guides for future functional studies. Because every organism with a fully sequenced genome exhibits at least one orthologue of each of these Sec proteins, we conclude that all living organisms have relied on the Sec system as their primary protein secretory/membrane insertion system. Because most prokaryotes and many eukaryotes encode within their genomes only one of each constituent, we also conclude that strong evolutionary pressure has minimized gene duplication events leading to the establishment of Sec paralogues. Finally, the sequence diversity of bacterial proteins as compared with their archaeal and eukaryotic counterparts is in agreement with the suggestion that bacteria were the evolutionary predecessors of archaea and eukaryotes.  相似文献   

15.
DNA repair in the Archaea is relevant to the consideration of genome maintenance and replication fidelity in the last universal common ancestor (LUCA) from two perspectives. First, these prokaryotes embody a mix of bacterial and eukaryal molecular features. Second, DNA repair proteins would have been essential in LUCA to maintain genome integrity, regardless of the environmental temperature. Yet we know very little of the basic molecular mechanisms of DNA damage and repair in the Archaea in general. Many studies on DNA repair in archaea have been conducted with hyperthermophiles because of the additional stress imposed on their macromolecules by high temperatures. In addition, of the six complete archaeal genome sequences published so far, five are thermophilic archaea. We have recently shown that the hyperthermophile Pyrococcus furiosus has an extraordinarily high capacity for repair of radiation-induced double-strand breaks and we have identified and sequenced several genes involved in DNA repair in P. furiosus. At the sequence level, only a few genes share homology with known bacterial repair genes. For instance, our phylogenetic analysis indicates that archaeal recombinases occur in two paralogous gene families, one of which is very deeply branched, and both recombinases are more closely related to the eukaryotic RAD51 and Dmc1 gene families than to the Escherichia coli recA gene. We have also identified a gene encoding a repair endo/exonuclease in the genomes of several Archaea. The archaeal sequences are highly homologous to those of the eukaryotic Rad2 family and they cluster with genes of the FEN-1 subfamily, which are known to be involved in DNA replication and repair in eukaryotes. We argue that there is a commonality of mechanisms and protein sequences, shared between prokaryotes and eukaryotes for several modes of DNA repair, reflecting diversification from a minimal set of genes thought to represent the genome of the LUCA.  相似文献   

16.
Organelle origins and ribosomal RNA   总被引:8,自引:0,他引:8  
As the detailed molecular biology of organelle genomes has unfolded, there has been a general acceptance of the view that plastids and mitochondria are of endosymbiotic, eubacterial origin. Plastid genes are strikingly similar to their eubacterial (particularly cyanobacterial) counterparts in sequence, organization, and mode of expression, and such features strongly support the hypothesis that the plastid and its genome were derived in evolution from a blue-green alga-like endosymbiont. Mitochondria, on the other hand, are problematic: mitochondrial genes are organized and expressed in remarkably diverse ways in the different major groups of eukaryotes, and in no case are these features particularly characteristic of either bacterial or nuclear genomes. There is, however, clear evidence derived from gene sequence supporting the eubacterial ancestry of mitochondria, and some of the most compelling data have come from analyses of mitochondrial ribosomal RNA (rRNA). Plant mitochondrial rRNA genes diverge in sequence at a particularly slow rate, and these genes have proven to be especially supportive of the endosymbiont hypothesis, pointing to an origin of mitochondria from within the alpha subdivision of the purple bacteria. Ribosomal RNA sequences provide a basis for the construction of global phylogenetic trees that probe the evolutionary history of organelles, and that address the question of whether mitochondria and plastids are monophyletic or polyphyletic in origin. Such studies raise the possibility that the rRNA genes of plant mitochondria originated separately from the mitochondrial rRNA genes of other eukaryotes.  相似文献   

17.
The past decade has seen the determination of complete mitochondrial genome sequences from a taxonomically diverse set of organisms. These data have allowed an unprecedented understanding of the evolution of the mitochondrial genome in terms of gene content and order, as well as genome size and structure. In addition, phylogenetic reconstructions based on mitochondrial DNA (mtDNA)-encoded protein sequences have firmly established the identities of protistan relatives of the animal, fungal and plant lineages. Analysis of the mtDNAs of these protists has provided insight into the structure of the mitochondrial genome at the origin of these three, mainly multicellular, eukaryotic groups. Further research into mtDNAs of taxa ancestral and intermediate to currently characterized organisms will help to refine pathways and modes of mtDNA evolution, as well as provide valuable phylogenetic characters to assist in unraveling the deep branching order of all eukaryotes.  相似文献   

18.
Genomic rearrangements have been studied since the beginnings of modern genetics and models for such rearrangements have been the subject of many papers over the last 10 years. However, none of the extant models can predict the evolution of genomic organization into circular unichromosomal genomes (as in most prokaryotes) and linear multichromosomal genomes (as in most eukaryotes). Very few of these models support gene duplications and losses--yet these events may be more common in evolutionary history than rearrangements and themselves cause apparent rearrangements. We propose a new evolutionary model that integrates gene duplications and losses with genome rearrangements and that leads to genomes with either one (or a very few) circular chromosome or a collection of linear chromosomes. Our model is based on existing rearrangement models and inherits their linear-time algorithms for pairwise distance computation (for rearrangement only). Moreover, our model predictions fit observations about the evolution of gene family sizes and agree with the existing predictions about the growth in the number of chromosomes in eukaryotic genomes.  相似文献   

19.
Little more than 30 years since the discovery of the Archaea, over one hundred archaeal genome sequences are now publicly available, of which ~40% have been released in the last two years. Their analysis provides an increasingly complex picture of archaeal phylogeny and evolution with the proposal of new major phyla, such as the Thaumarchaeota, and important information on the evolution of key central cellular features such as cell division. Insights have been gained into the events and processes in archaeal evolution, with a number of additional and unexpected links to the Eukaryotes revealed. Taken together, these results predict that many more surprises will be found as new archaeal genomes are sequenced.  相似文献   

20.
Genome compaction and stability in microsporidian intracellular parasites   总被引:13,自引:0,他引:13  
Microsporidian genomes are extraordinary among eukaryotes for their extreme reduction: although they are similar in form to other eukaryotic genomes, they are typically smaller than many prokaryotic genomes. At the same time, their rates of sequence evolution are among the highest for eukaryotic organisms. To explore the effects of compaction on nuclear genome evolution, we sequenced 685,000 bp of the Antonospora locustae genome (formerly Nosema locustae) and compared its organization with the recently completed genome of the human parasite Encephalitozoon cuniculi. Despite being very distantly related, the genomes of these two microsporidian species have retained an unexpected degree of synteny: 13% of genes are in the same context, and 30% of the genes were separated by a small number of short rearrangements. Microsporidian genomes are, therefore, paradoxically composed of rapidly evolving sequences harbored within a slowly evolving genome, although these two processes are sometimes considered to be coupled. Microsporidian genomes show that eukaryotic genomes (like genes) do not evolve in a clock-like fashion, and genome stability may result from compaction in addition to a lack of recombination, as has been traditionally thought to occur in bacterial and organelle genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号