首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 234 毫秒
1.
Comparative genomics has revealed that variations in bacterial and archaeal genome DNA sequences cannot be explained by only neutral mutations. Virus resistance and plasmid distribution systems have resulted in changes in bacterial and archaeal genome sequences during evolution. The restriction-modification system, a virus resistance system, leads to avoidance of palindromic DNA sequences in genomes. Clustered, regularly interspaced, short palindromic repeats (CRISPRs) found in genomes represent yet another virus resistance system. Comparative genomics has shown that bacteria and archaea have failed to gain any DNA with GC content higher than the GC content of their chromosomes. Thus, horizontally transferred DNA regions have lower GC content than the host chromosomal DNA does. Some nucleoid-associated proteins bind DNA regions with low GC content and inhibit the expression of genes contained in those regions. This form of gene repression is another type of virus resistance system. On the other hand, bacteria and archaea have used plasmids to gain additional genes. Virus resistance systems influence plasmid distribution. Interestingly, the restriction-modification system and nucleoid-associated protein genes have been distributed via plasmids. Thus, GC content and genomic signatures do not reflect bacterial and archaeal evolutionary relationships.  相似文献   

2.
Since the definition of archaea as a separate domain of life along with bacteria and eukaryotes, they have become one of the most interesting objects of modern microbiology, molecular biology, and biochemistry. Sequencing and analysis of archaeal genomes were especially important for studies on archaea because of a limited availability of genetic tools for the majority of these microorganisms and problems associated with their cultivation. Fifteen years since the publication of the first genome of an archaeon, more than one hundred complete genome sequences of representatives of different phylogenetic groups have been determined. Analysis of these genomes has expanded our knowledge of biology of archaea, their diversity and evolution, and allowed identification and characterization of new deep phylogenetic lineages of archaea. The development of genome technologies has allowed sequencing the genomes of uncultivated archaea directly from enrichment cultures, metagenomic samples, and even from single cells. Insights have been gained into the evolution of key biochemical processes in archaea, such as cell division and DNA replication, the role of horizontal gene transfer in the evolution of archaea, and new relationships between archaea and eukaryotes have been revealed.  相似文献   

3.
钟智  李宏 《生物物理学报》2008,24(5):379-392
以细菌和古菌基因组5′ UTR序列作为研究对象,分析在5′ UTR 的3个不同阅读框架中三联体AUG的分布,发现无论是细菌还是古菌基因组都在阅读框1中有非常明显的AUG缺失(depletion)。AUG的缺失表明在起始密码子上游的AUG很可能会对基因的翻译起始产生影响。分析得知:绝大部分的AUG都是以uORF(upstream open reading frame)的形式出现的,uAUG(upstream AUG)的数量很少,特别是在阅读框1中,而且在细菌基因组的阅读框1中uAUG较多地出现在了含有SD序列的基因上游。比较发现,uAUG引导的序列在同义密码子使用上的偏好性较真正的编码序列差,这可能表明细菌和古菌在同义密码子使用上的偏好性也是决定基因准确地翻译起始的重要因素之一。  相似文献   

4.
Insertion sequences (ISs) can constitute an important component of prokaryotic (bacterial and archaeal) genomes. Over 1,500 individual ISs are included at present in the ISfinder database (www-is.biotoul.fr), and these represent only a small portion of those in the available prokaryotic genome sequences and those that are being discovered in ongoing sequencing projects. In spite of this diversity, the transposition mechanisms of only a few of these ubiquitous mobile genetic elements are known, and these are all restricted to those present in bacteria. This review presents an overview of ISs within the archaeal kingdom. We first provide a general historical summary of the known properties and behaviors of archaeal ISs. We then consider how transposition might be regulated in some cases by small antisense RNAs and by termination codon readthrough. This is followed by an extensive analysis of the IS content in the sequenced archaeal genomes present in the public databases as of June 2006, which provides an overview of their distribution among the major archaeal classes and species. We show that the diversity of archaeal ISs is very great and comparable to that of bacteria. We compare archaeal ISs to known bacterial ISs and find that most are clearly members of families first described for bacteria. Several cases of lateral gene transfer between bacteria and archaea are clearly documented, notably for methanogenic archaea. However, several archaeal ISs do not have bacterial equivalents but can be grouped into Archaea-specific groups or families. In addition to ISs, we identify and list nonautonomous IS-derived elements, such as miniature inverted-repeat transposable elements. Finally, we present a possible scenario for the evolutionary history of ISs in the Archaea.  相似文献   

5.
The origins of modern proteomes   总被引:1,自引:0,他引:1  
Kurland CG  Canbäck B  Berg OG 《Biochimie》2007,89(12):1454-1463
Distributions of phylogenetically related protein domains (fold superfamilies), or FSFs, among the three Superkingdoms (trichotomy) are assessed. Very nearly 900 of the 1200 FSFs of the trichotomy are shared by two or three Superkingdoms. Parsimony analysis of FSF distributions suggests that the FSF complement of the last common ancestor to the trichotomy was more like that of modern eukaryotes than that of archaea and bacteria. Studies of length distributions among members of orthologous families of proteins present in all three Superkingdoms reveal that such lengths are significantly longer among eukaryotes than among bacteria and archaea. The data also reveal that proteins lengths of eukaryotes are more broadly distributed than they are within archaeal and bacterial members of the same orthologous families. Accordingly, selective pressure for a minimal size is significantly greater for orthologous protein lengths in archaea and bacteria than in eukaryotes. Alignments of orthologous proteins of archaea, bacteria and eukaryotes are characterized by greater sequence variation at their N-terminal and C-terminal domains, than in their central cores. Length variations tend to be localized in the terminal sequences; the conserved sequences of orthologous families are localized in a central core. These data are consistent with the interpretation that the genomes of the last common ancestor (LUCA) encoded a cohort of FSFs not very different from that of modern eukaryotes. Divergence of bacterial and archaeal genomes from that common ancestor may have been accompanied by more intensive reductive evolution of proteomes than that expressed in eukaryotes. Dollo's Law suggests that the evolution of novel FSFs is a very slow process, while laboratory experiments suggests that novel protein genesis from preexisting FSFs can be relatively rapid. Reassortment of FSFs to create novel proteins may have been mediated by genetic recombination before the advent of more efficient splicing mechanisms.  相似文献   

6.
Comparative analysis of the complete sequences of seven bacterial and three archaeal genomes leads to the first generalizations of emerging genome-based microbiology. Protein sequences are, generally, highly conserved, with ∼70% of the gene products in bacteria and archaea containing ancient conserved regions. In contrast, there is little conservation of genome organization, except for a few essential operons. The most striking conclusions derived by comparison of multiple genomes from phylogenetically distant species are that the number of universally conserved gene families is very small and that multiple events of horizontal gene transfer and genome fusion are major forces in evolution.  相似文献   

7.
Tailed double-stranded DNA viruses (order Caudovirales) represent the dominant morphotype among viruses infecting bacteria. Analysis and comparison of complete genome sequences of tailed bacterial viruses provided insights into their origin and evolution. Structural and genomic studies have unexpectedly revealed that tailed bacterial viruses are evolutionarily related to eukaryotic herpesviruses. Organisms from the third domain of life, Archaea, are also infected by viruses that, in their overall morphology, resemble tailed viruses of bacteria. However, high-resolution structural information is currently unavailable for any of these viruses, and only a few complete genomes have been sequenced so far. Here we identified nine proviruses that are clearly related to tailed bacterial viruses and integrated into chromosomes of species belonging to four different taxonomic orders of the Archaea. This more than doubled the number of genome sequences available for comparative studies. Our analyses indicate that highly mosaic tailed archaeal virus genomes evolve by homologous and illegitimate recombination with genomes of other viruses, by diversification, and by acquisition of cellular genes. Comparative genomics of these viruses and related proviruses revealed a set of conserved genes encoding putative proteins similar to virion assembly and maturation, as well as genome packaging proteins of tailed bacterial viruses and herpesviruses. Furthermore, fold prediction and structural modeling experiments suggest that the major capsid proteins of tailed archaeal viruses adopt the same topology as the corresponding proteins of tailed bacterial viruses and eukaryotic herpesviruses. Data presented in this study strongly support the hypothesis that tailed viruses infecting archaea share a common ancestry with tailed bacterial viruses and herpesviruses.  相似文献   

8.
Little more than 30 years since the discovery of the Archaea, over one hundred archaeal genome sequences are now publicly available, of which ~40% have been released in the last two years. Their analysis provides an increasingly complex picture of archaeal phylogeny and evolution with the proposal of new major phyla, such as the Thaumarchaeota, and important information on the evolution of key central cellular features such as cell division. Insights have been gained into the events and processes in archaeal evolution, with a number of additional and unexpected links to the Eukaryotes revealed. Taken together, these results predict that many more surprises will be found as new archaeal genomes are sequenced.  相似文献   

9.
In search of RNase P RNA from microbial genomes   总被引:2,自引:0,他引:2       下载免费PDF全文
Li Y  Altman S 《RNA (New York, N.Y.)》2004,10(10):1533-1540
A simple procedure has been developed to quickly retrieve and validate the DNA sequence encoding the RNA subunit of ribonuclease P (RNase P RNA) from microbial genomes. RNase P RNA sequences were identified from 94% of bacterial and archaeal complete genomes where previously no RNase P RNA was annotated. A sequence was found in camelpox virus, highly conserved in all orthopoxviruses (including smallpox virus), which could fold into a putative RNase P RNA in terms of conserved primary features and secondary structure. New structure features of RNase P RNA that enable one to distinguish bacteria from archaea and eukarya were found. This RNA is yet another RNA that can be a molecular criterion to divide the living world into three domains (bacteria, archaea, and eukarya). The catalytic center of this RNA, and its detection from some environmental whole genome shotgun sequences, is also discussed.  相似文献   

10.
Xiaohui C  Jin W 《Gene》2004,327(1):75-79
Searching for unique features of archaeal genome may shed light on the mechanism of gene regulation in primitive life forms. Statistical analysis of ATG frequency on the complete genome sequences of 16 archaea, 20 bacteria and 2 eukaryotes revealed that most of the archaeal genomes have a remarkably high ATG frequency at the position of nine nucleotide (nt) downstream of the translation initiation site (the first nucleotide of the translation initiation codon is designated as 0). To understand the role of this unique ATG in archaea, we further analyzed the ATG-initiated genes and non-ATG-initiated genes separately, and the results indicated that only the non-ATG-initiated genes contribute to the high ATG frequency at position +9. This led us to speculate that the in-frame ATG at +9 may serve as a remedial initiation site for archaea in case of initiation failure at the regular site. In addition, it seems that this phenomenon does not result from the harsh environment that archaea are usually viable according to the fact that no considerably high ATG frequency at +9 was observed in all the four thermophilic bacteria that also live in harsh environment. We proposed that the high ATG frequency at position +9 might reflect the decreased efficiency of the translation initiation machinery in archaea. Since archaea evolve very slowly, this unique characteristic of high ATG frequency at position +9 may present the primitive state of the Universal Ancestor.  相似文献   

11.
Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www. ncbi.nlm. nih.gov/COG). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56-83% of the gene products from each of the complete bacterial and archaeal genomes and approximately 35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes.  相似文献   

12.
Meyer TE  Bansal AK 《Biochemistry》2005,44(34):11458-11465
Based largely upon analysis of ribosomal RNA, a third domain of life, called archaea, had been proposed in addition to bacteria and eukaryotes. However, quantitative analysis of 73 whole genomes shows only a two-domain division of life: into eukaryotes and prokaryotes. Thousands of orthologous genes in archaea and bacteria show an essentially unimodal distribution of sequence identities. Thus, whole genome analyses indicate that archaea are a phylum of bacteria rather than a separate domain of life. In contrast, archaeal rRNA and that of hyperthermophilic bacteria differ from the rRNA of mesophilic bacteria. Thus, there is a bimodal distribution of rRNA sequence identities which differ by 12%. This discrepancy in rRNA and gene content based analyses of whole genomes is likely due to a 15% elevated C:G content of the rRNA of archaea and hyperthermophilic bacteria. The elevated C:G content is consistent with stabilization against thermal denaturation caused by additional hydrogen bonding (3 bonds) in C:G pairs compared to A:U pairs (2 bonds). Based upon this premise, there is no reliable way to correct rRNA for such differences in base composition and it is not possible to quantitatively compare hyperthermophiles with mesophiles by the rRNA method. Furthermore, quantitative study of whole genomes shows that the extent of change in both bacterial and archaeal genes, including rRNA, has reached a limit. Thus, direct sequence comparisons work with closely related genomes, but it is not possible to differentiate the most divergent prokaryotic species, which are currently designated as separate phyla. We believe that the differences in characteristics of archaeal species is based primarily upon selection of genes and pathways compatible with the extreme environmental lifestyle, i.e., hyperthermophily.  相似文献   

13.
Studies of neutrally evolving sequences suggest that differences in eukaryotic genome sizes result from different rates of DNA loss. However, very few pseudogenes have been identified in microbial species, and the processes whereby genes and genomes deteriorate in bacteria remain largely unresolved. The typhus-causing agent, Rickettsia prowazekii, is exceptional in that as much as 24% of its 1.1-Mb genome consists of noncoding DNA and pseudogenes. To test the hypothesis that the noncoding DNA in the R. prowazekii genome represents degraded remnants of ancestral genes, we systematically examined all of the identified pseudogenes and their flanking sequences in three additional Rickettsia species. Consistent with the hypothesis, we observe sequence similarities between genes and pseudogenes in one species and intergenic DNA in another species. We show that the frequencies and average sizes of deletions are larger than insertions in neutrally evolving pseudogene sequences. Our results suggest that inactivated genetic material in the Rickettsia genomes deteriorates spontaneously due to a mutation bias for deletions and that the noncoding sequences represent DNA in the final stages of this degenerative process.  相似文献   

14.
The Z curve database: a graphic representation of genome sequences   总被引:7,自引:0,他引:7  
MOTIVATION: Genome projects for many prokaryotic and eukaryotic species have been completed and more new genome projects are being underway currently. The availability of a large number of genomic sequences for researchers creates a need to find graphic tools to study genomes in a perceivable form. The Z curve is one of such tools available for visualizing genomes. The Z curve is a unique three-dimensional curve representation for a given DNA sequence in the sense that each can be uniquely reconstructed given the other. The Z curve database for more than 1000 genomes have been established here. RESULTS: The database contains the Z curves for archaea, bacteria, eukaryota, organelles, phages, plasmids, viroids and viruses, whose genomic sequences are currently available. All the 3-dimensional Z curves and their three component curves are stored in the database. The applications of the Z curve database on comparative genomics, gene prediction, computation of G+C content with a windowless technique, prediction of replication origins and terminations of bacterial and archaeal genomes and study of local deviations from the Chargaff Parity Rule 2 etc. are presented in detail. The Z curve database reported here is a treasure trove in which biologists could find useful biological knowledge.  相似文献   

15.
16.
Recognizing the pseudogenes in bacterial genomes   总被引:9,自引:0,他引:9  
Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.  相似文献   

17.
Recognition of protein-coding genes, a classical bioinformatics issue, is an absolutely needed step for annotating newly sequenced genomes. The Z-curve algorithm, as one of the most effective methods on this issue, has been successfully applied in annotating or re-annotating many genomes, including those of bacteria, archaea and viruses. Two Z-curve based ab initio gene-finding programs have been developed: ZCURVE (for bacteria and archaea) and ZCURVE_V (for viruses and phages). ZCURVE_C (for 57 bacteria) and Zfisher (for any bacterium) are web servers for re-annotation of bacterial and archaeal genomes. The above four tools can be used for genome annotation or re-annotation, either independently or combined with the other gene-finding programs. In addition to recognizing protein-coding genes and exons, Z-curve algorithms are also effective in recognizing promoters and translation start sites. Here, we summarize the applications of Z-curve algorithms in gene finding and genome annotation.  相似文献   

18.
19.
The first bacterial genome was sequenced in 1995, and the first archaeal genome in 1996. Soon after these breakthroughs, an exponential rate of genome sequencing was established, with a doubling time of approximately 20 months for bacteria and approximately 34 months for archaea. Comparative analysis of the hundreds of sequenced bacterial and dozens of archaeal genomes leads to several generalizations on the principles of genome organization and evolution. A crucial finding that enables functional characterization of the sequenced genomes and evolutionary reconstruction is that the majority of archaeal and bacterial genes have conserved orthologs in other, often, distant organisms. However, comparative genomics also shows that horizontal gene transfer (HGT) is a dominant force of prokaryotic evolution, along with the loss of genetic material resulting in genome contraction. A crucial component of the prokaryotic world is the mobilome, the enormous collection of viruses, plasmids and other selfish elements, which are in constant exchange with more stable chromosomes and serve as HGT vehicles. Thus, the prokaryotic genome space is a tightly connected, although compartmentalized, network, a novel notion that undermines the ‘Tree of Life’ model of evolution and requires a new conceptual framework and tools for the study of prokaryotic evolution.  相似文献   

20.
Whereas the process of DNA replication is fundamentally conserved in the three domains of life, the archaeal system is closer to that of eukarya than bacteria. In the time since the complete genome sequences of several members of the archaeal domain became available, there has been a burst of research on archaeal DNA replication. These studies have led to both expected and surprising findings. This review summarizes the search for origins of replication in archaea, and our current knowledge of initiation, the process by which replication origins are recognized, the DNA molecule is unwound and the replicative helicase is loaded onto the DNA in preparation for DNA synthesis. The similarities and differences of the initiation process in archea, bacteria and eukarya are also summarized.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号