首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Insects, the most biodiverse taxonomic group, have high AT content in their mitochondrial genomes. Although codon usage tends to be AT-rich, base composition and codon usage of mitochondrial genomes may vary among taxa. Thus, we compare base composition and codon usage patterns of 49 insect mitochondrial genomes. For protein coding genes, AT content is as high as 80% in the Hymenoptera and Lepidoptera and as low as 72% in the Orthopotera. The AT content is high at positions 1 and 3, but A content is low at position 2. A close correlation occurs between codon usage and tRNA abundance in nuclear genomes. Optimal codons can pair well with the antr codons of the most abundant tRNAs. One tRNA gene translates a synonymous codon family in vertebrate mitochondrial genomes and these tRNA anticodons can pair with optimal codons. However, optimal codons cannot pair with anticodons in mtDNA ofCochiiomyia hominivorax (Dipteral: CaLliphoridae). Ten optimal codons cannot pair with tRNA anticodons in all 49 insect mitochondrial genomes; non-optimal codon-anticodon usage is common and codon usage is not influenced by tRNA abundance.  相似文献   

2.
It is generally believed that the effect of translational selection on codon usage bias is related to the number of transfer RNA genes in bacteria, which is more with respect to the high expression genes than the whole genome. Keeping this in the background, we analyzed codon usage bias with respect to asparagine, isoleucine, phenylalanine, and tyrosine amino acids. Analysis was done in seventeen bacteria with the available gene expression data and information about the tRNA gene number. In most of the bacteria, it was observed that codon usage bias and tRNA gene number were not in agreement, which was unexpected. We extended the study further to 199 bacteria, limiting to the codon usage bias in the two highly expressed genes rpoB and rpoC which encode the RNA polymerase subunits β and β′, respectively. In concordance with the result in the high expression genes, codon usage bias in rpoB and rpoC genes was also found to not be in agreement with tRNA gene number in many of these bacteria. Our study indicates that tRNA gene numbers may not be the sole determining factor for translational selection of codon usage bias in bacterial genomes.  相似文献   

3.
4.
A role for tRNA modifications in genome structure and codon usage   总被引:1,自引:0,他引:1  
Transfer RNA (tRNA) gene content is a differentiating feature of genomes that contributes to the efficiency of the translational apparatus, but the principles shaping tRNA gene copy number and codon composition are poorly understood. Here, we report that the emergence of two specific tRNA modifications shaped the structure and composition of all extant genomes. Through the analysis of more than 500 genomes, we identify two kingdom-specific tRNA modifications as major contributors that separated archaeal, bacterial, and eukaryal genomes in terms of their tRNA gene composition. We show that, contrary to prior observations, genomic codon usage and tRNA gene frequencies correlate in all kingdoms if these two modifications are taken into account and that presence or absence of these modifications explains patterns of gene expression observed in previous studies. Finally, we experimentally demonstrate that human gene expression levels correlate well with genomic codon composition if these identified modifications are considered.  相似文献   

5.
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.  相似文献   

6.
7.
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.  相似文献   

8.
Biological systems are inherently hierarchal and multiscale in time and space. A major challenge of systems biology is to describe biological systems as a computational model, which can be used to derive novel hypothesis and drive experiments leading to new knowledge. The constraint-based reconstruction and analysis approach has been successfully applied to metabolism and to the macromolecular synthesis machinery assembly. Here, we present the first integrated stoichiometric multiscale model of metabolism and macromolecular synthesis for Escherichia coli K12 MG1655, which describes the sequence-specific synthesis and function of almost 2000 gene products at molecular detail. We added linear constraints, which couple enzyme synthesis and catalysis reactions. Comparison with experimental data showed improvement of growth phenotype prediction with the multiscale model over E. coli’s metabolic model alone. Many of the genes covered by this integrated model are well conserved across enterobacters and other, less related bacteria. We addressed the question of whether the bias in synonymous codon usage could affect the growth phenotype and environmental niches that an organism can occupy. We created two classes of in silico strains, one with more biased codon usage and one with more equilibrated codon usage than the wildtype. The reduced growth phenotype in biased strains was caused by tRNA supply shortage, indicating that expansion of tRNA gene content or tRNA codon recognition allow E. coli to respond to changes in codon usage bias. Our analysis suggests that in order to maximize growth and to adapt to new environmental niches, codon usage and tRNA content must co-evolve. These results provide further evidence for the mutation-selection-drift balance theory of codon usage bias. This integrated multiscale reconstruction successfully demonstrates that the constraint-based modeling approach is well suited to whole-cell modeling endeavors.  相似文献   

9.
Among bacteria, we have previously shown that species that are capable of rapid growth have stronger selection on codon usage than slow growing species, and possess higher numbers of rRNA and tRNA genes. This suggests that fast-growers are adapted for fast protein synthesis. There is also considerable evidence that codon usage is influenced by accuracy of translation, and some authors have argued that accuracy is more important than speed. Here we compare the strength of the two effects by studying the codon usages in high and low expression genes and on conserved and variable sites within high expression genes. We introduce a simple statistical method that can be used to assess the significance and the strength of the two types of bias in the same sets of sequences. We compare our statistical measure of codon bias to the common used codon adaptation index, and show that the new measure is preferable for three reasons for the purposes of this analysis. Across a large sample of bacterial genomes, both effects from speed and accuracy are clearly visible, although the speed effect appears to be much stronger than the accuracy effect and is found to be significant in a larger proportion of genomes. It is also difficult to explain the correlation of codon bias in the high expression genes with growth rates and numbers of copies of tRNA and rRNA genes on the basis of selection for accuracy. Hence we conclude that selection for translational speed is a dominant effect in driving codon usage bias in fast-growing bacteria, with selection for accuracy playing a small supplementary role.  相似文献   

10.
Translational selection is responsible for the unequal usage of synonymous codons in protein coding genes in a wide variety of organisms. It is one of the most subtle and pervasive forces of molecular evolution, yet, establishing the underlying causes for its idiosyncratic behaviour across living kingdoms has proven elusive to researchers over the past 20 years. In this study, a statistical model for measuring translational selection in any given genome is developed, and the test is applied to 126 fully sequenced genomes, ranging from archaea to eukaryotes. It is shown that tRNA gene redundancy and genome size are interacting forces that ultimately determine the action of translational selection, and that an optimal genome size exists for which this kind of selection is maximal. Accordingly, genome size also presents upper and lower boundaries beyond which selection on codon usage is not possible. We propose a model where the coevolution of genome size and tRNA genes explains the observed patterns in translational selection in all living organisms. This model finally unifies our understanding of codon usage across prokaryotes and eukaryotes. Helicobacter pylori, Saccharomyces cerevisiae and Homo sapiens are codon usage paradigms that can be better understood under the proposed model.  相似文献   

11.
Mycoplasma bovis is a major pathogen causing arthritis, respiratory disease and mastitis in cattle. A better understanding of its genetic features and evolution might represent evidences of surviving host environments. In this study, multiple factors influencing synonymous codon usage patterns in M. bovis (three strains’ genomes) were analyzed. The overall nucleotide content of genes in the M. bovis genome is AT-rich. Although the G and C contents at the third codon position of genes in the leading strand differ from those in the lagging strand (p<0.05), the 59 synonymous codon usage patterns of genes in the leading strand are highly similar to those in the lagging strand. The over-represented codons and the under-represented codons were identified. A comparison of the synonymous codon usage pattern of M. bovis and cattle (susceptible host) indicated the independent formation of synonymous codon usage of M. bovis. Principal component analysis revealed that (i) strand-specific mutational bias fails to affect the synonymous codon usage pattern in the leading and lagging strands, (ii) mutation pressure from nucleotide content plays a role in shaping the overall codon usage, and (iii) the major trend of synonymous codon usage has a significant correlation with the gene expression level that is estimated by the codon adaptation index. The plot of the effective number of codons against the G+C content at the third codon position also reveals that mutation pressure undoubtedly contributes to the synonymous codon usage pattern of M. bovis. Additionally, the formation of the overall codon usage is determined by certain evolutionary selections for gene function classification (30S protein, 50S protein, transposase, membrane protein, and lipoprotein) and translation elongation region of genes in M. bovis. The information could be helpful in further investigations of evolutionary mechanisms of the Mycoplasma family and heterologous expression of its functionally important proteins.  相似文献   

12.
13.
We sequenced most of the mitochondrial (mt) genomes of 2 apocritan taxa: Vanhornia eucnemidarum and Primeuchroeus spp. These mt genomes have similar nucleotide composition and codon usage to those of mt genomes reported for other Hymenoptera, with a total A + T content of 80.1% and 78.2%, respectively. Gene content corresponds to that of other metazoan mt genomes, but gene organization is not conserved. There are a total of 6 tRNA genes rearranged in V. eucnemidarum and 9 in Primeuchroeus spp. Additionally, several noncoding regions were found in the mt genome of V. eucnemidarum, as well as evidence of a sustained gene duplication involving 3 tRNA genes. We also report an inversion of the large and small ribosomal RNA genes in Primeuchroeus spp. mt genome. However, none of the rearrangements reported are phylogenetically informative with respect to the current taxon sample.  相似文献   

14.
15.
Possessing three circular chromosomes is a distinct genomic characteristic of Burkholderia cenocepacia AU 1054, a clinically important pathogen in cystic fibrosis. In this study, base composition, codon usage and functional role category were analyzed in the B. cenocepacia AU 1054 genome. Although no bias in the base and codon usage was detected between any two chromosomes, function differences did exist in the genes of each chromosome. Similar base composition and differential functional role categories indicated that genes on these three chromosomes were relatively stable and that a proper division of labor was established. Based on variations in the base or codon usage, four small gene clusters were observed in all of the genes. Multivariate analysis revealed that protein hydrophobicity played a predominant role in shaping base usage bias, while horizontal gene transfer and the gene expression level were the two most important factors that affected the codon usage bias. Interestingly, we also found that these gene clusters were correlated with different biological functions: (i) 45 pyrimidine-leading-codon preferred genes were predominantly involved in regulatory function; (ii) most drug resistance-related genes involved in 826 genes that coding for hydrophobic proteins; (iii) most of the 111 horizontal transfer genes were responsible for genomic plasticity; and (iv) 73 highly expressed genes (predicted by their codon adaptation index values) showed environmental adaptation to cystic fibrosis. Our results showed that genes with base or codon usage bias were affected by mutational pressure and natural selection, and their functions could contribute to drug assistance and transmissible activity in B. cenocepacia.  相似文献   

16.
17.
鉴于遗传密码子的简并性能够将基因遗传信息的容量提升,同义密码子使用偏嗜性得以在生物体的基因组中广泛存在。虽然同义密码子之间碱基的变化并不能导致氨基酸种类的改变,在研究mRNA半衰期、编码多肽翻译效率及肽链空间构象正确折叠的准确性和翻译等这一系列过程中发现,同义密码子使用的偏嗜性在某种程度上通过精微调控翻译机制体现其遗传学功能。同义密码子指导tRNA在翻译过程中识别核糖体的速率变化是由氨基酸的特定顺序决定,并且在新生多肽链合成时,蛋白质共翻译转运机制同时调节其空间构象的正确折叠从而保证蛋白的正常生物学功能。某些同义密码子使用偏嗜性与特定蛋白结构的形成具有显著相关性,密码子使用偏嗜性一旦改变将可能导致新生多肽空间构象出现错误折叠。结合近些年来国内外在此领域的研究成果,阐述同义密码子使用偏嗜性如何发挥精微调控翻译的生物学功能与作用。  相似文献   

18.
Aspergillus is a genus of mold fungi that includes more than 200 described species. Many members of the group are relevant pathogens and other species are economically important. Only one species has been analyzed for codon usage, and this was performed with a small number of genes. In this paper, we report the codon usage patterns of eight completely sequenced genomes which belong to this genus. The results suggest that selection for translational efficiency and accuracy are the major factors shaping codon usage in all of the species studied so far, and therefore they were active in the last common ancestor of the group. Composition and molecular distances analyses show that highly expressed genes evolve slower at synonymous sites. We identified a conserved core of translationally optimal codons and study the tRNA gene pool in each genome. We found that the great majority of preferred triplets match the respective cognate tRNA with more copies in the respective genome. We discuss the possible scenarios that can explain the observed differences among the species analyzed. Finally we highlight the biotechnological application of this research regarding heterologous protein expression.  相似文献   

19.
The cichlid fishes of the East African Great Lakes represent a model especially suited to study adaptive radiation and speciation. With several African cichlid genome projects being in progress, a promising set of closely related genomes is emerging, which is expected to serve as a valuable data base to solve questions on genotype-phenotype relations. The mitochondrial (mt) genomes presented here are the first results of the assembly and annotation process for two closely related but eco-morphologically highly distinct Lake Tanganyika cichlids, Petrochromis trewavasae and Tropheus moorii. The genomic sequences comprise 16,588 bp (P. trewavasae) and 16,590 bp (T. moorii), and exhibit the typical mitochondrial structure, with 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, and a non-coding control region. Analyses confirmed that the two species are very closely related with an overall sequence similarity of 96%. We analyzed the newly generated sequences in the phylogenetic context of 21 published labroid fish mitochondrial genomes. Consistent with other vertebrates, the D-loop region was found to evolve faster than protein-coding genes, which in turn are followed by the rRNAs; the tRNAs vary greatly in the rate of sequence evolution, but on average evolve the slowest. Within the group of coding genes, ND6 evolves most rapidly. Codon usage is similar among examined cichlid tribes and labroid families; although a slight shift in usage patterns down the gene tree could be observed. Despite having a clearly different nucleotide composition, ND6 showed a similar codon usage. C-terminal ends of Cox1 exhibit variations, where the varying number of amino acids is related to the structure of the obtained phylogenetic tree. This variation may be of functional relevance for Cox1 synthesis.  相似文献   

20.
Goodarzi H  Torabi N  Najafabadi HS  Archetti M 《Gene》2008,407(1-2):30-41
In the work presented, the changes in codon and amino acid contents have been studied as a function of environmental conditions by comparing pairs of homologs in a group of extremophilic/non-extremophilic genomes. Our results obtained based on such analysis highlights a number of notable observations: (i) the overall preference of amino acid usages in the proteins of a given organism is significantly affected by major environmental factors. The changes in amino acid preferences (amino acid usage profiles) in an extremophile compared to its non-extremophile relative recurs in the organisms of similar extreme habitats. (ii) On the other hand, changes in codon usage preferences in these extremophilic/non-extremophilic pairs, lack such persistency not only in different genome-pairs but also in the individual genes of a specific pair. (iii) We have noted a correlation between cellular function and codon usage profiles of the genes in the studied pairs. (iv) Based on this correlation, we could obtain a decent prediction of cellular functions solely based on codon usage profile data. (v) Comparisons made between two sets of randomly generated genomes suggest that different patterns of codon usage changes in genes of different functional categories result in a partial resistance towards the changes in the concentration of a given amino acid. This buffering capacity might explain the observed differences in codon usage trends in genes of different functions. In the end, we suggest codon usage and amino acid profiles as powerful tools that can be utilized to improve function predictions and genome-environment mappings.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号