首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
The genetic code provides the translation table necessary to transform the information contained in DNA into the language of proteins. In this table, a correspondence between each codon and each amino acid is established: tRNA is the main adaptor that links the two. Although the genetic code is nearly universal, several variants of this code have been described in a wide range of nuclear and organellar systems, especially in metazoan mitochondria. These variants are generally found by searching for conserved positions that consistently code for a specific alternative amino acid in a new species. We have devised an accurate computational method to automate these comparisons, and have tested it with 626 metazoan mitochondrial genomes. Our results indicate that several arthropods have a new genetic code and translate the codon AGG as lysine instead of serine (as in the invertebrate mitochondrial genetic code) or arginine (as in the standard genetic code). We have investigated the evolution of the genetic code in the arthropods and found several events of parallel evolution in which the AGG codon was reassigned between serine and lysine. Our analyses also revealed correlated evolution between the arthropod genetic codes and the tRNA-Lys/-Ser, which show specific point mutations at the anticodons. These rather simple mutations, together with a low usage of the AGG codon, might explain the recurrence of the AGG reassignments.  相似文献   

3.
The organization of the canonical genetic code needs to be thoroughly illuminated. Here we reorder the four nucleotides-adenine, thymine, guanine and cytosine-according to their emergence in evolution, and apply the organizational rules to devising an algebraic representation for the canonical genetic code. Under a framework of the devised code, we quantify codon and amino acid usages from a large collection of 917 prokaryotic genome sequences, and associate the usages with its intrinsic structure and classification schemes as well as amino acid physicochemical properties. Our results show that the algebraic representation of the code is structurally equivalent to a content-centric organization of the code and that codon and amino acid usages under different classification schemes were correlated closely with GC content, implying a set of rules governing composition dynamics across a wide variety of prokaryotic genome sequences. These results also indicate that codons and amino acids are not randomly allocated in the code, where the six-fold degenerate codons and their amino acids have important balancing roles for error minimization. Therefore, the content-centric code is of great usefulness in deciphering its hitherto unknown regularities as well as the dynamics of nucleotide, codon, and amino acid compositions.  相似文献   

4.
Multiple synonymous codons code for the same amino acid, resulting in the degeneracy of the genetic code and in the preferred used of some codons called codon bias usage (CBU). We performed a large-scale analysis of codon usage bias analysing the distribution of the codon adaptation index (CAI) and the codon relative adaptiveness index (RA) in 4868 bacterial genomes. We found that CAI values differ significantly between protein functional domains and part of the protein outside domains and show how CAI, GC content and preferred usage of polymerase III alpha subunits are related. Additionally, we give evidence of the association between CAI and bacterial phenotypes.  相似文献   

5.
The codon table for the canonical genetic code can be rearranged in such a way that the code is divided into four quarters and two halves according to the variability of their GC and purine contents, respectively. For prokaryotic genomes, when the genomic GC content increases, their amino acid contents tend to be restricted to the GC-rich quarter and the purine-content insensitive half, where all codons are fourfold degenerate and relatively mutation-tolerant. Conversely, when the genomic GC content decreases, most of the codons retract to the AUrich quarter and the purine-content sensitive half; most of the codons not only remain encoding physicochemically diversified amino acids but also vary when transversion (between purine and pyrimidine) happens. Amino acids with sixfolddegenerate codons are distributed into all four quarters and across the two halves; their fourfold-degenerate codons are all partitioned into the purine-insensitive half in favorite of robustness against mutations. The features manifested in the rearranged codon table explain most of the intrinsic relationship between protein coding sequences (the informational content) and amino acid compositions (the functional content). The renovated codon table is useful in predicting abundant amino acids and positioning the amino acids with related or distinct physicochemical properties.  相似文献   

6.
In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core "house-keeping" functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables-GC and purine contents-of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern-the symmetric pattern-where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes.  相似文献   

7.
Aminoacyl-tRNA synthetases (aaRSs) are responsible for creating the pool of correctly charged aminoacyl-tRNAs that are necessary for the translation of genetic information (mRNA) by the ribosome. Each aaRS belongs to either one of only two classes with two different mechanisms of aminoacylation, making use of either the 2'OH (Class I) or the 3'OH (Class II) of the terminal A76 of the tRNA and approaching the tRNA either from the minor groove (2'OH) or the major groove (3'OH). Here, an asymmetric pattern typical of differentiation is uncovered in the partition of the codon repertoire, as defined by the mechanism of aminoacylation of each corresponding tRNA. This pattern can be reproduced in a unique cascade of successive binary decisions that progressively reduces codon ambiguity. The deduced order of differentiation is manifestly driven by the reduction of translation errors. A simple rule can be defined, decoding each codon sequence in its binary class, thereby providing both the code and the key to decode it. Assuming that the partition into two mechanisms of tRNA aminoacylation is a relic that dates back to the invention of the genetic code in the RNA World, a model for the assignment of amino acids in the codon table can be derived. The model implies that the stop codon was always there, as the codon whose tRNA cannot be charged with any amino acid, and makes the prediction of an ultimate differentiation step, which is found to correspond to the codon assignment of the 22nd amino acid pyrrolysine in archaebacteria.  相似文献   

8.
The standard codon table is a primary tool for basic understanding of molecular biology. In the minds of many, the table’s orderly arrangement of bases and amino acids is synonymous with the true genetic code, i.e., the biological coding principle itself. However, developments in the field reveal a much more complex and interesting picture. In this article, we review the traditional codon table and its limitations in light of the true complexity of the genetic code. We suggest the codon table be brought up to date and, as a step, we present a novel superposition of the BLOSUM62 matrix and an allowed point mutation matrix. This superposition depicts an important aspect of the true genetic code—its ability to tolerate mutations and mistranslations.  相似文献   

9.
The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by ‘quartets’ of codons with four-fold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If two-fold degeneracy is postulated for the first position of the codon, there could have been 10 amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutamic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an ‘intruder’ into the genetic code, and that it may have displaced another amino acid such as ornithine, or may even have displaced lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons. The introduction of tRNA into protein synthesis may have been a cataclysmic and comparatively sudden event, since duplication of tRNA takes place readily, and point mutations could rapidly differentiate members of the family of duplicates from each. Two tRNAs for different amino acids may have a common ancestor that existed more recently than the separation of the prokaryotes and eukaryotes. This is shown by homology of twoE. coli tRNAs for glycine and valine, and two yeast tRNAs for arginine and lysine.  相似文献   

10.
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon-anticodon interaction, all consistent with more efficient translation.  相似文献   

11.
Since the genetic code first was determined, many have claimed that it is organized adaptively, so as to assign similar codons to similar amino acids. This claim has proved difficult to establish due to the absence of relevant comparative data on alternative primordial codes and of objective measures of amino acid exchangeability. Here we use a recently developed measure of exchangeability to evaluate a null hypothesis and two alternative hypotheses about the adaptiveness of the genetic code. The null hypothesis that there is no tendency for exchangeable amino acids to be assigned to similar codons can be excluded here as expected from earlier work. The first alternative hypothesis is that any such correlation between codon distance and amino acid distance is due to incremental mechanisms of code evolution, and not to adaptation to reduce deleterious effects of future mutations. More specifically, new codon assignments that occur by ambiguity reduction or by codon capture will tend to give rise to correlations, whether due to the condition of amino acid ambiguity, or to the condition of similarity between a new tRNA synthetase (or tRNA) and its parent. The second alternative hypothesis, the adaptive hypothesis, then may be defined as an excess relative to what may be expected given the incremental nature of evolution, reflecting true adaptation for robustness rather than an incidental effect. The results reported here indicate that most of the nonrandomness in the amino acids to codon assignments can be explained by incremental code evolution, with a small residue of orderliness that may reflect code adaptation.  相似文献   

12.
We propose that glycine was the first amino acid to be incorporated into the genetic code, followed by serine, aspartic and/or glutamic acid—small hydrophilic amino acids that all have codons in the bottom right-hand corner of the standard genetic code table. Because primordial ribosomal synthesis is presumed to have been rudimentary, this stage would have been characterized by the synthesis of short, water-soluble peptides, the first of which would have comprised polyglycine. Evolution of the code is proposed to have occurred by the duplication and mutation of tRNA sequences, which produced a radiation of codon assignment outwards from the bottom right-hand corner. As a result of this expansion, we propose a trend from small hydrophilic to hydrophobic amino acids, with selection for longer polypeptides requiring a hydrophobic core for folding and stability driving the incorporation of hydrophobic amino acids into the code.  相似文献   

13.
New insights into the arrangement of the genetic code table, based on the analysis of the physico-chemical properties of its molecular constituents, are reported in this paper. It will be demonstrated that the code has a twofold symmetry that is not apparent from the conventional code table, but becomes apparent when the codon-anticodon energies are listed for each triplet. The evolutionary development of the current code based on single base replacement mutations (transitions) from an 'iso-energetic' degenerated subset of 16 of the 64 codons is discussed. The energy landscape of all 64 codons is presented. A detailed analysis of the energy changes due to mutations in the 3rd, 1st or 2nd position of a codon reveals that the modern genetic code is highly robust. Changes come in small discrete steps that can be quantified in relation to the thermal noise of the system. The relation of the individual codon to its neighbours in the rearranged codon table can be completely understood based on thermodynamic considerations.  相似文献   

14.
Understanding how codons became associated with their specific amino acids is fundamental to deriving a theory for the origin of the genetic code. Carl Woese and coworkers designed a series of experiments to test associations between amino acids and nucleobases that may have played a role in establishing the genetic code. Through these experiments it was found that a property of amino acids called the polar requirement (PR) is correlated with the organization of the codon table. No other property of amino acids has been found that correlates with the codon table as well as PR, indicating that PR is uniquely related to the modern genetic code. Using molecular dynamics simulations of amino acids in solutions of water and dimethylpyridine used to experimentally measure PR, we show that variations in the partitioning between the two phases as described by radial distribution functions correlate well with the measured PRs. Partition coefficients based on probability densities of the amino acids in each phase have the linear behavior with base concentration as suggested by PR experiments.  相似文献   

15.
鉴于遗传密码子的简并性能够将基因遗传信息的容量提升,同义密码子使用偏嗜性得以在生物体的基因组中广泛存在。虽然同义密码子之间碱基的变化并不能导致氨基酸种类的改变,在研究mRNA半衰期、编码多肽翻译效率及肽链空间构象正确折叠的准确性和翻译等这一系列过程中发现,同义密码子使用的偏嗜性在某种程度上通过精微调控翻译机制体现其遗传学功能。同义密码子指导tRNA在翻译过程中识别核糖体的速率变化是由氨基酸的特定顺序决定,并且在新生多肽链合成时,蛋白质共翻译转运机制同时调节其空间构象的正确折叠从而保证蛋白的正常生物学功能。某些同义密码子使用偏嗜性与特定蛋白结构的形成具有显著相关性,密码子使用偏嗜性一旦改变将可能导致新生多肽空间构象出现错误折叠。结合近些年来国内外在此领域的研究成果,阐述同义密码子使用偏嗜性如何发挥精微调控翻译的生物学功能与作用。  相似文献   

16.
The high conservation of the genetic code and its fundamental role in genome decoding suggest that its evolution is highly restricted or even frozen. However, various prokaryotic and eukaryotic genetic code alterations, several alternative tRNA-dependent amino acid biosynthesis pathways, regulation of tRNA decoding by diverse nucleoside modifications and recent in vivo incorporation of non-natural amino acids into prokaryotic and eukaryotic proteins, show that the code evolves and is surprisingly flexible. The cellular mechanisms and the proteome buffering capacity that support such evolutionary processes remain unclear. Here we explore the hypothesis that codon misreading and reassignment played fundamental roles in the development of the genetic code and we show how a fungal codon reassignment is enlightening its evolution.  相似文献   

17.
18.
The coevolution theory of the genetic code, which postulates that prebiotic synthesis was an inadequate source of all twenty protein amino acids, and therefore some of them had to be derived from the coevolving pathways of amino acid biosynthesis, has been assessed in the light of the discoveries of the past three decades. Its four fundamental tenets regarding the essentiality of amino acid biosynthesis, role of pretran synthesis, biosynthetic imprint on codon allocations and mutability of the encoded amino acids are proven by the new knowledge. Of the factors that guided the evolutionary selection of the universal code, the relative contributions of Amino Acid Biosynthesis: Error Minimization: Stereochemical Interaction are estimated to first approximation as 40,000,000:400:1, which suggests that amino acid biosynthesis represents the dominant factor shaping the code. The utility of the coevolution theory is demonstrated by its opening up experimental expansions of the code and providing a basis for locating the root of life.  相似文献   

19.
In the past, 2 kinds of Markov models have been considered to describe protein sequence evolution. Codon-level models have been mechanistic with a small number of parameters designed to take into account features, such as transition-transversion bias, codon frequency bias, and synonymous-nonsynonymous amino acid substitution bias. Amino acid models have been empirical, attempting to summarize the replacement patterns observed in large quantities of data and not explicitly considering the distinct factors that shape protein evolution. We have estimated the first empirical codon model (ECM). Previous codon models assume that protein evolution proceeds only by successive single nucleotide substitutions, but our results indicate that model accuracy is significantly improved by incorporating instantaneous doublet and triplet changes. We also find that the affiliations between codons, the amino acid each encodes and the physicochemical properties of the amino acids are main factors driving the process of codon evolution. Neither multiple nucleotide changes nor the strong influence of the genetic code nor amino acids' physicochemical properties form a part of standard mechanistic models and their views of how codon evolution proceeds. We have implemented the ECM for likelihood-based phylogenetic analysis, and an assessment of its ability to describe protein evolution shows that it consistently outperforms comparable mechanistic codon models. We point out the biological interpretation of our ECM and possible consequences for studies of selection.  相似文献   

20.
Fuglsang A 《Gene》2008,410(1):82-88
The effective number of codons (Nc) used in a gene is one of the most commonly used measures of synonymous codon usage bias, owing much of its popularity to the fact that it is species independent and that simulation studies have shown that it is less dependent of gene length than other measures. In this paper I provide a clear and practically meaningful definition of bias discrepancy (BD; when the degree of codon bias varies within a degeneracy class). Moreover I evaluate the impact of BD and amino acid usage on estimates of Nc. It is shown that both factors have a significant effect on accuracy and precision. Both amino acid usage and BD influence accuracy considerably, especially in short genes. Finally, I demonstrate how the definition of bias discrepancy can be applied to investigate if codon usage is influenced by selection and I discuss this test in relation to the incongruous literature that exists for Buchnera sp. APS and Borrelia burgdorferi.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号