首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Studies on the origin of the genetic code compare measures of the degree of error minimization of the standard code with measures produced by random variant codes but do not take into account codon usage, which was probably highly biased during the origin of the code. Codon usage bias could play an important role in the minimization of the chemical distances between amino acids because the importance of errors depends also on the frequency of the different codons. Here I show that when codon usage is taken into account, the degree of error minimization of the standard code may be dramatically reduced, and shifting to alternative codes often increases the degree of error minimization. This is especially true with a high CG content, which was probably the case during the origin of the code. I also show that the frequency of codes that perform better than the standard code, in terms of relative efficiency, is much higher in the neighborhood of the standard code itself, even when not considering codon usage bias; therefore alternative codes that differ only slightly from the standard code are more likely to evolve than some previous analyses suggested. My conclusions are that the standard genetic code is far from being an optimum with respect to error minimization and must have arisen for reasons other than error minimization.[Reviewing Editor: Martin Kreitman]  相似文献   

2.
Selection on Codon Usage for Error Minimization at the Protein Level   总被引:1,自引:0,他引:1  
Given the structure of the genetic code, synonymous codons differ in their capacity to minimize the effects of errors due to mutation or mistranslation. I suggest that this may lead, in protein-coding genes, to a preference for codons that minimize the impact of errors at the protein level. I develop a theoretical measure of error minimization for each codon, based on amino acid similarity. This measure is used to calculate the degree of error minimization for 82 genes of Drosophila melanogaster and 432 rodent genes and to study its relationship with CG content, the degree of codon usage bias, and the rate of nucleotide substitution. I show that (i) Drosophila and rodent genes tend to prefer codons that minimize errors; (ii) this cannot be merely the effect of mutation bias; (iii) the degree of error minimization is correlated with the degree of codon usage bias; (iv) the amino acids that contribute more to codon usage bias are the ones for which synonymous codons differ more in the capacity to minimize errors; and (v) the degree of error minimization is correlated with the rate of nonsynonymous substitution. These results suggest that natural selection for error minimization at the protein level plays a role in the evolution of coding sequences in Drosophila and rodents.Reviewing Editor: Dr. Massimo Di Giulio  相似文献   

3.
4.
The canonical genetic code has been reported both to be error minimizing and to show stereochemical associations between coding triplets and binding sites. In order to test whether these two properties are unexpectedly overlapping, we generated 200,000 randomized genetic codes using each of five randomization schemes, with and without randomization of stop codons. Comparison of the code error (difference in polar requirement for single-nucleotide codon interchanges) with the coding triplet concentrations in RNA binding sites for eight amino acids shows that these properties are independent and uncorrelated. Thus, one is not the result of the other, and error minimization and triplet associations probably arose independently during the history of the genetic code. We explicitly show that prior fixation of a stereochemical core is consistent with an effective later minimization of error. [Reviewing Editor : Dr. Stephen Freeland]  相似文献   

5.
Abstract The influence of local base composition on mutations in chloroplast DNA (cpDNA) is studied in detail and the resulting, empirically derived, mutation dynamics are used to analyze both base composition and codon usage bias. A 4 × 4 substitution matrix is generated for each of the 16 possible flanking base combinations (contexts) using 17,253 noncoding sites, 1309 of which are variable, from an alignment of three complete grass chloroplast genome sequences. It is shown that substitution bias at these sites is correlated with flanking base composition and that the A+T content of these flanking sites as well as the number of flanking pyrimidines on the same strand appears to have general influences on substitution properties. The context-dependent equilibrium base frequencies predicted from these matrices are then applied to two analyses. The first examines whether or not context dependency of mutations is sufficient to generate average compositional differences between noncoding cpDNA and silent sites of coding sequences. It is found that these two classes of sites exist, on average, in very different contexts and that the observed mutation dynamics are expected to generate significant differences in overall composition bias that are similar to the differences observed in cpDNA. Context dependency, however, cannot account for all of the observed differences: although silent sites in coding regions appear to be at the equilibrium predicted, noncoding cpDNA has a significantly lower A+T content than expected from its own substitution dynamics, possibly due to the influence of indels. The second study examines the codon usage of low-expression chloroplast genes. When context is accounted for, codon usage is very similar to what is predicted by the substitution dynamics of noncoding cpDNA. However, certain codon groups show significant deviation when followed by a purine in a manner suggesting some form of weak selection other than translation efficiency. Overall, the findings indicate that a full understanding of mutational dynamics is critical to understanding the role selection plays in generating composition bias and sequence structure.  相似文献   

6.
Statistical and biochemical studies of the genetic code have found evidence of nonrandom patterns in the distribution of codon assignments. It has, for example, been shown that the code minimizes the effects of point mutation or mistranslation: erroneous codons are either synonymous or code for an amino acid with chemical properties very similar to those of the one that would have been present had the error not occurred. This work has suggested that the second base of codons is less efficient in this respect, by about three orders of magnitude, than the first and third bases. These results are based on the assumption that all forms of error at all bases are equally likely. We extend this work to investigate (1) the effect of weighting transition errors differently from transversion errors and (2) the effect of weighting each base differently, depending on reported mistranslation biases. We find that if the bias affects all codon positions equally, as might be expected were the code adapted to a mutational environment with transition/transversion bias, then any reasonable transition/transversion bias increases the relative efficiency of the second base by an order of magnitude. In addition, if we employ weightings to allow for biases in translation, then only 1 in every million random alternative codes generated is more efficient than the natural code. We thus conclude not only that the natural genetic code is extremely efficient at minimizing the effects of errors, but also that its structure reflects biases in these errors, as might be expected were the code the product of selection. Received: 25 July 1997 / Accepted: 9 January 1998  相似文献   

7.
Distances between amino acids were derived from the polar requirement measure of amino acid polarity and Benner and co-workers' (1994) 74-100 PAM matrix. These distances were used to examine the average effects of amino acid substitutions due to single-base errors in the standard genetic code and equally degenerate randomized variants of the standard code. Second-position transitions conserved all distances on average, an order of magnitude more than did second-position transversions. In contrast, first-position transitions and transversions were about equally conservative. In comparison with randomized codes, second-position transitions in the standard code significantly conserved mean square differences in polar requirement and mean Benner matrix-based distances, but mean absolute value differences in polar requirement were not significantly conserved. The discrepancy suggests that these commonly used distance measures may be insufficient for strict hypothesis testing without more information. The translational consequences of single-base errors were then examined in different codon contexts, and similarities between these contexts explored with a hierarchical cluster analysis. In one cluster of codon contexts corresponding to the RNY and GNR codons, second-position transversions between C and G and transitions between C and U were most conservative of both polar requirement and the matrix-based distance. In another cluster of codon contexts, second-position transitions between A and G were most conservative. Despite the claims of previous authors to the contrary, it is shown theoretically that the standard code may have been shaped by position-invariant forces such as mutation and base content. These forces may have left heterogeneous signatures in the code because of differences in translational fidelity by codon position. A scenario for the origin of the code is presented wherein selection for error minimization could have occurred multiple times in disjoint parts of the code through a phyletic process of competition between lineages. This process permits error minimization without the disruption of previously useful messages, and does not predict that the code is optimally error-minimizing with respect to modern error. Instead, the code may be a record of genetic process and patterns of mutation before the radiation of modern organisms and organelles. Received: 28 July 1997 / Accepted: 23 January 1998  相似文献   

8.
为了分析鲨烯合酶(squalene synthase, SS)基因密码子的使用方式及其影响因素,利用codon W和SPSS 16.0软件对47条来自不同物种的SS基因进行多元统计分析、对应性分析.SS基因密码子1~3位碱基的GC含量(GC1, GC2和GC3)依次为51.33%、34.65%和54.37%,3个位点的GC含量均呈极显著相关关系(p<0.01),对应性分析的结果表明,第1轴显示30.71%的差异,有效密码子数和GC3、GC1和GC2的均值与GC3之间的相关性均达极显著水平(p<0.01).筛选出的26个最优密码子的第3位碱基均为G或C.以MEGA 5.0构建的基于SS蛋白质序列的进化树比基于RSCU的聚类更符合传统的系统发育观点.SS基因密码子偏好以G/C结尾,使用模式受选择和突变影响,突变对密码子偏好影响较大.  相似文献   

9.
In this study we reconstruct the evolution of codon usage bias in the chloroplast gene rbcL using a phylogeny of 92 green-plant taxa. We employ a measure of codon usage bias that accounts for chloroplast genomic nucleotide content, as an attempt to limit plausible explanations for patterns of codon bias evolution to selection- or drift-based processes. This measure uses maximum likelihood-ratio tests to compare the performance of two models, one in which a single codon is overrepresented and one in which two codons are overrepresented. The measure allowed us to analyze both the extent of bias in each lineage and the evolution of codon choice across the phylogeny. Despite predictions based primarily on the low G+C content of the chloroplast and the high functional importance of rbcL, we found large differences in the extent of bias, suggesting differential molecular selection that is clade specific. The seed plants and simple leafy liverworts each independently derived a low level of bias in rbcL, perhaps indicating relaxed selectional constraint on molecular changes in the gene. Overrepresentation of a single codon was typically plesiomorphic, and transitions to overrepresentation of two codons occurred commonly across the phylogeny, possibly indicating biochemical selection. The total codon bias in each taxon, when regressed against the total bias of each amino acid, suggested that twofold amino acids play a strong role in inflating the level of codon usage bias in rbcL, despite the fact that twofolds compose a minority of residues in this gene. Those amino acids that contributed most to the total codon usage bias of each taxon are known through amino acid knockout and replacement to be of high functional importance. This suggests that codon usage bias may be constrained by particular amino acids and, thus, may serve as a good predictor of what residues are most important for protein fitness. Present address (Joshua T. Herbeck): JBP Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA  相似文献   

10.
The Case for an Error Minimizing Standard Genetic Code   总被引:1,自引:1,他引:0  
Since discovering the pattern by which amino acids are assigned to codons within the standard genetic code, investigators have explored the idea that natural selection placed biochemically similar amino acids near to one another in coding space so as to minimize the impact of mutations and/or mistranslations. The analytical evidence to support this theory has grown in sophistication and strength over the years, and counterclaims questioning its plausibility and quantitative support have yet to transcend some significant weaknesses in their approach. These weaknesses are illustrated here by means of a simple simulation model for adaptive genetic code evolution. There remain ill explored facets of the `error minimizing' code hypothesis, however, including the mechanism and pathway by which an adaptive pattern of codon assignments emerged, the extent to which natural selection created synonym redundancy, its role in shaping the amino acid and nucleotide languages, and even the correct interpretation of the adaptive codon assignment pattern: these represent fertile areas for future research.  相似文献   

11.
A novel subtype of influenza A virus 09H1N1 has rapidly spread across the world. Evolutionary analyses of this virus have revealed that 09H1N1 is a triple reassortant of segments from swine, avian and human influenza viruses. In this study, we investigated factors shaping the codon usage bias of 09H1N1 and carried out cluster analysis of 60 strains of influenza A virus from different subtypes based on their codon usage bias. We discovered that more preferentially used codons of 09H1N1 are A-ended or U-ended...  相似文献   

12.
The persistence of life requires populations to adapt at a rate commensurate with the dynamics of their environment. Successful populations that inhabit highly variable environments have evolved mechanisms to increase the likelihood of successful adaptation. We introduce a 64 × 64 matrix to quantify base-specific mutation potential, analyzing four different replicative systems, error-prone PCR, mouse antibodies, a nematode, and Drosophila. Mutational tendencies are correlated with the structural evolution of proteins. In systems under strong selective pressure, mutational biases are shown to favor the adaptive search of space, either by base mutation or by recombination. Such adaptability is discussed within the context of the genetic code at the levels of replication and codon usage.Supplementary material to this paper is available in electronic form.Reviewing Editor: Dr. Edward Trifonov  相似文献   

13.
Optimality of codon usage in Escherichia coli due to load minimization   总被引:2,自引:0,他引:2  
The canonical genetic code is known to be highly efficient in minimizing the effects of mistranslational errors and point mutations, an ability which in term is designated "load minimization". One parameter involved in calculating the load minimizing property of the genetic code is codon usage. In most bacteria, synonymous codons are not used with equal frequencies. Different factors have been proposed to contribute to codon usage preference. It has been shown that the codon preference is correlated with the composition of the tRNA pool. Selection for translational efficiency and translational accuracy both result in such a correlation. In this work, it is shown that codon usage bias in Escherichia coli works so as to minimize the consequences of translational errors, i.e. optimized for load minimization.  相似文献   

14.
The persistent difficulties in the production of protein at high levels in heterologous systems, as well as the inability to understand pathologies associated with protein aggregation, highlight our limited knowledge on the mechanisms of protein folding in vivo. Attempts to improve yield and quality of recombinant proteins are diverse, frequently involving optimization of the cell growth temperature, the use of synonymous codons and/or the co-expression of tRNAs, chaperones and folding catalysts among others. Although protein secondary structure can be determined largely by the amino acid sequence, protein folding within the cell is affected by a range of factors beyond amino acid sequence. The folding pathway of a nascent polypeptide can be affected by transient interactions with other proteins and ligands, the ribosome, translocation through a pore membrane, redox conditions, among others. The translation rate as well as the translation machinery itself can dramatically affect protein folding, and thus the structure and function of the protein product. This review addresses current efforts to better understand how the use of synonymous codons in the mRNA and the availability of tRNAs can modulate translation kinetics, affecting the folding, the structure and the biological activity of proteins.  相似文献   

15.
在基因组学水平上研究密码子使用偏性模式、成因并分析进化过程中的选择压力在基因组学研究中有重要意义。文章概述了目前提出的密码子使用偏性的量化方法及实现原理。目前研究发现:有些量化密码子偏性的方法受高表达基因参考数据集未完全注释的限制,不同密码子位置对变异和选择的影响不同,以及不同密码子位置处GC含量和嘌呤含量的贡献不同。由此展望密码子偏性量化方法发展方向为:需要设计不需要相关参考基因集合先验知识的密码子使用偏性量化方法;考虑不同位置处背景核苷酸组成的密码子使用偏性的量化方法;同时考虑基因表达水平的密码子使用偏性量化方法。最后,归纳了目前可用的密码子使用偏性的量化工具和数据库。  相似文献   

16.
为探究滇黄精(Polygonatum kingianum)叶绿体全基因组特征和密码子使用偏性,利用第二代测序技术对滇黄精嫩叶进行测序,再经组装与注释后得到其叶绿体基因组全序列,通过MISA、EMBOSS和CodonW等软件对滇黄精叶绿体全基因组的SSR位点、系统发育及密码子偏好性进行分析。结果表明,滇黄精完整叶绿体基因组长度为155 852 bp,基因组平均GC含量为37.7%,其大、小单拷贝区(LSC)长度分别为84 633和185 25 bp,反向重复区长度为26 347 bp,注释了132个基因,包括86个蛋白编码基因、38个tRNA基因和8个核糖rRNA基因。叶绿体基因组中共有69个SSR位点,绝大多数属于单碱基重复的A/T类型。系统发育分析表明滇黄精与格脉黄精(P. tessellatum)亲缘关系近,可能与分布地域有关。密码子偏好性分析表明,滇黄精叶绿体基因组密码子使用模式受到自然选择影响大于突变因素,最终确定9个最优密码子。因此, 滇黄精叶绿体基因组遗传结构和系统发育位置及其密码子偏倚的分析,为叶绿体基因工程研究提供理论依据。  相似文献   

17.
Using a comparative genomics approach we demonstrate a negative correlation between the number of codon reassignments undergone by 222 mitochondrial genomes and the mitochondrial genome size, the number of mitochondrial ORFs, and the sizes of the large and small subunit mitochondrial rRNAs. In addition, we show that the TGA-to-tryptophan codon reassignment, which has occurred 11 times in mitochondrial genomes, is found in mitochondrial genomes smaller than those which have not undergone the reassignment. We therefore propose that mitochondrial codon reassignments occur in a wide range of phyla, particularly in Metazoa, due to a reduced “proteomic constraint” on the mitochondrial genetic code, compared to the nuclear genetic code. The reduced proteomic constraint reflects the small size of the mitochondrial-encoded proteome and allows codon reassignments to occur with less likelihood of lethality. In addition, we demonstrate a striking link between nonsense codon reassignments and the decoding properties of naturally occurring nonsense suppressor tRNAs. This suggests that natural preexisting nonsense suppression facilitated nonsense codon reassignments and constitutes a novel mechanism of genetic code change. These findings explain for the first time the identity of the stop codons and amino acids reassigned in mitochondrial and nuclear genomes. Nonsense suppressor tRNAs provided the raw material for nonsense codon reassignments, implying that the properties of the tRNA anticodon have dictated the identity of nonsense codon reassignments. Electronic Supplementary Material The online version of this article (doi:) contains supplementary material, which is available to authorized users. [Reviewing Editor: Dr. Laura Landweber]  相似文献   

18.
We consider a model of the origin of genetic code organization incorporating the biosynthetic relationships between amino acids and their physicochemical properties. We study the behavior of the genetic code in the set of codes subject both to biosynthetic constraints and to the constraint that the biosynthetic classes of amino acids must occupy only their own codon domain, as observed in the genetic code. Therefore, this set contains the smallest number of elements ever analyzed in similar studies. Under these conditions and if, as predicted by physicochemical postulates, the amino acid properties played a fundamental role in genetic code organization, it can be expected that the code must display an extremely high level of optimization. This prediction is not supported by our analysis, which indicates, for instance, a minimization percentage of only 80%. These observations can therefore be more easily explained by the coevolution theory of genetic code origin, which postulates a role that is important but not fundamental for the amino acid properties in the structuring of the code. We have also investigated the shape of the optimization landscape that might have arisen during genetic code origin. Here, too, the results seem to favor the coevolution theory because, for instance, the fact that only a few amino acid exchanges would have been sufficient to transform the genetic code (which is not a local minimum) into a much better optimized code, and that such exchanges did not actually take place, seems to suggest that, for instance, the reduction of translation errors was not the main adaptive theme structuring the genetic code.  相似文献   

19.
20.
余劲聪  方柏山 《生物信息学》2011,9(3):242-246,249
密码子用法数据库(CUD)是密码子用法与密码子优化研究领域的一个重要的在线服务,为了找出该数据库中潜在的两种不再适用的记录,即已过期的陈旧记录和在遗传密码类型上实际无法有效支持的记录(简称不支持记录),本文通过结合前期自主研发的两个软件BestCodon与CUDassist,组建了CUD自动寻错平台CUDer,并应用于上述问题的研究。结果发现,CUD中存在317条陈旧记录与4条不支持记录。对于陈旧记录,这些记录的物种分类号在NCBI中已发生变更,研究者应该避免使用这些记录的相关数据;对于不支持记录,本文借助CUDer计算得到这些记录正确的密码子用法表(CUT),弥补了当前CUD的不足。此外,该平台也为面向CUD的自动化数据处理,提供了一个新的软件框架,具有一定的借鉴意义。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号