首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Studies on the origin of the genetic code compare measures of the degree of error minimization of the standard code with measures produced by random variant codes but do not take into account codon usage, which was probably highly biased during the origin of the code. Codon usage bias could play an important role in the minimization of the chemical distances between amino acids because the importance of errors depends also on the frequency of the different codons. Here I show that when codon usage is taken into account, the degree of error minimization of the standard code may be dramatically reduced, and shifting to alternative codes often increases the degree of error minimization. This is especially true with a high CG content, which was probably the case during the origin of the code. I also show that the frequency of codes that perform better than the standard code, in terms of relative efficiency, is much higher in the neighborhood of the standard code itself, even when not considering codon usage bias; therefore alternative codes that differ only slightly from the standard code are more likely to evolve than some previous analyses suggested. My conclusions are that the standard genetic code is far from being an optimum with respect to error minimization and must have arisen for reasons other than error minimization.[Reviewing Editor: Martin Kreitman]  相似文献   

2.
Statistical and biochemical studies of the genetic code have found evidence of nonrandom patterns in the distribution of codon assignments. It has, for example, been shown that the code minimizes the effects of point mutation or mistranslation: erroneous codons are either synonymous or code for an amino acid with chemical properties very similar to those of the one that would have been present had the error not occurred. This work has suggested that the second base of codons is less efficient in this respect, by about three orders of magnitude, than the first and third bases. These results are based on the assumption that all forms of error at all bases are equally likely. We extend this work to investigate (1) the effect of weighting transition errors differently from transversion errors and (2) the effect of weighting each base differently, depending on reported mistranslation biases. We find that if the bias affects all codon positions equally, as might be expected were the code adapted to a mutational environment with transition/transversion bias, then any reasonable transition/transversion bias increases the relative efficiency of the second base by an order of magnitude. In addition, if we employ weightings to allow for biases in translation, then only 1 in every million random alternative codes generated is more efficient than the natural code. We thus conclude not only that the natural genetic code is extremely efficient at minimizing the effects of errors, but also that its structure reflects biases in these errors, as might be expected were the code the product of selection. Received: 25 July 1997 / Accepted: 9 January 1998  相似文献   

3.
We simulate a deterministic population genetic model for the coevolution of genetic codes and protein-coding genes. We use very simple assumptions about translation, mutation, and protein fitness to calculate mutation-selection equilibria of codon frequencies and fitness in a large asexual population with a given genetic code. We then compute the fitnesses of altered genetic codes that compete to invade the population by translating its genes with higher fitness. Codes and genes coevolve in a succession of stages, alternating between genetic equilibration and code invasion, from an initial wholly ambiguous coding state to a diversified frozen coding state. Our simulations almost always resulted in partially redundant frozen genetic codes. Also, the range of simulated physicochemical properties among encoded amino acids in frozen codes was always less than maximal. These results did not require the assumption of historical constraints on the number and type of amino acids available to codes nor on the complexity of proteins, stereochemical constraints on the translational apparatus, nor mechanistic constraints on genetic code change. Both the extent and timing of amino-acid diversification in genetic codes were strongly affected by the message mutation rate and strength of missense selection. Our results suggest that various omnipresent phenomena that distribute codons over sites with different selective requirements—such as the persistence of nonsynonymous mutations at equilibrium, the positive selection of the same codon in different types of sites, and translational ambiguity—predispose the evolution of redundancy and of reduced amino acid diversity in genetic codes. Received: 21 December 2000 / Accepted: 12 March 2001  相似文献   

4.
A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature. Received: 8 August 1996 / Accepted: 26 December 1996  相似文献   

5.
We have assumed that the coevolution theory of genetic code origin (Wong JT, Proc Natl Acad Sci USA 72:1909–1912, 1975) is essentially correct. This theory makes it possible to identify at least 10 evolutionary stages through which genetic code organization might have passed prior to reaching its current form. The calculation of the minimization level of all these evolutionary stages leads to the following conclusions. (1) The minimization percentages increased linearly with the number of amino acids codified in the codes of the various evolutionary stages when only the sense changes are considered in the analysis. This seems to favor the physicochemical theory of genetic code origin even if, as discussed in the paper, this observation is also compatible with the coevolution theory. (2) For the first seven evolutionary stages of the genetic code, this trend is less clear and indeed is inverted when we consider the global optimisation of the codes due to both sense changes and synonymous changes. This inverse correlation between minimization percentages and the number of amino acids codified in the codes of the intermediate stages seems to favor neither the physicochemical nor the stereochemical theories of genetic code origin, as it is in the early and intermediate stages of code development that these theories would expect minimization to have played a crucial role, and this does not seem to be the case. However, these results are in agreement with the coevolution theory, which attributes a role to the physicochemical properties of amino acids that, while important, is nevertheless subordinate to the mechanism which concedes codons from the precursor amino acids to the product amino acids as the primary factor determining the evolutionary structuring of the genetic code. The results are therefore discussed in the context of the various theories proposed to explain genetic code origin. Received: 25 October 1998 / Accepted: 19 February 1999  相似文献   

6.
Distances between amino acids were derived from the polar requirement measure of amino acid polarity and Benner and co-workers' (1994) 74-100 PAM matrix. These distances were used to examine the average effects of amino acid substitutions due to single-base errors in the standard genetic code and equally degenerate randomized variants of the standard code. Second-position transitions conserved all distances on average, an order of magnitude more than did second-position transversions. In contrast, first-position transitions and transversions were about equally conservative. In comparison with randomized codes, second-position transitions in the standard code significantly conserved mean square differences in polar requirement and mean Benner matrix-based distances, but mean absolute value differences in polar requirement were not significantly conserved. The discrepancy suggests that these commonly used distance measures may be insufficient for strict hypothesis testing without more information. The translational consequences of single-base errors were then examined in different codon contexts, and similarities between these contexts explored with a hierarchical cluster analysis. In one cluster of codon contexts corresponding to the RNY and GNR codons, second-position transversions between C and G and transitions between C and U were most conservative of both polar requirement and the matrix-based distance. In another cluster of codon contexts, second-position transitions between A and G were most conservative. Despite the claims of previous authors to the contrary, it is shown theoretically that the standard code may have been shaped by position-invariant forces such as mutation and base content. These forces may have left heterogeneous signatures in the code because of differences in translational fidelity by codon position. A scenario for the origin of the code is presented wherein selection for error minimization could have occurred multiple times in disjoint parts of the code through a phyletic process of competition between lineages. This process permits error minimization without the disruption of previously useful messages, and does not predict that the code is optimally error-minimizing with respect to modern error. Instead, the code may be a record of genetic process and patterns of mutation before the radiation of modern organisms and organelles. Received: 28 July 1997 / Accepted: 23 January 1998  相似文献   

7.
Characteristic features of tRNA such as the anticodon sequence and modified nucleotides in the anticodon loop are thought to be crucial effectors for promoting or restricting codon reassignment. Our recent findings on basepairing rules between anticodon and codon in various metazoan mitochondria suggest that the complete loss of a codon is not necessarily essential for codon reassignment to take place. We postulate that a possible competition between two tRNAs with cognate anticodon sequences towards the relevant codon to be varied has a potential role in codon reassignment. Our proposition can be viewed as an expanded version of the codon capture theory proposed by Osawa and Jukes (J Mol Evol 28: 271–278, 1989). Received: 28 December 2000 / Accepted: 12 March 2001  相似文献   

8.
A paper (Amirnovin R, J Mol Evol 44:473–476, 1997) seems to undermine the validity of the coevolution theory of genetic code origin by shedding doubt on the connection between the biosynthetic relationships between amino acids and the organization of the genetic code, at a time when the literature on the topic takes this for granted. However, as a few papers cite this paper as evidence against the coevolution theory, and to cast aside all doubt on the subject, we have decided to reanalyze the statistical bases on which this theory is founded. We come to the following conclusions: (1) the methods used in the above referred paper contain certain mistakes, and (2) the statistical foundations on which the coevolution theory is based are extremely robust. We have done this by critically appraising Amirnovin's paper and suggesting an alternative method based on the generation of random codes which, along with the method reported in the literature, allows us to evaluate the significance, in the genetic code, of different sets of amino acid pairs in biosynthetic relationships. In particular, by using this method and after building up a certain set of amino acid pairs reflecting the expectations of the coevolution theory, we show that the presence of this set in the genetic code would be obtained, purely by chance, with a probability of 6 × 10−5. This observation seems to provide particularly strong support to the coevolution theory. Received: 28 June 1999 / Accepted: 23 October 1999  相似文献   

9.
We have previously proposed an SNS hypothesis on the origin of the genetic code (Ikehara and Yoshida 1998). The hypothesis predicts that the universal genetic code originated from the SNS code composed of 16 codons and 10 amino acids (S and N mean G or C and either of four bases, respectively). But, it must have been very difficult to create the SNS code at one stroke in the beginning. Therefore, we searched for a simpler code than the SNS code, which could still encode water-soluble globular proteins with appropriate three-dimensional structures at a high probability using four conditions for globular protein formation (hydropathy, α-helix, β-sheet, and β-turn formations). Four amino acids (Gly [G], Ala [A], Asp [D], and Val [V]) encoded by the GNC code satisfied the four structural conditions well, but other codes in rows and columns in the universal genetic code table do not, except for the GNG code, a slightly modified form of the GNC code. Three three-amino acid systems ([D], Leu and Tyr; [D], Tyr and Met; Glu, Pro and Ile) also satisfied the above four conditions. But, some amino acids in the three systems are far more complex than those encoded by the GNC code. In addition, the amino acids in the three-amino acid systems are scattered in the universal genetic code table. Thus, we concluded that the universal genetic code originated not from a three-amino acid system but from a four-amino acid system, the GNC code encoding [GADV]-proteins, as the most primitive genetic code. Received: 11 June 2001 / Accepted: 11 October 2001  相似文献   

10.
We consider a model of the origin of genetic code organization incorporating the biosynthetic relationships between amino acids and their physicochemical properties. We study the behavior of the genetic code in the set of codes subject both to biosynthetic constraints and to the constraint that the biosynthetic classes of amino acids must occupy only their own codon domain, as observed in the genetic code. Therefore, this set contains the smallest number of elements ever analyzed in similar studies. Under these conditions and if, as predicted by physicochemical postulates, the amino acid properties played a fundamental role in genetic code organization, it can be expected that the code must display an extremely high level of optimization. This prediction is not supported by our analysis, which indicates, for instance, a minimization percentage of only 80%. These observations can therefore be more easily explained by the coevolution theory of genetic code origin, which postulates a role that is important but not fundamental for the amino acid properties in the structuring of the code. We have also investigated the shape of the optimization landscape that might have arisen during genetic code origin. Here, too, the results seem to favor the coevolution theory because, for instance, the fact that only a few amino acid exchanges would have been sufficient to transform the genetic code (which is not a local minimum) into a much better optimized code, and that such exchanges did not actually take place, seems to suggest that, for instance, the reduction of translation errors was not the main adaptive theme structuring the genetic code.  相似文献   

11.
I attempt to sketch a unified picture of the origin of living organisms in their genetic, bioenergetic, and structural aspects. Only selection at a higher level than for individual selfish genes could power the cooperative macromolecular coevolution required for evolving the genetic code. The protein synthesis machinery is too complex to have evolved before membranes. Therefore a symbiosis of membranes, replicators, and catalysts probably mediated the origin of the code and the transition from a nucleic acid world of independent molecular replicators to a nucleic acid/protein/lipid world of reproducing organisms. Membranes initially functioned as supramolecular structures to which different replicators attached and were selected as a higher-level reproductive unit: the proto-organism. I discuss the roles of stereochemistry, gene divergence, codon capture, and selection in the code's origin. I argue that proteins were primarily structural not enzymatic and that the first biological membranes consisted of amphipathic peptidyl-tRNAs and prebiotic mixed lipids. The peptidyl-tRNAs functioned as genetically-specified lipid analogues with hydrophobic tails (ancestral signal peptides) and hydrophilic polynucleotide heads. Protoribosomes arose from two cooperating RNAs: peptidyl transferase (large subunit) and mRNA-binder (small subunit). Early proteins had a second key role: coupling energy flow to the phosphorylation of gene and peptide precursors, probably by lithophosphorylation by membrane-anchored kinases scavenging geothermal polyphosphate stocks. These key evolutionary steps probably occurred on the outer surface of an `inside out-cell' or obcell, which evolved an unambiguous hydrophobic code with four prebiotic amino acids and proline, and initiation by isoleucine anticodon CAU; early proteins and nucleozymes were all membrane-attached. To improve replication, translation, and lithophosphorylation, hydrophilic substrate-binding and catalytic domains were later added to signal peptides, yielding a ten-acid doublet code. A primitive proto-ecology of molecular scavenging, parasitism, and predation evolved among obcells. I propose a new theory for the origin of the first cell: fusion of two cup-shaped obcells, or hemicells, to make a protocell with double envelope, internal genome and ribosomes, protocytosol, and periplasm. Only then did water-soluble enzymes, amino acid biosynthesis, and intermediary metabolism evolve in a concentrated autocatalytic internal cytosolic soup, causing 12 new amino acid assignments, termination, and rapid freezing of the 22-acid code. Anticodons were recruited sequentially: GNN, CNN, INN, and *UNN. CO2 fixation, photoreduction, and lipid synthesis probably evolved in the protocell before photophosphorylation. Signal recognition particles, chaperones, compartmented proteases, and peptidoglycan arose prior to the last common ancestor of life, a complex autotrophic, anaerobic green bacterium. Received: 19 February 2001 / Accepted: 9 April 2001  相似文献   

12.
A new method for looking at relationships between nucleotide sequences has been used to analyze divergence both within and between the families of isoaccepting tRNA sets. A dendrogram of the relationships between 21 tRNA sets with different amino acid specificities is presented as the result of the analysis. Methionine initiator tRNAs are included as a separate set. The dendrogram has been interpreted with respect to the final stage of the evolutionary pathway with the development of highly specific tRNAs from ambiguous molecular adaptors. The location of the sets on the dendrogram was therefore analyzed in relation to hypotheses on the origin of the genetic code: the coevolution theory, the physicochemical hypothesis, and the hypothesis of ambiguity reduction of the genetic code. Pairs of 16 sets of isoacceptor tRNAs, whose amino acids are in biosynthetic relationships, occupied contiguous positions on the dendrogram, thus supporting the coevolution theory of the genetic code. Received: 4 May 1998 / Accepted: 11 July 1998  相似文献   

13.
14.
Optimality of codon usage in Escherichia coli due to load minimization   总被引:2,自引:0,他引:2  
The canonical genetic code is known to be highly efficient in minimizing the effects of mistranslational errors and point mutations, an ability which in term is designated "load minimization". One parameter involved in calculating the load minimizing property of the genetic code is codon usage. In most bacteria, synonymous codons are not used with equal frequencies. Different factors have been proposed to contribute to codon usage preference. It has been shown that the codon preference is correlated with the composition of the tRNA pool. Selection for translational efficiency and translational accuracy both result in such a correlation. In this work, it is shown that codon usage bias in Escherichia coli works so as to minimize the consequences of translational errors, i.e. optimized for load minimization.  相似文献   

15.
The genetic code is not random but instead is organized in such a way that single nucleotide substitutions are more likely to result in changes between similar amino acids. This fidelity, or error minimization, has been proposed to be an adaptation within the genetic code. Many models have been proposed to measure this adaptation within the genetic code. However, we find that none of these consider codon usage differences between species. Furthermore, use of different indices of amino acid physicochemical characteristics leads to different estimations of this adaptation within the code. In this study, we try to establish a more accurate model to address this problem. In our model, a weighting scheme is established for mistranslation biases of the three different codon positions, transition/transversion biases, and codon usage. Different indices of amino acids physicochemical characteristics are also considered. In contrast to pervious work, our results show that the natural genetic code is not fully optimized for error minimization. The genetic code, therefore, is not the most optimized one for error minimization, but one that balances between flexibility and fidelity for different species.  相似文献   

16.
Two forces are in general, hypothesized to have influenced the origin of the organization of the genetic code: the physicochemical properties of amino acids and their biosynthetic relationships. In view of this, we have considered a model incorporating these two forces. In particular, we have studied the optimization level of the physicochemical properties of amino acids in the set of amino acid permutation codes that respects the biosynthetic relationships between amino acids. Where the properties of amino acids are represented by polarity and molecular volume we obtain indetermination percentages in the organization of the genetic code of approximately 40%. This indicates that the contingent factor played a significant role in structuring the genetic code. Furthermore, this result is in agreement with the genetic code coevolution hypothesis, which attributes a merely ancillary role to the properties of amino acids while it suggests that it was their biosynthetic relationships that organized the code. Furthermore, this result does not favor the stereochemical models proposed to explain the origin of the genetic code. On the other hand, where the properties of amino acids are represented by polarity alone, we obtain an indetermination percentage of at least 21.5%. This might suggest that the polarity distances played an important role and would therefore provide evidence in favor of the physicochemical hypothesis of genetic code origin. Although, overall, the analysis might have given stronger support to the latter hypothesis, this did not actually occur. The results are therefore discussed in the context of the different theories proposed to explain the origin of the genetic code. Received: 10 September 1996 / Accepted: 3 March 1997  相似文献   

17.
For the comprehensive analyses of deviant codes in protistan mitochondria (mt), we sequenced about a 1.1-kb region of a mitochondrial (mt) gene, the cytochrome c oxidase subunit I (coxI) in two chlorarachniophytes, the filose amoeba Euglypha rotunda, the cryptomonad Cryptomonas ovata, the prymnesiophyte (haptophyte) Diacronema vlkianum (Pavlovales), and the diatom Melosira ambigua. As a result of this analysis, we noticed that the UGA codon is assigned to tryptophan (Trp) instead of being a signal for translational termination in two chlorarachniophytes and in E. rotunda. The same type of deviant code was reported previously in animals, fungi, ciliates, kinetoplastids, Chondrus crispus (a red alga), Acanthamoeba castellanii (an amoeboid protozoon), and three of the four prymnesiophyte orders with the exception of the Pavlovales. A phylogenetic analysis based on the COXI sequences of 56 eukaryotes indicated that the organisms bearing the modified code, UGA for Trp, are not monophyletic. Based on these studies, we propose that the ancestral mitochondrion was bearing the universal genetic code and subsequently reassigned the codon to Trp independently, at least in the lineage of ciliates, kinetoplastids, rhodophytes, prymnesiophytes, and fungi. We also discuss how this codon was directionally captured by Trp tRNA. Received: 26 January 1998 / Accepted: 24 April 1998  相似文献   

18.
In the plant chloroplast genome the codon usage of the highly expressed psbA gene is unique and is adapted to the tRNA population, probably due to selection for translation efficiency. In this study the role of selection on codon usage in each of the fully sequenced chloroplast genomes, in addition to Chlamydomonas reinhardtii, is investigated by measuring adaptation to this pattern of codon usage. A method is developed which tests selection on each gene individually by constructing sequences with the same amino acid composition as the gene and randomly assigning codons based on the nucleotide composition of noncoding regions of that genome. The codon bias of the actual gene is then compared to a distribution of random sequences. The data indicate that within the algae selection is strong in Cyanophora paradoxa, affecting a majority of genes, of intermediate intensity in Odontella sinensis, and weaker in Porphyra purpurea and Euglena gracilis. In the plants, selection is found to be quite weak in Pinus thunbergii and the angiosperms but there is evidence that an intermediate level of selection exists in the liverwort Marchantia polymorpha. The role of selection is then further investigated in two comparative studies. It is shown that average relative codon bias is correlated with expression level and that, despite saturation levels of substitution, there is a strong correlation among the algae genomes in the degree of codon bias of homologous genes. All of these data indicate that selection for translation efficiency plays a significant role in determining the codon bias of chloroplast genes but that it acts with different intensities in different lineages. In general it is stronger in the algae than the higher plants, but within the algae Euglena is found to have several unusual features which are noted. The factors that might be responsible for this variation in intensity among the various genomes are discussed. Received: 6 June 1997 / Accepted: 24 July 1997  相似文献   

19.
Mitochondrial genetic codons can be categorized by four patterns of nucleotide-site degeneracy based on varying combinations of twofold- or nondegenerate sites at first codon positions and twofold- or fourfold-degenerate sites at third codon positions. Herein, a model of molecular evolution is introduced that uses these patterns to calculate expected substitution frequencies for each codon position and substitution type relative to overall number of synonymous or nonsynonymous substitutions. Regions of the pocket gopher cytochrome oxidase subunit I (COI) and cytochrome b (cyt-b) genes are analyzed using this model. Chi-square distributions are used to produce relative goodness-of-fit (GF) scores for measuring the difference between substitution frequencies predicted by the codon-degeneracy model (CDM), and frequencies inferred using a well-supported phylogenetic tree of closely related species. The GF scores for expected and observed synonymous (GFsyn= 0.429, p= 0.807) and nonsynonymous (GFns= 2.309, p= 0.679) substitution frequencies resulted in a failure to reject the CDM as a null hypothesis for the molecular evolution of COI and cyt-b in pocket gophers. Alternative tree topologies and calculations of transition bias for these data result in higher GF scores. Received: 25 March 1999 / Accepted: 17 September 1999  相似文献   

20.
We examined a region of high variability in the mosaic mercury resistance (mer) operon of natural bacterial isolates from the primate intestinal microbiota. The region between the merP and merA genes of nine mer loci was sequenced and either the merC, the merF, or no gene was present. Two novel merC genes were identified. Overall nucleotide diversity, π (per 100 sites), of the merC gene was greater (49.63) than adjacent merP (35.82) and merA (32.58) genes. However, the consequences of this variability for the predicted structure of the MerC protein are limited and putative functional elements (metal-binding ligands and transmembrane domains) are strongly conserved. Comparison of codon usage of the merTP, merC, and merA genes suggests that several merC genes are not coeval with their flanking sequences. Although evidence of homologous recombination within the very variable merC genes is not apparent, the flanking regions have higher homologies than merC, and recombination appears to be driving their overall sequence identities higher. The synonymous codon usage bias (ENC) values suggest greater variability in expression of the merC gene than in flanking genes in six different bacterial hosts. We propose a model for the evolution of MerC as a host-dependent, adventitious module of the mer operon. Received: 2 June 2000 / Accepted: 23 October 2000  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号