首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A model for topological coding of proteins is proposed. The model is based on the capacity of hydrogen bonds (property of connectivity) to fix conformations of protein molecules. The protein chain is modeled by an n -arc graph with the following elements: vertices (alpha -carbon atoms), structural edges (peptide bonds) and connectivity edges (virtual edges connecting non-adjacent atoms). It was shown that 64 conformations of the 4-arc graph can be described in the binary system by matrices of six variables which form a supermatrix containing four blocks. On the basis of correspondences between the pairs of variables in matrices and four letters of the genetic code matrices and supermatrix are converted, respectively, into the triplets and the table of the genetic code. An algorithm admitting computer programming is proposed for coding the n -arc graph and protein chain. Connectivity operators (polar amino acids) are assigned to blocks of triplets coding for cyclic conformations (G, A-in the second position), while anti-connectivity operators (non-polar amino acids) correspond to blocks of triplets coding for open conformations (C, U-in the second position). Amino acids coded by triplets differing by the first base have different structures. The third base for C, U and G, A is degenerated. Properties of the real genetic code are in full agreement with the model. The model provides an insight into the topological nature of the genetic code and can be used for development of algorithms for the prediction of the protein structure.  相似文献   

2.
Summary The 20 naturally occurring amino acids are characterized by 20 variables: pKNH 2, pKCOOH, pI, molecular weight, substituent van der Waals volume, seven1H and13C nuclear magnetic resonance shift variables, and eight hydrophobicity-hydrophilicity scales. The 20-dimensional data set is reduced to a few new dimensions by principal components analysis. The three first principal components reveal relationships between the properties of the amino acids and the genetic code. Thus the amino acids coded for by adenosine (A), uracil (U), or cytosine (C) in their second codon position (corresponding to U, A, or G in the second anticodon position) are grouped in these components. No grouping was detected for the amino acids coded for by guanine (G) in the second codon position (corresponding to C in the second anticodon position). The results show that a relationship exists between the physical-chemical properties of the amino acids and which of the A (U), U (A), or C (G) nucleotide is used in the second codon (anticodon) position. The amino acids coded for by G (C) in the second codon (anticodon) position do not participate in this relationship.  相似文献   

3.
Enthalpies (delta H++) and entropies (delta S++) of activation for the reaction of 18 N'-hydroxysuccinimide esters of N-protected proteinaceous amino acids with p-anisidine were measured and free enthalpies of activation (delta G++) at 25 degrees C were calculated on this basis. A regular correlation between delta G++s and the corresponding amino acid codons was found. To obtain this correlation all the codons had to be arranged in a closed ring in which the consecutive codons were connected by one-step mutational changes. One-step mutations appeared as a regular series: 2,3,3,3,1,3,3,3,1,3,3,3,1,3,3,3,2,3,3,3. (the numbers denote a codon position in which a change took place). There were three such 'one-step mutation periods' in the ring, each containing 20 codons (in each block of 16 codons with A, U and C, in the central position and 4 codons containing G in the central position). The end of the third period (UG) and the beginning of the first period were bridged by the four codons of glycine with G in the second position. The values of delta G++ change similarly in each period, increasing upon approaching Lys, Pro, and Ile. The periodical relation between the chemical reactivities of the coded amino acids (reflected by delta G++s) and the structure of their codons could be of importance for the origin of the genetic code i.e. for selection of proper codons for the definite amino acids.  相似文献   

4.
In mammalian mitochondrial codes, fourfold codons wobble-pair with UNN anticodons so that U wobbles with U, C, A and G. Twofold pyrimidine-terminated codons pair with GNN and twofold purine-terminated codons pair with UNN. These properties enable a prediction to be made for evolution of the universal genetic code. It was postulated (1) that an archetypal code of 16 quartets coded for 15 amino acids. If this code used UNN anticodons, then duplication of tRNA genes, followed by mutations in the anticodons and aminoacylation sites, would give rise to the present universal code.  相似文献   

5.
The code within the codons   总被引:6,自引:0,他引:6  
F J Taylor  D Coates 《Bio Systems》1989,22(3):177-187
For the first time it is shown that each of the three codon bases has a general correlation with a different, predictable amino acid property, depending on position within the codon. In addition to the previously recognized link between the mid-base and the hydrophobic-hydrophilic spectrum, we show that, with the exception of G, the first base is generally invariant within a synthetic pathway. G--coded amino acids show a different order, being found only at the head of the synthetic pathways. The redundancy of the nature of the third base has a previously unrecognised relationship with molecular weight. The bases U and A (transversions) are associated with the most sharply defined or opposite states in both the first and second position, C somewhat less so or intermediate, anf G neutral. The apparently systematic nature of these relationships has profound implications for the origin of the genetic code. It appears to be the remains of the first language of the cell, predating the tRNA/ribosome system, persisting with remarkably little change at a deeper level of organisation than the codon language.  相似文献   

6.
Molar content of guanine plus cytosine (G + C) and optimal growth temperature (OGT) are main factors characterizing the frequency distribution of amino acids in prokaryotes. Previous work, using multivariate exploratory methods, has emphasized ascertainment of biological factors underlying variability between genomes, but the strength of each identified factor on amino acid content has not been quantified. We combine the flexibility of the phylogenetic mixed model (PMM) with the power of Bayesian inference via Markov Chain Monte Carlo (MCMC) methods, to obtain a novel evolutionary picture of amino acid usage in prokaryotic genomes. We implement a Bayesian PMM which incorporates the feature that evolutionary history makes observed data interdependent. As in previous studies with PMM, we present a variance partition; however, attention is also given to the posterior distribution of "systematic effects" that may shed light about the relative importance of and relationships between evolutionary forces acting at the genomic level. In particular, we analyzed influences of G + C, OGT, and respiratory metabolism. Estimates of G + C effects were significant for amino acids coded by G + C or molar content of adenine plus thymine (A + T) in first and second bases. OGT had an important effect on 12 amino acids, probably reflecting complex patterns of protein modifications, to cope with varying environments. The effect of respiratory metabolism was less clear, probably due to the already reported association of G + C with aerobic metabolism. A "heritability" parameter was always high and significant, reinforcing the importance of accommodating phylogenetic relationships in these analyses. "Heritable" component correlations displayed a pattern that tended to cluster "pure" G + C (A + T) in first and second codon positions, suggesting an inherited departure from linear regression on G + C.  相似文献   

7.
The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by ‘quartets’ of codons with four-fold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If two-fold degeneracy is postulated for the first position of the codon, there could have been 10 amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutamic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an ‘intruder’ into the genetic code, and that it may have displaced another amino acid such as ornithine, or may even have displaced lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons. The introduction of tRNA into protein synthesis may have been a cataclysmic and comparatively sudden event, since duplication of tRNA takes place readily, and point mutations could rapidly differentiate members of the family of duplicates from each. Two tRNAs for different amino acids may have a common ancestor that existed more recently than the separation of the prokaryotes and eukaryotes. This is shown by homology of twoE. coli tRNAs for glycine and valine, and two yeast tRNAs for arginine and lysine.  相似文献   

8.
Summary AGA and AGG (AGR) are arginine codons in the universal genetic code. These codons are read as serine or are used as stop codons in metazoan mitochondria. The arginine residues coded by AGR in yeast orTrypanosoma are coded by arginine CGN throughout metazoan mitochondria. AGR serine sites in metazoan mitochondria are occupied mainly in corresponding sites in yeast orTrypanosoma mitochondria by UCN serine, AGY serine, or codons for amino acids other than serine or arginine. Based on these observations, we propose the following evolutionary events. AGR codons became unassigned because of deletion of tRNA Arg (UCU) and elimination of AGR codons by conversion to CGN arginine codons. Upon acquisition by serine tRNA of pairing ability with AGR codons, some codons for amino acids other than arginine mutated to AGR, and were caputed by anticodon GCU in serine tRNA. During vertebrate mitochondrial evolution, AGR stop codons presumably were created from UAG stop by deletion of the first nucleotide U and by use of R as the third nucleotide that had existed next to the ancestral UAG stop.  相似文献   

9.
Summary Starting from the assumption that specific steric and energetic interactions between amino acids and their respective anticodons could exist, the evolution of the genetic code is deduced from purely chemical and physical reasons. In this model the amino acids are intercalated between the two first anticodon bases and their carbon bound hydrogen atoms are assumed to penetrate into the electron clouds of the bases. By these means a gain in energy and a fixation of the amino acid is obtained in such a way that the anticodon nucleotides could be determinant for the nature of the amino acids.  相似文献   

10.
The genetic code is comprised of a system concerning the distribution of doublets of the first two codon bases among amino acids. According to this system a definite order in the relative distribution of the first and the second codon bases coincides with a definite order among the common amino acids and their distribution for the number of hydrogen atoms per molecule (an unexpected parameter). The pattern of the relative distribution of the first and the second codon bases suggests it originated from a crystalline-like structure in which the set of bases AUGC served as an elementary structural unit and the base doublets played the role of structural analogs to the amino acids. These hypothetical crystalline-like aggregates are composed of the free molecules of amino acids and bases, and although different in their composition, should have an even number of hydrogen atoms per standard structural module.  相似文献   

11.
In the course of experimental approach to the chemical evolution in the primeval sea, we have found that the main products from formaldehyde and hydroxylamine are glycine, alanine, serine, aspartic acid etc., and the products from glycine and formaldehyde are serine and aspartic acid. Guanine is found in the two-letter genetic codons of all these amino acids.Based upon the finding and taking into consideration the probable synthetic pathways of nucleotide bases and protein amino acids in the course of chemical evolution and a correlation between the two-letter codons and the number of carbon atoms in the carbon skeleton of amino acids, 1 have been led to a working hypothesis on the interdependent genesis of nucleotide bases, protein amino acids, and primitive genetic code as shown in Table I.Protein amino acids can be classified into two groups: Purine Group amino acids and Pyrimidine Group amino acids. Purine bases and Pyrimidine bases are predominant in two-letter codons of amino acids belonging to the former and the latter group respectively.Guanine, adenine, and amino acids of the Purine Group may be regarded as synthesized from C1 and C2 compounds and N1 compounds (including C1N1 compunds such as HCN), probably through glycine, in the early stage of chemical evolution.Uracil, cytosine, and amino acids of the Pyrimidine Group may be regarded as synthesized directly or indirectly from three-carbon chain compounds. This synthesis became possible after the accumulation of three-carbon chain compounds and their derivatives in the primeval sea.The Purine Group can be further classified into a Guanine or (Gly+nC1) Subgroup and an Adenine or (Gly+nC2) Subgroup or simply nC2 Subgroup. The Pyrimidine Group can be further classified into a Uracil or C3C6C9 Subgroup and a Cytosine or C5-chain Subgroup (Table I).It is suggested that the primitive genetic code was established by a specific interaction between amino acids and their respective nucleotide bases. The interaction was dependent upon their concentration in the primeval environments and the binding constants between amino acids and their respective bases.Presented at the International Symposium (Lipmann Symposium) on The Concepts of Chemical Recognition in Biology held in Grignon near Versailles (France) on July 18–20, 1979.  相似文献   

12.
Temporal order ("chronology") of appearance of amino acids and their respective codons on evolutionary scene is reconstructed. A consensus chronology of amino acids is built on the basis of 60 different criteria each offering certain temporal order. After several steps of filtering the chronology vectors are averaged resulting in the consensus order: G, A, D, V, P, S, E, (L, T), R, (I, Q, N), H, K, C, F, Y, M, W. It reveals two important features: the amino acids synthesized in imitation experiments of S. Miller appeared first, while the amino acids associated with codon capture events came last. The reconstruction of codon chronology is based on the above consensus temporal order of amino acids, supplemented by the stability and complementarity rules first suggested by M. Eigen and P. Schuster, and on the earlier established processivity rule. At no point in the reconstruction the consensus amino-acid chronology was in conflict with these three rules. The derived genealogy of all 64 codons suggested several important predictions that are confirmed. The reconstruction of the origin and evolutionary history of the triplet code becomes, thus, a powerful research tool for molecular evolution studies, especially in its early stages.  相似文献   

13.
V V Sukhodolets 《Genetika》1985,21(10):1589-1600
According to a certain order in sets of the two first codon bases, 20 common amino acids can be divided into 5 families each containing 4 amino acids; the corresponding order in the distribution of codon bases can be easily detected, if common amino acids are distributed for the numbers of hydrogen atoms per molecule (Sukhodolets, 1980). In the present paper, the order in the distribution of codon bases is explained on the basis of the hypothesis claiming the prebiological existence of crystalline associates composed of amino acids and bases as free molecules. In these heterogeneous crystalline associates amino acids were analogs to the base douplets and the arrangement of molecules followed a certain rule, namely: 40 protons per molecular complex forming a standard structural compartment. It is proposed that the crystalline associates existed as lyotropic liquid crystals with hydrocarbons as solvent. The genetical code allows to discover two different original crystallization types for bases and amino acids. Therefore, the life possibly originates from combining in the same structure different crystallization patterns, which resulted in formation of a finite crystalline associate.  相似文献   

14.
Hydropathic anti-complementarity of amino acids based on the genetic code   总被引:15,自引:0,他引:15  
An interesting pattern in the genetic code has been discovered. Codons for hydrophilic and hydrophobic amino acids on one strand of DNA are complemented by codons for hydrophobic and hydrophilic amino acids on the other DNA strand, respectively. The average tendency of codons for "uncharged" (slightly hydrophilic) amino acids is to be complemented by codons for "uncharged" amino acids.  相似文献   

15.
Qiu Y  Zhu L 《Bio Systems》2000,56(2-3):139-144
We rearrange the genetic code and present a table of codons. The chemical properties of the amino acids coded by codons, and the evolutionary trend of codons are well reflected in the order of this table, from which two rules can be drawn: (1) the polarity/non-polarity and hydrophilicity/hydrophobicity of amino acids coded for by codons alternate row by row in the table; (2) in general, the lower down in the table, the earlier the codons are in terms of evolution.  相似文献   

16.
A full length cDNA clone that directs the in vitro synthesis of human histone H2A isoprotein H2A.X has been isolated and sequenced. H2A.X contains 142 amino acid residues, 13 more than human H2A.1. The sequence of the first 120 residues of H2A.X is almost identical to that of human H2A.1. The sequence of the carboxy-terminal 22 residues of H2A.X is unrelated to any known sequence in vertebrate histone H2A; however, it contains a sequence homologous with those of several lower eukaryotes. This homology centers on the carboxy-terminal tetrapeptide which in H2A.X is SerGlnGluTyr. Homologous sequences are found in H2As of three types of yeasts, in Tetrahymena and Drosophila. Seven of the nine carboxy-terminal amino acids of H2A.X are identical with those of S. cerevisiae H2A.1. It is suggested that this H2A carboxy-terminal motif may be present in all eukaryotes. The H2A.X cDNA is 1585 bases long followed by a polyA tail. There are 73 nucleotides in the 5' UTR, 432 in the coding region, and 1080 in the 3' UTR. Even though H2A.X is considered a basal histone, being synthesized in G1 as well as in S-phase, and its mRNA contains polyA addition motifs and a polyA tail, its mRNA also contains the conserved stem-loop and U7 binding sequences involved in the processing and stability of replication type histone mRNAs. Two forms of H2A.X mRNA, consistent with the two sets of processing signals were found in proliferating cell cultures. One, about 1600 bases long, contains polyA; the other, about 575 bases long, lacks polyA. The short form behaves as a replication type histone mRNA, decreasing in amount when cell cultures are incubated with inhibitors of DNA synthesis, while the longer behaves as a basal type histone mRNA.  相似文献   

17.
The frequencies of A, C, G, and T in mitochondrial DNA vary among species due to unequal rates of mutation between the bases. The frequencies of bases at fourfold degenerate sites respond directly to mutation pressure. At first and second positions, selection reduces the degree of frequency variation. Using a simple evolutionary model, we show that first position sites are less constrained by selection than second position sites and, therefore, that the frequencies of bases at first position are more responsive to mutation pressure than those at second position. We define a measure of distance between amino acids that is dependent on eight measured physical properties and a similarity measure that is the inverse of this distance. Columns 1, 2, 3, and 4 of the genetic code correspond to codons with U, C, A, and G in their second position, respectively. The similarity of amino acids in the four columns decreases systematically from column 1 to column 2 to column 3 to column 4. We then show that the responsiveness of first position bases to mutation pressure is dependent on the second position base and follows the same decreasing trend through the four columns. Again, this shows the correlation between physical properties and responsiveness. We determine a proximity measure for each amino acid, which is the average similarity between an amino acid and all others that are accessible via single point mutations in the mitochondrial genetic code structure. We also define a responsiveness for each amino acid, which measures how rapidly an amino acid frequency changes as a result of mutation pressure acting on the base frequencies. We show that there is a strong correlation between responsiveness and proximity, and that both these quantities are also correlated with the mutability of amino acids estimated from the mtREV substitution rate matrix. We also consider the variation of base frequencies between strands and between genes on a strand. These trends are consistent with the patterns expected from analysis of the variation among genomes. [Reviewing Editor: Dr. David Pollock]  相似文献   

18.
M Hasegawa  T A Yano 《Origins of life》1975,6(1-2):219-227
The entropy of the amino acid sequences coded by DNA is considered as a measure of diversity of variety of proteins, and is taken as a measure of evolution. The DNA or m-RNA sequence is considered as a stationary second-order Markov chain composed of four kinds of bases. Because of the biased nature of the genetic code table, increase of entropy of amino acid sequences is possible with biased nucleotide sequence. Thus the biased DNA base composition and the extreme rarity of the base doublet CpG of higher organisms are explained. It is expected that the amino acid composition was highly biased at the days of the origin of the genetic code table, and the more frequent amino acids have tended to get rarer, and the rarer ones more frequent. This tendency is observed in the evolution of hemoglobin, cytochrome C, fibrinopeptide, immunoglobulin and lysozyme, and protein as a whole.  相似文献   

19.
H Hartman 《Origins of life》1975,6(3):423-427
An evolutionary scheme is postulated in which the bases enter the genetic code in a definite temporal sequence and the correlated amino acids are assigned definite functions in the evolving system. The scheme requires a singlet code (guanine coding for glycine) evolving into a doublet code (guanine-cytosine doublet coding for gly (GG), ala (GC), arg (CG), pro (CC). The doublet code evolves into a triplet code. Polymerization of nucleotides is thought to have been by block polymerization rather than by a template mechanism. The proteins formed at first were simple structural peptides. No direct nucleotide-amino acid stereo-chemical interaction was required. Rather an adaptor-type indirect mechanism is thought to have been functioning since the origin.  相似文献   

20.
Two ideas have essentially been used to explain the origin of the genetic code: Crick's frozen accident and Woese's amino acid-codon specific chemical interaction. Whatever the origin and codon-amino acid correlation, it is difficult to imagine the sudden appearance of the genetic code in its present form of 64 codons coding for 20 amino acids without appealing to some evolutionary process. On the contrary, it is more reasonable to assume that it evolved from a much simpler initial state in which a few triplets were coding for each of a small number of amino acids. Analysis of genetic code through information theory and the metabolism of pyrimidine biosynthesis provide evidence that suggests that the genetic code could have begun in an RNA world with the two letters A and U grouped in eight triplets coding for seven amino acids and one stop signal. This code could have progressively evolved by making gradual use of letters G and C to end with 64 triplets coding for 20 amino acids and three stop signals. According to proposed evidence, DNA could have appeared after the four-letter structure was already achieved. In the newborn DNA world, T substituted U to get higher physicochemical and genetic stability.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号