首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
An evolutionary scheme is postulated in which the bases enter the genetic code in a definite temporal sequence and the correlated amino acids are assigned definite functions in the evolving system.The scheme requires a singlet code (guanine coding for glycine) evolving into a doublet code (guanine-cytosine doublet coding for gly (GG), ala (GC), arg (CG), pro (CC)). The doublet code evolves into a triplet code. Polymerization of nucleotides is thought to have been by block polymerization rather than by a template mechanism. The proteins formed at first were simple structural peptides. No direct nucleotide-amino acid stereo-chemical interaction was required. Rather an adaptor-type indirect mechanism is thought to have been functioning since the origin.  相似文献   

2.
A progene hypothesis has been proposed earlier to explain the mechanism of origin of the self-reproducing genetic system. Progenes (precursors of the genetic system) are mixed anhydrides of an amino acid and deoxyribotrinucleotide at the 3'-gamma-terminal phosphate (NpNpNppp-AA); they are produced from dinucleotides (NpNp) and 3'-gamma-aminoacylnucleotidylates (Nppp-AA) as a result of specific interaction between amino acid and dinucleotide. The postulated mechanism of progene formation accounts for the selection of substances, including chirality, the origin of the genetic code as well as for the mechanisms of formation, self-reproduction and evolution of the simpliest genetic system ("gene--polypeptide"). A stereochemical analysis of the progene formation mechanism has allowed us to support the main statements of the hypothesis that relate to the origin of the genetic code and to selection of substances. Atomic groups that could be responsible for the specificity of interaction between dinucleotides and amino acids in progene formation have been revealed. Stereochemical evidence for the physicochemical basis of the origin of the existing genetic code have been produced: 1) a special role of the second nucleotide in the codon is demonstrated in amino acid coding by the progene hypothesis principle; 2) an advantage of T against U in such coding is demonstrated; 3) for 16 amino acids out of 20 an agreement has been obtained between the optimal dinucleotide as revealed by the stereochemical analysis and the codon dinucleotides; 4) an explanation for the third nucleotide selection mechanism is offered. A restoration of the prebiotic code, based on these results, has indicated that the code contains 32 codons, is statistical and group-wise. It encodes 7 groups of isofunctional amino acids: 3 overlapping groups of non-polar amino acids 1) medium-size hydrophobic amino acids (chiefly Val, n-Val and a-But), 2) small and medium-size non-polar amino acids (chiefly Ala Val, n-Val a-But and Gly), 3) small non-polar amino acids (Gly, Ala, a-But) and 4 groups of polar amino acids--1) hydroxy--+dicarbonic (Asp, Glu, Ser and Thr), 2) dicarbonic (Asp and Glu), 3) hydroxy (Ser and Thr) and 4) basic (Arg and Lys). The code includes about 20 amino acids among which are 15-17 canonical and a few common non-canonical. The prebiotic code explains many properties of the existing genetic code and is capable of evolving into the latter by way of a gradual replacement of the physicochemical coding mechanism by the enzymatic coding mechanism.  相似文献   

3.
Since the early days of the discovery of the genetic code nonrandom patterns have been searched for in the code in the hope of providing information about its origin and early evolution. Here we present a new classification scheme of the genetic code that is based on a binary representation of the purines and pyrimidines. This scheme reveals known patterns more clearly than the common one, for instance, the classification of strong, mixed, and weak codons as well as the ordering of codon families. Furthermore, new patterns have been found that have not been described before: Nearly all quantitative amino acid properties, such as Woeses polarity and the specific volume, show a perfect correlation to Lagerkvists codon–anticodon binding strength. Our new scheme leads to new ideas about the evolution of the genetic code. It is hypothesized that it started with a binary doublet code and developed via a quaternary doublet code into the contemporary triplet code. Furthermore, arguments are presented against suggestions that a simpler code, where only the midbase was informational, was at the origin of the genetic code.  相似文献   

4.
The doublet or nearest-neighbour ratios of the nucleotides in various computer-generated sequences of DNA have been counted to find out which sequences would have the same ratios as those measured for guinea-pig DNA by Russell et al. (1976). Their data shows that the ratio patterns for all nuclear DNA fractions except satellite, ribosomal and tRNA coding DNA are similar irrespective of G+C content and are characterised by the amount of the doublet CpG being less than 30% of that expected on a random basis. To construct and analyse such theoretical sequences, methods have been developed which allow to be counted the doublet frequencies that random DNA and the DNA expected to code for any amino acid (AA) sequence would have were they analysed experimentally. These methods permit the C+G content to be altered and the frequency of the doublet CpG to be lowered without affecting the information stored in the DNA. The former is achieved by selecting codons for a given AA that are high or low in G and C while the latter requires selecting against triplets that either contain CpG or will cause a CpG to occur between codons.The results show that no DNA sequence that we have been able to construct using the unrestricted genetic code has doublet ratios similar to those observed. However, the DNA expected to code for a group of 27 vertebrate proteins (5237 AAs) of diverse functions has doublet ratios virtually identical to those measured experimentally for the 47% G+C fraction, provided that 77·5 % of the CpG is eliminated. The data for the 34–43% G+C fractions are matched well by the protein sequence provided that codons low in G and C are selected and, again, that 77·5% of CpG is eliminated. We have been unable to match the data for the satellite DNA. A perhaps surprising result was that DNA with a random sequence of nucleotides but subjected to the removal of 80% of CpG had doublet ratios that were similar to the experimental data but matched them less well than the doublets of protein-coding DNA. This result probably does no more than emphasise that a significant part of the match of the pattern of doublet ratios of guinea-pig DNA derives from the elimination of CpG.The effect of evolution (random mutation) on the doublet ratios of protein-coding DNA has been investigated by assuming a mutation rate of 3·10?9 per base per generation (the mutation rate of haemoglobin) and seeing how a great many generations of such mutation affect the doublet ratios. The results show that it will take ~1·5·106 generations for the number of termination codons to double and ~2·107 generations for the doublet ratios to become indistinguishable from those of random DNA. This seems to imply either that selection acts over the whole of the DNA or that the mutation rate of haemoglobin DNA is unusually high. The results, as a whole, support the view that, whether or not all non-satellite DNA actually codes for protein, its sequences are similar to those that would code for proteins.  相似文献   

5.
6.
Error detection and correction properties are fundamental for informative codes. Hamming's distance allows us to study this noise resistance. We present codes characterized by the resistance optimization to nonsense mutational effects. The calculation of the cumulated Hamming's distance allowing to determine the number of optimal codes and their structure can be detailed. The principle of these laws of optimization of resistance consists of choosing constituent codons connected by mutational neighbouring in such a way that random application of mutations on such a code minimize the occurrence of nonsense n-uplets or terminators. New coding symmetries are then described and screened using Galois's polynomials properties and Baudot's code. Such a study can be applied to any length of the codons. Here we present the principles of this optimization for the most simple doublet codes. Another constraint is discussed: the distribution of optimal subcodes for synonymity and the frequencies of utilization of the different codons.We compare these results to those of the present genetic code, and we observe that all coded amino acids (except the particular case of SER) are using optimal sub-codes of synonymity.This work suggests that the appearance of the genetic code was provoked by mutations while optimizing on several levels its resistance to their effects. Thus genetic coding would have been the best automata that could be produced in prebiotic conditions.  相似文献   

7.
8.
Ammonite phylogeny has mainly been established based on a stratigraphic approach, with cladistics underconsidered. The main arguments against the use of cladistics are the supposed large amount of homoplasy and the small number of characters. Resolving the phylogeny of the Hildoceratidae (Early Jurassic) is especially challenging because of its large diversity and disparity. Many forms that have not been determined as closely related in previous studies exhibit very similar shapes. Moreover, some groups are morphologically very different, adding difficulties to building a unique coding scheme at a low taxonomic resolution (i.e. species). Here we propose an integrated coding scheme of the peristome shape and the ornamentation, allowing an increased level of comparison. The shape of the peristome is used as a new reference to locate ornamental features and propose new homology hypotheses. In total, 105 taxa have been analysed for 47 characters. We code continuous characters by their means and ranges ± one standard deviation. We test two weighting schemes: equal weights standardized by unit range and implied weighting with several concavity constants. This work has led to redefinition of the phylogenetic inclusivenesses of all the hildoceratid subfamilies. The new coding scheme based on peristome shapes provides the fewest homoplastic characters. The schemes appear promising to improve phylogenetic analyses in ammonoids as well as molluscs as a whole by creating a general coding framework.  相似文献   

9.
A model for topological coding of proteins is proposed. The model is based on the capacity of hydrogen bonds (property of connectivity) to fix conformations of protein molecules. The protein chain is modeled by an n -arc graph with the following elements: vertices (alpha -carbon atoms), structural edges (peptide bonds) and connectivity edges (virtual edges connecting non-adjacent atoms). It was shown that 64 conformations of the 4-arc graph can be described in the binary system by matrices of six variables which form a supermatrix containing four blocks. On the basis of correspondences between the pairs of variables in matrices and four letters of the genetic code matrices and supermatrix are converted, respectively, into the triplets and the table of the genetic code. An algorithm admitting computer programming is proposed for coding the n -arc graph and protein chain. Connectivity operators (polar amino acids) are assigned to blocks of triplets coding for cyclic conformations (G, A-in the second position), while anti-connectivity operators (non-polar amino acids) correspond to blocks of triplets coding for open conformations (C, U-in the second position). Amino acids coded by triplets differing by the first base have different structures. The third base for C, U and G, A is degenerated. Properties of the real genetic code are in full agreement with the model. The model provides an insight into the topological nature of the genetic code and can be used for development of algorithms for the prediction of the protein structure.  相似文献   

10.
Microgene Polymerization Reaction (MPR) is used as an experimental system to artificially simulate evolution of short, non-repetitive homo-duplex DNA into multiply-repetitive products that can code for functional proteins. Blunt-end ligation by DNA polymerase is crucial in expansion of homo-duplexes (HDs) into head-to-tail multiple repeats in MPR. The propagation mechanism is known, but formation of the initial doublet (ID) by juxtaposing two HDs and polymerization through the gap has been ambiguous. Initiation events with pairs of HDs using Real-Time PCR were more frequent at higher HD concentrations and slightly below the melting temperature. A process molecularity of about 3.1, calculated from the amplification efficiency and the difference in PCR cycles at which propagation was detected at varying HD concentrations, led to a simple mechanism for ID formation: the gap between two HDs is bridged by a third. Considering thermodynamic aspects of the presumed intermediate “nucleation complex” can predict relative propensity for the process with other HDs.  相似文献   

11.
A Tremolieres 《Biochimie》1980,62(7):493-496
In this article it is suggested that the first coding system started from specific interactions between the nucleotide part of nucleotidic cofactors and enzymes. These interactions generated a first primitive code of four words of one letter each for the four primittive amino acids (phenylalanine, lysine, glycine and proline); when the triplet code (which allowed the integration of 20 amino acids into proteins) progressively appeared, it must have been modulated by the existence of this first coding system.  相似文献   

12.
The logic of genetic control of development may be based on a binary epigenetic code. This paper revises the author's previous scheme dealing with the numerology of annelid metamerism in these terms. Certain features of the code had been deduced to be combinatorial, others not. This paradoxical contrast is resolved here by the interpretation that these features relate to different operations of the code; the combinatiorial to coding identity of units, the non-combinatorial to coding production of units. Consideration of a second paradox in the theory of epigenetic coding leads to a new solution which further provides a basis for epimorphic regeneration, and may in particular throw light on the "regeneration-duplication" phenomenon. A possible test of the model is also put forward.  相似文献   

13.
Recent evidence for evolution of the genetic code.   总被引:44,自引:2,他引:42       下载免费PDF全文
The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.  相似文献   

14.
Conflict between Amino Acid and Nucleotide Characters   总被引:5,自引:0,他引:5  
Slowly evolving characters, such as amino acids and replacement substitutions, have generally been favored over faster evolving characters for inferring phylogenetic relationships. However, amino acids constitute composite characters and, because of the degenerate genetic code, are subject to convergence. Based on an analysis of atpB and rbcL in 567 seed plants, we show that silent substitutions may be more phylogenetically informative than replacement substitutions and that artifacts caused by composite characters and/or convergence cause clades on amino acid trees to conflict with nucleotide trees and independent evidence. These findings indicate that coding nucleotide sequences only as amino acid characters for phylogenetic analysis provides little benefit and may yield misleading results.  相似文献   

15.
Summary Unrelated organisms with DNA of extreme G + C content (25% or 70%) are found to share very specific patterns of nearest neighbour base doublet frequency in their DNAs. This is shown to be a result of restrictions on the extremity of amino acid composition in their proteins, combined with a maximisation of the use of one type of base pair in redundant codon positions. Inferences are made about the universal nature of the genetic code and the proportion of DNA used for specifying protein in different species. The composition of coding DNA strands in these organisms is also discussed.  相似文献   

16.
Summary Doublet preference analysis was carried out on coding and noncoding regions ofEscherichia coli, Saccharomyces cerevisiae, and human mitochondrial and nuclear DNA. The preference pattern in 1–2 and 2–3 doublets inE. coli andS. cerevisiae correlated with that in noncoding regions. The 3-1 doublet preference inE. coli genes with low optimal codon frequency and inS. cerevisiae genes also showed a correlation with each of their noncoding doublet preference. A mechanism to explain these double preference correlations in doublet preference is presented: mutational biases, the origin of the noncoding region doublet preference, evolved so as to maintain the 1–2 and 2–3 doublet preference, which is determined by codon usage. These biases then acted on the 3-1 doublet, which was almost free of coding constraints, resulting in a similar preference in this doublet.  相似文献   

17.
Isolation and characterization of the porcine growth hormone gene   总被引:24,自引:0,他引:24  
P D Vize  J R Wells 《Gene》1987,55(2-3):339-344
A cosmid clone containing the entire porcine growth hormone (PGH) gene has been isolated using a full-length PGH cDNA as the hybridization probe. The gene within the cosmid was subcloned into plasmids and completely sequenced. The coding, promoter, and both 5'- and 3'-noncoding sequences of the PGH gene were found to be highly conserved when compared to the previously sequenced genes coding for rat, human and bovine growth hormones, and also to the human placental lactogen gene. The high degree of conservation between the 5'- and 3'-noncoding regions of the genes from these different species indicates that growth hormone genes may be evolving by some unusual mechanism. The PGH gene was found to contain the unusual variant GC donor splice site.  相似文献   

18.
Group graph of the genetic code   总被引:1,自引:0,他引:1  
The genetic code doublets can be divided into two octets of completely degenerate and ambiguous coding dinucleotides. These two octets have the algebraic property of lying on continuously connected planes on the group graph (a tesseract) of the Cartesian product of two Klein 4-groups of nucleotide exchange operators. The K X K group can also be broken into four cosets, one of which has completely degenerate coding elements, and another that has completely ambiguous coding elements. The two octets of coding doublets have the further algebraic property that the product of their internal exchange operators naturally divide into two exactly equivalent sets. These properties of the genetic code are relevant to unraveling error-detecting and error-correcting (proof-reading) aspects of the genetic code and may be helpful in understanding the context-sensitive grammar of genetic language.  相似文献   

19.
Two ideas have essentially been used to explain the origin of the genetic code: Crick's frozen accident and Woese's amino acid-codon specific chemical interaction. Whatever the origin and codon-amino acid correlation, it is difficult to imagine the sudden appearance of the genetic code in its present form of 64 codons coding for 20 amino acids without appealing to some evolutionary process. On the contrary, it is more reasonable to assume that it evolved from a much simpler initial state in which a few triplets were coding for each of a small number of amino acids. Analysis of genetic code through information theory and the metabolism of pyrimidine biosynthesis provide evidence that suggests that the genetic code could have begun in an RNA world with the two letters A and U grouped in eight triplets coding for seven amino acids and one stop signal. This code could have progressively evolved by making gradual use of letters G and C to end with 64 triplets coding for 20 amino acids and three stop signals. According to proposed evidence, DNA could have appeared after the four-letter structure was already achieved. In the newborn DNA world, T substituted U to get higher physicochemical and genetic stability.  相似文献   

20.
We present a method whereby some mRNAs which code for enzymes can be isolated by affinity chromatography of newly synthesized polypeptides bound to their polysome complexes. Using this method we have isolated tyrosinase-mRNA from Xenopus laevis oocytes and have analysed the translation products from the RNAs thus obtained. In vitro translation reveals the presence of two mRNAs coding for polypeptides of molecular weights of 20 000 and 32 000, respectively. The larger molecule corresponds to the molecular weight of nascent tyrosinase. Furthermore, microinjection of the mRNA into Xenopus oocytes results in the synthesis of active tyrosinase. Since this isolation method is dependent on the ability of nascent enzymes to bind to their substrate analogue, it is thought that this approach may be appropriate for obtaining mRNAs coding for other enzymes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号