首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Sequence Evolution of Drosophila Mitochondrial DNA   总被引:18,自引:3,他引:15       下载免费PDF全文
We have compared nucleotide sequences of corresponding segments of the mitochondrial DNA (mtDNA) molecules of Drosophila yakuba and Drosophila melanogaster, which contain the genes for six proteins and seven tRNAs. The overall frequency of substitution between the nucleotide sequences of these protein genes is 7.2%. As was found for mtDNAs from closely related mammals, most substitutions (86%) in Drosophila mitochondrial protein genes do not result in an amino acid replacement. However, the frequencies of transitions and transversions are approximately equal in Drosophila mtDNAs, which is in contrast to the vast excess of transitions over transversions in mammalian mtDNAs. In Drosophila mtDNAs the frequency of C----T substitutions per codon in the third position is 2.5 times greater among codons of two-codon families than among codons of four-codon families; this is contrary to the hypothesis that third position silent substitutions are neutral in regard to selection. In the third position of codons of four-codon families transversions are 4.6 times more frequent than transitions and A----T substitutions account for 86% of all transversions. Ninety-four percent of all codons in the Drosophila mtDNA segments analyzed end in A or T. However, as this alone cannot account for the observed high frequency of A----T substitutions there must be either a disproportionately high rate of A----T mutation in Drosophila mtDNA or selection bias for the products of A----T mutation. --Consideration of the frequencies of interchange of AGA and AGT codons in the corresponding D. yakuba and D. melanogaster mitochondrial protein genes provides strong support for the view that AGA specifies serine in the Drosophila mitochondrial genetic code.  相似文献   

2.
Summary The mRNA sequences of beta hemoglobin for human, mouse and rabbit were examined. Observations included the following: (1) there is a significant bias against the use of codons only one nucleotide different from terminating codons; (2) less than 4% of the codons end in adenine; (3), guanine is the most common third position nucleotide but it never follows a second position cytosine; (4) nearest neighbor (doublet) nucleotides are non-random with the greatest contributor to non-randomness being the third position suggesting that codon choice for a given amino acid rather than a choice among amino acids is the more important contributor; (5) the CG dinucleotide is even rarer in positions other than the first and second of the codon than it is in those two, suggesting that the need for arginine has in fact elevated the CG frequency in those positions; (6) 77 per cent of the nucleotides are unsubstituted among these three taxa, which could be a sampling effect, but there is strong evidence that about one-third of them are in fact unsubstitutable because of selective constrainsts; (7) the two longest stretches of unsubstituted nucleotides (32 and 35 consecutive nucleotides) surround the points of the two non-coding insertion sequences; (8) over half the substitutions occur in the third nucleotide position of the codons; (9) silent (non-amino acid changing) substitutions occur at about four times the rate of non-silent substitutions on the basis of their relative opportunity to occur; (10) silent substitutions occur slightly but significantly more often in codons that also have non-silent substitutions than independence of the two events would predict; (11) substitutions occur in adjacent nucleotides significantly more often than chance would predict; (12) among four-fold degenerate codons, third position transitions (principally cytosine-uracil interchanges) outnumber transversions by two to one although the reverse ratio would be expected.The analysis of these messengers provided an opportunity to evaluate the random evolutionary hit (REH) theory. I observed that: (1) the REH theory is premised upon five assumptions, all false; (2) the theory leads to contradictory estimates of the number of varions; (3) the REH values are underestimates; (4) the REH values frequently violate the triangle inequality; (5) the REH values, contrary to claim, are not concordant either with accepted point mutations (PAMs) or augmented distances; (6) the REH values are more likely than values uncorrected for multiple substitutions to give incorrect phylogenies; and (7) the REH values have statistical problems probably associated with a large variance in its fundamental parameter, re. From this I conclude that REH theory is not suitable for its intended purpose of estimating from protein sequences of nucleotide substitutions since the common ancestor of two gene products.  相似文献   

3.
The phylogeny of Greya Busck (Lepidoptera: Prodoxidae) was inferred from nucleotide sequence variation across a 765-bp region in the cytochrome oxidase I and II genes of the mitochondrial genome. Most parsimonious relationships of 25 haplotypes from 16 Greya species and two outgroup genera (Tetragma and Prodoxus) showed substantial congruence with the species relationships indicated by morphological variation. Differences between mitochondrial and morphological trees were found primarily in the positions of two species, G. variabilis and G. pectinifera, and in the branching order of the three major species groups in the genus. Conflicts between the data sets were examined by comparing levels of homoplasy in characters supporting alternative hypotheses. The phylogeny of Greya species suggests that host-plant association at the family level and larval feeding mode are conservative characters. Transition/transversion ratios estimated by reconstruction of nucleotide substitutions on the phylogeny had a range of 2.0-9.3, when different subsets of the phylogeny were used. The decline of this ratio with the increase in maximum sequence divergence among taxa indicates that transitions are masked by transversions along deeper internodes or long branches of the phylogeny. Among transitions, substitutions of A-->G and T-->C outnumbered their reciprocal substitutions by 2-6 times, presumably because of the approximately 4:1 (77%) A+T-bias in nucleotide base composition. Of all transversions, 73%-80% were A<-->T substitutions, 85% of which occurred at third positions of codons; these estimates did not decrease with an increase in maximum sequence divergence of taxa included in the analysis. The high frequency of A<-->T substitutions is either a reflection or an explanation of the 92% A+T bias at third codon positions.   相似文献   

4.
Summary We report and compare the DNA sequences of 14 silkmoth (Antheraea polyphemus) chorion genes, derived from either cDNA or chromosomal DNA clones. Seven of these genes are members of the A multigene family, and seven are members of the B family. Where available, the previously reported (Jones and Kafatos 1980) intronic and extragenic flanking DNA sequences are also considered. Closely related sequences are compared, revealing the types of spontaneous mutations that were fixed during paralogous evolution. Segmental mutations (i.e. mutations other than substitutions) are nearly always interpretable as small duplications or deletions. related to small direct repeats. Segmental mutations are strongly constrained in the coding regions, although they do occur. Nucleotide substitutions also appear to be under selective constraints: relatively few substitutions leading to amino acid replacements are accepted, silent substitutions leading to some codons (especially purine-terminated ones) are disfavored, and different compositional biases are maintained in different parts of the sequences. Other sequence differences can be interpreted as indicative of neutral drift, including most differences in non-coding regions and most T/C transitions in third-base positions. In the non-coding regions, which are thought to be only loosely constrained by selection, transitions are observed more frequently than might be expected: they account for 52% of all substitutions, and they appear to be favored two to threefold over transversions when allowance is made for the skewed base composition of these regions.  相似文献   

5.
The complete sequence of honeybee (Apis mellifera) mitochondrial DNA is reported being 16,343 bp long in the strain sequenced. Relative to their positions in the Drosophila map, 11 of the tRNA genes are in altered positions, but the other genes and regions are in the same relative positions. Comparisons of the predicted protein sequences indicate that the honeybee mitochondrial genetic code is the same as that for Drosophila; but the anticodons of two tRNAs differ between these two insects. The base composition shows extreme bias, being 84.9% AT (cf. 78.6% in Drosophila yakuba). In protein-encoding genes, the AT bias is strongest at the third codon positions (which in some cases lack guanines altogether), and least in second codon positions. Multiple stepwise regression analysis of the predicted products of the protein-encoding genes shows a significant association between the numbers of occurrences of amino acids and %T in codon family, but not with the number of codons per codon family or other parameters associated with codon family base composition. Differences in amino acid abundances are apparent between the predicted Apis and Drosophila proteins, with a relative abundance in the Apis proteins of lysine and a relative deficiency of alanine. Drosophila alanine residues are as often replaced by serine as conserved in Apis. The differences in abundances between Drosophila and Apis are associated with %AT in the codon families, and the degree of divergence in amino acid composition between proteins correlates with the divergence in %AT at the second codon positions. Overall, transversions are about twice as abundant as transitions when comparing Drosophila and Apis protein-encoding genes, but this ratio varies between codon positions. Marked excesses of transitions over chance expectation are seen for the third positions of protein-coding genes and for the gene for the small subunit of ribosomal RNA. For the third codon positions the excess of transitions is adequately explained as due to the restriction of observable substitutions to transitions for conserved amino acids with two-codon families; the excess of transitions over expectation for the small ribosomal subunit suggests that the conservation of nucleotide size is favored by selection.  相似文献   

6.
T Ohama  A Muto    S Osawa 《Nucleic acids research》1990,18(6):1565-1569
The GC (G + C, or G or C)-contents of codon silent positions in all two-codon sets and three codons AUY/A (IIe), and in most of the family boxes of Micrococcus luteus (genomic GC-content: 74%) are 95% to 100% in both the highly and weakly expressed genes. In some family boxes, there is a decrease in NNC codons and an increase in NNG codons from the highly expressed to weakly expressed genes without apparent involvement of NNU and NNA codons. From these observations, we conclude that the selective use of synonymous codons in M. luteus may be largely determined by GC-biased mutation pressure and that in the highly expressed genes tRNAs would act as a weak selection pressure in some family boxes. Available data suggest that the effect of selection pressure by tRNAs on the synonymous codon choice becomes more apparent in the highly expressed genes in eubacteria with intermediate GC-contents such as Escherichia coli and Bacillus subtilis, and that the U/C ratio of the codon third positions in NNU/C-type two-codon sets in the weakly expressed genes would represent the approximate magnitude of directional mutation pressure throughout eubacteria.  相似文献   

7.
Degeneracy in the genetic code is known to minimise the deleterious effects of the most frequent base substitutions: transitions at the third base of codons are generally synonymous substitutions. Transversions that alter degeneracy were reported by Rumer. Here the other transversions are shown to leave invariant degeneracy when applied to the first base of codons. As a summary, degeneracy is considered with respect to all three types of base substitutions, the transitions and the two types of transversions. The symmetries of degeneracy by base substitutions are independent of the representation of the genetic code and discussed with respect to the quasi-universality of the genetic code.  相似文献   

8.
Summary The DNA sequence was determined for the cytochrome c oxidase II (COII), tRNALys, and ATPase 8 genes from the mitochondrial genome of the meadow vole, Microtus pennsylvanicus. When compared to other rodents, three different patterns of evolutionary divergence were found. Nucleotide variation in tRNALys is concentrated in the TC loop. Nucleotide variation in the COII gene in three genera of rodents (Microtus, Mus, Rattus) consists predominantly of transitions in the third base positions of codons. The predicted amino acid sequence in highly conserved (>92% similarity). Analysis of the ATPase 8 gene among four genera (Microtus, Cricetulus, Mus, Rattus) revealed more detectable transversions than transitions, many fixed first and second position mutations, and considerable amino acid divergence. The rate of nucleotide substitution at nonsynonymous sites in the ATPase 8 gene is 10 times the rate in the COII gene. In contrast, the estimated absolute mutation rate as determined by analysis of nucleotide substitutions at fourfold degenerate sites probably is the same for the two genes. The primary sequences of the ATPase 8 and COII peptides are constrained differently, but each peptide is conserved in terms of predicted secondary-level configuration.  相似文献   

9.
Patterns of nucleotide substitution in pseudogenes and functional genes   总被引:26,自引:0,他引:26  
Summary The pattern of point mutations is inferred from nucleotide substitutions in pseudogenes. The pattern obtained suggests that transition mutations occur somewhat more frequently than transversion mutations and that mutations result more often in A or T than in G or C. Our results are discussed with respect to the predictions from Topal and Fresco's model for the molecular basis of point (substitution) mutations (Nature 263:285–289, 1976). The pattern of nucleotide substitution at the first and second positions of codons in functional genes is quite similar to that in pseudogenes, but the relative frequency of the transition CT in the sense strand is drastically reduced and those of the transversions CG and GC are doubled. The differences between the two patterns can be explained by the observation that in the protein evolution amino acid substitutions occur mainly between amino acids with similar biochemical properties (Grantham, Science 185:862–864, 1974). Our results for the patterns of nucleotide substitutions in pseudogenes and in functional genes lead to the prediction that both the coding and non-coding regions of protein coding genes should have high frequencies of A and T. Available data show that the non-coding regions are indeed high in A and T but the coding regions are low in T, though high in A.  相似文献   

10.
From investigation of eight Flavobacterium sp. genes encoding enzyme proteins, it was found that six genes had nonstop frames (NSFs) on the antisense strands, and base sequences of the genes are mainly composed of repeating triplet sequence(s), 5'-GNC-3' (where G and C are guanine and cytosine, and N is either of the four bases), in the reading frames. Thus, we concluded that the biased nucleotide sequences on the sense strands produce NSFs on the corresponding antisense strands. Furthermore, from the precise alignments of both nucleotide and amino acid sequences of two related Flavobacterium sp. genes, nyIB and nyIB', it was found that base replacements might have occurred symmetrically in the codons. That is, transversions between G and C were observed at high frequencies at the first and third positions of codons, but not at the second positions. At the first position, AG base transitions were observed much more than similar CT transitions, whereas CT transitions were found at the third positions at a relatively high frequency. These suggest that symmetrical base replacements in codons might be the main contribution to evolution in Flavobacterium sp. genes.  相似文献   

11.
Comparison of complete genome sequences for different variants of hepatitis C virus (HCV) reveals several different constraints on sequence change. Synonymous changes are suppressed in coding regions at both 5′ and 3′ ends of the genome. No evidence was found for the existence of alternative reading frames or for a lower mutation frequency in these regions. Instead, suppression may be due to constraints imposed by RNA secondary structures identified within the core and NS5b genes. Nonsynonymous substitutions are less frequent than synonymous ones except in the hypervariable region of E2 and, to a lesser extent, in E1, NS2, and NS5b. Transitions are more frequent than transversions, particularly at the third position of codons where the bias is 16:1. In addition, nucleotide substitutions may not occur symmetrically since there is a bias toward G or C at the third position of codons, while T ↔ C transitions were twice as frequent as A ↔ G transitions. These different biases do not affect the phylogenetic analysis of HCV variants but need to be taken into account in interpreting sequence change in longitudinal studies. Received: 9 September 1996 / Accepted: 20 April 1997  相似文献   

12.
Summary Chou-Fasman parameters, measuring preferences of each amino acid for different conformational regions in proteins, were used to obtain an amino acid difference index of conformational parameter distance (CPD) values. CPD values were found to be significantly lower for amino acid exchanges representing in the genetic code transitions of purines, GA than for exchanges representing either transitions of pyrimidines, CU, or transversions of purines and pyrimidines. Inasmuch as the distribution of CPD values in these non GA exchanges resembles that obtained for amino acid pairs with double or triple base differences in their underlying codons, we conclude that the genetic code was not particularly designed to minimize effects of mutation on protein conformation. That natural selection minimizes these changes, however, was shown by tabulating results obtained by the maximum parsimony method for eight protein genealogies with a total occurrence of 4574 base substitutions. At the beginning position of the codons GA transitions were in very great excess over other base substitutions, and, conversely, CU transitions were deficient. At the middle position of the codons only fast evolving proteins showed an excess of GA transitions, as though selection mainly preserved conformation in these proteins while weeding out mutations affecting chemical properties of functional sites in slow evolving proteins. In both fast and slow evolving proteins the net direction of transitions and transversions was found to be from G beginning codons to non-G beginning codons resulting in more commonly occurring amino acids, especially alanine with its generalized conformational properties, being replaced at suitable sites by amino acids with more specialized conformational and chemical properties. Historical circumstances pertaining to the origin of the genetic code and the nature of primordial proteins could account for such directional changes leading to increases in the functional density of proteins.In order to further explore the course of protein evolution, a modified parsimony algorithm was developed for constructing protein genealogies on the basis of minimum CPD length. The algorithm's ability to judge with finer discrimination that in protein evolution certain pathways of amino acid substitution should occur more readily than others was considered a potential advantage over strict maximum parsimony. In developing this CPD algorithm, the path of minimum CPD length through intermediate amino acids allowed by the genetic code for each pair of amino acids was determined. It was found that amino acid exchanges representing two base changes have a considerably lower average CPD value per base substitution than the amino acid exchanges representing single base changes. Amino acid exchanges representing three base changes have yet a further marked reduction in CPD per base change. This shows how extreme constraining effects of stabilizing selection can be circumvented, for by way of intermediate amino acids almost any amino acid can ultimately be substituted for another without damage to an evolving protein's conformation during the process.  相似文献   

13.
14.
Forbidden synonymous substitutions in coding regions   总被引:2,自引:0,他引:2  
In the evolution of highly conserved genes, a few "synonymous" substitutions at third bases that would not alter the protein sequence are forbidden or very rare, presumably as a result of functional requirements of the gene or the messenger RNA. Another 10% or 20% of codons are significantly less variable by synonymous substitution than are the majority of codons. The changes that occur at the majority of third bases are subject to codon usage restrictions. These usage restrictions control sequence similarities between very distant genes. For example, 70% of third bases are identical in calmodulin genes of man and trypanosome. Third-base similarities of distant genes for conserved proteins are mathematically predicted, on the basis of the G+C composition of third bases. These observations indicate the need for reexamination of methods used to calculate synonymous substitutions.   相似文献   

15.
The mutations of 76 haemophilia B patients representing the whole population registered with the Malm? haemophilia centre (42) and referrals from the UK, were characterised. RFLP haplotype analysis of the defective genes indicated that 51 single base pair substitutions were definitely of independent origin and 27 of these were CpG----TpG or CpA transitions. This represents a 38-fold excess over other single-base changes. Most of such transitions (82%) occur at 9 CpG sites occupying critical positions (transitions at 3 sites substitute essential arginines, while at 6 sites transition to TpG creates stop codons). Sixteen of the 18 possible transitions at these 9 sites cause clear haemophilia B and should be fully ascertained in our haemophilia B population. This allowed the direct estimate of the rate of CpG transitions. This is 1.05 x 10(-7) substitutions per base per gamete per generation. The marked excess of CpG transitions in haemophilia B appears partly due to the high proportion of CpG sites at critical positions (at least 9 out of 20). We propose that this follows from the fact that male hemizygosity makes X-linked genes particularly susceptible to selective forces that tend to fix CpG sites arising at critical positions.  相似文献   

16.
The nucleotide sequences of closely related members of a gene family can be used to investigate spontaneous mutations. Here we analyse the sequences of different yeast invertase genes which are more than 93% identical in the coding region and share some very similar, but not identical sequences in the noncoding flanking regions. Since all except one of the invertase genes are active, most of the base substitutions are silent. Within the coding region the base substitutions are unevenly distributed, indicating that parts of the genes were homogenized, probably via gene conversion. Transitions occurred more frequently than transversions in both, coding and noncoding regions. In the coding region pyrimidine transitions were the most abundant event due to silent changes mainly in the third codon position. In the noncoding region pyrimidine and purine transitions were found at equal frequencies. Transversions inverting base pairs (A-T and G-C) outnumber transversions changing base pairs (A-C and G-T). While the spectrum of mutations in the coding region is influenced by selective pressure to maintain the amino acid sequence, the spectrum in the noncoding region may be much less affected by selective pressure.  相似文献   

17.
18.
It is understood that DNA and amino acid substitution rates are highly sequence context-dependent, e.g., C --> T substitutions in vertebrates may occur much more frequently at CpG sites and that cysteine substitution rates may depend on support of the context for participation in a disulfide bond. Furthermore, many applications rely on quantitative models of nucleotide or amino acid substitution, including phylogenetic inference and identification of amino acid sequence positions involved in functional specificity. We describe quantification of the context dependence of nucleotide substitution rates using baboon, chimpanzee, and human genomic sequence data generated by the NISC Comparative Sequencing Program. Relative mutation rates are reported for the 96 classes of mutations of the form 5' alphabetagamma 3' --> 5' alphadeltagamma 3', where alpha, beta, gamma, and delta are nucleotides and beta not equal delta, based on maximum likelihood calculations. Our results confirm that C --> T substitutions are enhanced at CpG sites compared with other transitions, relatively independent of the identity of the preceding nucleotide. While, as expected, transitions generally occur more frequently than transversions, we find that the most frequent transversions involve the C at CpG sites (CpG transversions) and that their rate is comparable to the rate of transitions at non-CpG sites. A four-class model of the rates of context-dependent evolution of primate DNA sequences, CpG transitions > non-CpG transitions approximately CpG transversions > non-CpG transversions, captures qualitative features of the mutation spectrum. We find that despite qualitative similarity of mutation rates among different genomic regions, there are statistically significant differences.  相似文献   

19.
To study the rate and pattern of nucleotide substitution in mitochondrial DNA (mtDNA), we cloned and sequenced a 975-bp segment of mtDNA from Drosophila melanogaster, D. simulans, and D. mauritiana containing the genes for three transfer RNAs and parts of two protein- coding genes, ND2 and COI. Statistical analysis of synonymous substitutions revealed a predominance of transitions over transversions among the three species, a finding differing from previous results obtained from a comparison of D. melanogaster and D. yakuba. The number of transitions observed was nearly the same for each species comparison, including D. yakuba, despite the differences in divergence times. However, transversions seemed to increase steadily with increasing divergence time. By contrast, nonsynonymous substitutions in the ND2 gene showed a predominance of transversions over transitions. Most transversions were between A and T and seemed to be due to some kind of mutational bias to which the A + T-rich mtDNA of Drosophila species may be subject. The overall rate of nucleotide substitution in Drosophila mtDNA appears to be slightly faster (approximately 1.4 times) than that of the Adh gene. This contrasts with the result obtained for mammals, in which the mtDNA evolves approximately 10 times faster than single-copy nuclear DNA. We have also shown that the start codon of the COI gene is GTGA in D. simulans and GTAA in D. mauritiana. These codons are different from that of D. melanogaster (ATAA).   相似文献   

20.
Directed protein evolution is the most versatile method for studying protein structure-function relationships, and for tailoring a protein's properties to the needs of industrial applications. In this review, we performed a statistical analysis on the genetic code to study the extent and consequence of the organization of the genetic code on amino acid substitution patterns generated in directed evolution experiments. In detail, we analyzed amino acid substitution patterns caused by (a) a single nucleotide (nt) exchange at each position of all 64 codons, and (b) two subsequent nt exchanges (first and second nt, first and third nt, second and third nt). Additionally, transitions and transversions mutations were compared at the level of amino acid substitution patterns. The latter analysis showed that single nucleotide substitution in a codon generates only 39.5% of the natural diversity on the protein level with 5.2-7 amino acid substitutions per codon. Transversions generate more complex amino acid substitution patterns (increased number and chemically more diverse amino acid substitutions) than transitions. Simultaneous nt exchanges at both first and second nt of a codon generates very diverse amino acid substitution patterns, achieving 83.2% of the natural diversity. The statistical analysis described in this review sets the objectives for novel random mutagenesis methods that address the consequences of the organization of the genetic code. Random mutagenesis methods that favor transversions or introduce consecutive nt exchanges can contribute in this regard.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号