共查询到20条相似文献,搜索用时 15 毫秒
1.
Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theory 总被引:10,自引:0,他引:10
Ohta Tomoko 《Journal of molecular evolution》1995,40(1):56-63
The nearly neutral theory of molecular evolution predicts larger generation-time effects for synonymous than for nonsynonymous substitutions. This prediction is tested using the sequences of 49 single-copy genes by calculating the average and variance of synonymous and nonsynonymous substitutions in mammalian star phylogenies (rodentia, artiodactyla, and primates). The average pattern of the 49 genes supports the prediction of the nearly neutral theory, with some notable exceptions.The nearly neutral theory also predicts that the variance of the evolutionary rate is larger than the value predicted by the completely neutral theory. This prediction is tested by examining the dispersion index (ratio of the variance to the mean), which is positively correlated with the average substitution number. After weighting by the lineage effects, this correlation almost disappears for nonsynonymous substitutions, but not quite so for synonymous substitutions. After weighting, the dispersion indices of both synonymous and nonsynonymous substitutions still exceed values expected under the simple Poisson process. The results indicate that both the systematic bias in evolutionary rate among the lineages and the episodic type of rate variation are contributing to the large variance. The former is more significant to synonymous substitutions than to nonsynonymous substitutions. Isochore evolution may be similar to synonymous substitutions. The rate and pattern found here are consistent with the nearly neutral theory, such that the relative contributions of drift and selection differ between the two types of substitutions. The results are also consistent with Gillespie's episodic selection theory. 相似文献
2.
Rates of synonymous and nonsynonymous nucleotide substitutions and codon usage bias (ENC) were estimated for a number of
nuclear and chloroplast genes in a sample of centric and pennate diatoms. The results suggest that DNA evolution has taken
place, on an average, at a slower rate in the chloroplast genes than in the nuclear genes: a rate variation pattern similar
to that observed in land plants. Synonymous substitution rates in the chloroplast genes show a negative association with the
degree of codon usage bias, suggesting that genes with a higher degree of codon usage bias have evolved at a slower rate.
While this relationship has been shown in both prokaryotes and multicellular eukaryotes, it has not been demonstrated before
in diatoms.
Received: 3 June 1998 / Accepted: 11 August 1998 相似文献
3.
Synonymous substitution rates in mitochondrial and nuclear genes of Drosophila were compared. To make accurate comparisons, we considered the following: (1) relative synonymous rates, which do not require
divergence time estimates, should be used; (2) methods estimating divergence should take into account base composition; (3)
only very closely related species should be used to avoid effects of saturation; (4) the heterogeneity of rates should be
examined. We modified the methods estimating synonymous substitution numbers to account for base composition bias. By using
these methods, we found that mitochondrial genes have 1.7–3.4 times higher synonymous substitution rates than the fastest
nuclear genes or 4.5–9.0 times higher rates than the average nuclear genes. The average rate of synonymous transversions was
2.7 (estimated from the melanogaster species subgroup) or 2.9 (estimated from the obscura group) times higher in mitochondrial genes than in nuclear genes. Synonymous transversions in mitochondrial genes occurred
at an approximately equivalent rate to those in the fastest nuclear genes. This last result is not consistent with the hypothesis
that the difference in turnover rates between mitochondrial and nuclear genomes is the major factor determining higher synonymous
substitution rates in mtDNA. We conclude that the difference in synonymous substitution rates is due to a combination of two
factors: a higher transitional mutation rate in mtDNA and constraints on nuclear genes due to selection for codon usage.
Received: 27 November 1996 / Accepted: 8 May 1997 相似文献
4.
Berg OG 《Journal of molecular evolution》1999,48(4):398-407
The synonymous divergence between Escherichia coli and Salmonella typhimurium is explained in a model where there is a large variation between mutation rates at different nucleotide sites in the genome.
The model is based on the experimental observation that spontaneous mutation rates can vary over several orders of magnitude
at different sites in a gene. Such site-specific variation must be taken into account when studying synonymous divergence
and will result in an apparent saturation below the level expected from an assumption of uniform rates. Recently, it has been
suggested that codon preference in enterobacteria has a very large site-specific variation and that the synonymous divergence
between different species, e.g., E. coli and Salmonella, is saturated. In the present communication it is shown that when site-specific variation in mutation rates is introduced,
there is no need to invoke assumptions of saturation and a large variability in codon preference. The same rate variation
will also bring average mutation rates as estimated from synonymous sequence divergence into numerical agreement with experimental
values.
Received: 10 July 1998 / Accepted: 20 August 1998 相似文献
5.
To characterize the coding-sequence divergence of closely related genomes, we compared DNA sequence divergence between sequences
from a Brassica rapa ssp. pekinensis EST library isolated from flower buds and genomic sequences from Arabidopsis thaliana. The specific objectives were (i) to determine the distribution of and relationship between K
a and K
s, (ii) to identify genes with the lowest and highest K
a:K
s values, and (iii) to evaluate how codon usage has diverged between two closely related species. We found that the distribution
of K
a:K
s was unimodal, and that substitution rates were more variable at nonsynonymous than synonymous sites, and detected no evidence
that K
a and K
s were positively correlated. Several genes had K
a:K
s values equal to or near zero, as expected for genes that have evolved under strong selective constraint. In contrast, there
were no genes with K
a:K
s >1 and thus we found no strong evidence that any of the 218 sequences we analyzed have evolved in response to positive selection.
We detected a stronger codon bias but a lower frequency of GC at synonymous sites in A. thaliana than B. rapa. Moreover, there has been a shift in the profile of most commonly used synonymous codons since these two species diverged
from one another. This shift in codon usage may have been caused by stronger selection acting on codon usage or by a shift
in the direction of mutational bias in the B. rapa phylogenetic lineage. 相似文献
6.
de Miranda AB Alvarez-Valin F Jabbari K Degrave WM Bernardi G 《Journal of molecular evolution》2000,50(1):45-55
Mycobacterium tuberculosis and Mycobacterium leprae are the ethiological agents of tuberculosis and leprosy, respectively. After performing extensive comparisons between genes
from these two GC-rich bacterial species, we were able to construct a set of 275 homologous genes. Since these two bacterial
species also have a very low growth rate, translational selection could not be so determinant in their codon preferences as
it is in other fast-growing bacteria. Indeed, principal-components analysis of codon usage from this set of homologous genes
revealed that the codon choices in M. tuberculosis and M. leprae are correlated not only with compositional constraints and translational selection, but also with the degree of amino acid
conservation and the hydrophobicity of the encoded proteins. Finally, significant correlations were found between GC3 and synonymous distances as well as between synonymous and nonsynonymous distances.
Received: 30 October 1998 / Accepted: 16 August 1999 相似文献
7.
An Evaluation of Measures of Synonymous Codon Usage Bias 总被引:14,自引:0,他引:14
Synonymous codons are not generally used at equal frequencies, and this trend is observed for most genes and organisms. Several
methods have been proposed and used to estimate the degree of the nonrandom use of the different synonymous codons. The estimates
obtained by these methods, however, show different levels of both precision and dispersion when coding regions of a finite
number of codons are under analysis. Here, we present a study, based on computer simulation, of how the different methods
proposed to evaluate the nonrandom use of synonymous codons are affected by the length of the coding region analyzed. The
results show that some of these methods are heavily influenced by the number of codons and that the comparison of codon usage
bias between coding regions of different lengths shows a methodological bias under different conditions of nonrandom use of
synonymous codons. The study of the dispersion of the estimates obtained by the different methods gives, on the other hand,
an indication of the methods to be applied to compare values of codon usage bias among coding regions of equivalent length.
Received: 10 September 1997 / Accepted: 23 March 1998 相似文献
8.
Kazuhisa Tsunoyama Matthew I. Bellgard Takashi Gojobori 《Journal of molecular evolution》2001,53(4-5):456-464
It has been observed that synonymous substitution rates vary among genes in various organisms, although the cause of the
variation is unresolved. At the intragenic level, however, the variation of synonymous substitutions is somewhat controversial.
By developing a rigorous statistical test and applying the test to 418 homologous gene pairs between mouse and rat, we found
that more than 90% of gene pairs showed a statistical significance in intragenic variation of synonymous substitution rates.
Moreover, by examining all conceivable possibilities for the cause of the variation, we successfully found that intragenic
variation of synonymous substitutions in mammalian genes is caused mainly by a nonrandom mutation due to the methylation of
CpG dinucleotides rather than by functional constraints.
Received: 12 January 2001 / Accepted: 28 February 2001 相似文献
9.
Fernando Alvarez-Valin Kamel Jabbari Giorgio Bernardi 《Journal of molecular evolution》1998,46(1):37-44
Previous investigations indicated that synonymous and nonsynonymous substitution rates are correlated in mammalian genes.
In the present work, this correlation has been studied at the intragenic level using a dataset of 48 orthologous genes from
species belonging to at least four different mammalian orders. The results obtained show that the intragenic variability in
synonymous rates is correlated with that of nonsynonymous rates. Moreover, the variation in GC level (and especially of C
level) of silent positions along each gene is correlated with the variation in synonymous rate. These results reinforce the
previous conclusions that synonymous and nonsynonymous rates as well as GC levels of silent positions are to some extent under
common selective constraints.
Received: 10 July 1997 / Accepted: 13 August 1997 相似文献
10.
Richard M. Kliman 《Journal of molecular evolution》1999,49(3):343-351
Evidence from a variety of sources indicates that selection has influenced synonymous codon usage in Drosophila. It has generally been difficult, however, to distinguish selection that acted in the distant past from ongoing selection.
However, under a neutral model, polymorphisms usually reflect more recent mutations than fixed differences between species
and may, therefore, be useful for inferring recent selection. If the ancestral state is preferred, selection should shift
the frequency distribution of derived states/site toward lower values; if the ancestral is unpreferred, selection should increase
the number of derived states/site. Polymorphisms were classified as ancestrally preferred or unpreferred for several genes
of D. simulans and D. melanogaster. A computer simulation of coalescence was employed to derive the expected frequency distributions of derived states/site under
various modifications of the Wright–Fisher neutral model, and distributions of test statistics (t and Mann–Whitney U) were derived by appropriate sampling. One-tailed tests were applied to transformed frequency data to assess whether the
two frequency distributions deviated from neutral expectations in the direction predicted by selection on codon usage. Several
genes from D. simulans appear to be subject to recent selection on synonymous codons, including one gene with low codon bias, esterase-6. Selection may also be acting in D. melanogaster.
Received: 15 April 1998 / Accepted: 13 May 1999 相似文献
11.
In many unicellular organisms, invertebrates, and plants, synonymous codon usage biases result from a coadaptation between
codon usage and tRNAs abundance to optimize the efficiency of protein synthesis. However, it remains unclear whether natural
selection acts at the level of the speed or the accuracy of mRNAs translation. Here we show that codon usage can improve the
fidelity of protein synthesis in multicellular species. As predicted by the model of selection for translational accuracy,
we find that the frequency of codons optimal for translation is significantly higher at codons encoding for conserved amino
acids than at codons encoding for nonconserved amino acids in 548 genes compared between Caenorhabditis elegans and Homo sapiens. Although this model predicts that codon bias correlates positively with gene length, a negative correlation between codon
bias and gene length has been observed in eukaryotes. This suggests that selection for fidelity of protein synthesis is not
the main factor responsible for codon biases. The relationship between codon bias and gene length remains unexplained. Exploring
the differences in gene expression process in eukaryotes and prokaryotes should provide new insights to understand this key
question of codon usage.
Received: 18 June 2000 / Accepted: 10 November 2000 相似文献
12.
A survey of the patterns of synonymous codon preference in the HIV env gene reveals a correlation between the codon bias and the mutability requirements of different regions of the protein. At
hypervariable regions in gp120 one finds a greater proportion of codons that tend to mutate nonsynonymously, but to a target
that is similar in hydrophobicity and volume. We argue that this strategy results from a compromise between the selective
pressure placed on the virus by the induced immune response, which favors amino acid substitutions in the complementarity
determining regions, and the negative selection against missense mutations that violate structural constraints of the env protein.
Received: 9 June 1997 / Accepted: 25 May 1998 相似文献
13.
Pavesi A 《Journal of molecular evolution》2000,50(3):284-295
In viruses an increased coding ability is provided by overlapping genes, in which two alternative open reading frames (ORFs)
may be translated to yield two distinct proteins. The identification of signature sequences in overlapping genes is a topic
of particular interest, since additional out-of-frame coding regions can be nested within known genes. In this work, a novel
feature peculiar to overlapping coding regions is presented. It was detected by analysis of a sample set of 21 virus genomic
sequences and consisted in the repeated occurrence of a cluster of basic amino acid residues, encoded by a frame, combined
to a stretch of acidic residues, encoded by the corresponding overlapping frame. A computer scan of an additional set of virus
sequences demonstrated that this feature is common to several other known overlapping ORFs and led to prediction of a novel
overlapping gene in hepatitis G virus (HGV). The occurrence of a bifunctional coding region in HGV was also supported by its
extremely lower rate of synonymous nucleotide substitutions compared to that observed in the other gene regions of the HGV
genome. Analysis of the amino acid sequence that was deduced from the putative overlapping gene revealed a high content of
basic residues and the presence of a nuclear targeting signal; these characteristics suggest that a core-like protein may
be expressed by this novel ORF.
Received: 21 July 1999 / Accepted: 26 October 1999 相似文献
14.
Agarwal SM 《Biochemical and biophysical research communications》2005,337(4):1192-1197
The present study examines 783 human-mouse orthologous gene pairs for their pattern of sequence evolution, contrasting mammalia, eukaryota, coelomata, and bilateria specific human intronless genes. Such comparisons may be of use in understanding the general evolution of human genome. Evolutionary rate analyses indicate that mammalia specific human intronless genes are evolving faster as compared to other intronless genes specific to eukaryotic lineage, indicating towards their rapid evolution. The observations indicates that the genes conserved in eukaryota, coelomata, and bilateria, that is, proteins that arose earlier in evolution as compared to mammalia specific genes evolve slowly and are subjected to negative selection. The cause underlying rate variations was also explored. Although mutational bias might slightly fasten the nonsynonymous rates in mammalia specific genes, it is unlikely to be major cause of rate difference between the various categories. Furthermore, rate of divergence of mammalia specific intronless genes has been related to functional classification using the protein family annotation. Protein function was found in some cases to have larger impact on the rate of evolution of genes. Also, the codon usage pattern of mammalia specific intronless genes do not seem to differ much from those of other intronless genes conserved solely in eukaryotic lineage. 相似文献
15.
Fernando Alvarez-Valin Kamel Jabbari Nicolas Carels Giorgio Bernardi 《Journal of molecular evolution》1999,49(3):330-342
In this work, we have investigated the relationships between synonymous and nonsynonymous rates and base composition in coding
sequences from Gramineae to analyze the factors underlying the variation in substitutional rates. We have shown that in these genes the rates of nucleotide
divergence, both synonymous and nonsynonymous, are, to some extent, dependent on each other and on the base composition. In
the first place, the variation in nonsynonymous rate is related to the GC level at the second codon position (the higher the
GC2 level, the higher the amino acid replacement rate). The correlation is especially strong with T2, the coefficients being significant in the three data sets analyzed. This correlation between nonsynonymous rate and base
composition at the second codon position is also detectable at the intragenic level, which implies that the factors that tend
to increase the intergenic variance in nonsynonymous rates also affect the intragenic variance. On the other hand, we have
shown that the synonymous rate is strongly correlated with the GC3 level. This correlation is observed both across genes and at the intragenic level. Similarly, the nonsynonymous rate is also
affected at the intragenic level by GC3 level, like the silent rate. In fact, synonymous and nonsynonymous rates exhibit a parallel behavior in relation to GC3 level, indicating that the intragenic patterns of both silent and amino acid divergence rates are influenced in a similar
way by the intragenic variation of GC3. This result, taken together with the fact that the number of genes displaying intragenic correlation coefficients between
synonymous and nonsynonymous rates is not very high, but higher than random expectation (in the three data sets analyzed),
strongly suggests that the processes of silent and amino acid replacement divergence are, at least in part, driven by common
evolutionary forces in genes from Gramineae.
Received: 2 July 1998 / Accepted: 18 April 1999 相似文献
16.
Maximum Likelihood Estimation on Large Phylogenies and Analysis of Adaptive Evolution in Human Influenza Virus A 总被引:1,自引:0,他引:1
Yang Z 《Journal of molecular evolution》2000,51(5):423-432
Algorithmic details to obtain maximum likelihood estimates of parameters on a large phylogeny are discussed. On a large tree,
an efficient approach is to optimize branch lengths one at a time while updating parameters in the substitution model simultaneously.
Codon substitution models that allow for variable nonsynonymous/synonymous rate ratios (ω=d
N/d
S) among sites are used to analyze a data set of human influenza virus type A hemagglutinin (HA) genes. The data set has 349
sequences. Methods for obtaining approximate estimates of branch lengths for codon models are explored, and the estimates
are used to test for positive selection and to identify sites under selection. Compared with results obtained from the exact
method estimating all parameters by maximum likelihood, the approximate methods produced reliable results. The analysis identified
a number of sites in the viral gene under diversifying Darwinian selection and demonstrated the importance of including many
sequences in the data in detecting positive selection at individual sites.
Received: 25 April 2000 / Accepted: 24 July 2000 相似文献
17.
A+T content, phylogenetic relationships, codon usage, evolutionary rates, and ratio of synonymous versus non-synonymous substitutions
have been studied in partial sequences of the atpD and aroQ/pheA genes of primary (Buchnera) and secondary symbionts of aphids and a set of selected non-symbiotic bacteria, belonging to the five subdivisions of the
Proteobacteria. Compared to the homologous genes of the last group, both genes belonging to Buchnera behave in a similar way, showing a higher A+T content, forming a monophyletic group, a loss in codon bias, especially in
third base position, an evolutionary acceleration and an increase in the number of non-synonymous substitutions, confirming
previous results reported elsewhere for other genes. When available, these properties have been partly observed with the secondary
symbionts, but with values that are intermediate between Buchnera and free living Proteobacteria. They show high A+T content, but not as high as Buchnera, a non-solved phylogenetic position between Buchnera, and the other γ-Proteobacteria, a loss in codon bias, again not as high as in Buchnera and a significant evolutionary acceleration in the case of the three atpD genes, but not when considering aroQ/pheA genes. These results give support to the hypothesis that they are symbionts at different stages of the symbiotic accommodation
to the host. 相似文献
18.
Pavesi A 《Journal of molecular evolution》2001,53(2):104-113
The GB virus C/hepatitis G virus (GBV-C/HGV) is a newly identified human RNA virus, belonging to the Flaviviridae family. Persistent infection by GBV-C/HGV is common in humans, and genetically divergent isolates have been identified in
different parts of the world. Due to the absence of a real pathogenic role of GBV-C/HGV in liver disease and its extremely
low mutation rate, this virus is a potential marker to trace prehistoric links between human populations. In this study, origin
and evolution of GBV-C/HGV were examined using a set of fully sequenced strains of worldwide origin. A first phylogenetic
analysis, addressed to the short (255 nucleotides) NS5A overlapping coding region by the neighbor-joining method, suggested
an ancient African origin of GBV-C/HGV. This notion was confirmed when the same analysis was applied to the genomic regions
showing the lowest rate of synonymous substitutions, covering one-fourth (2184 nucleotides) of the total coding potential
of the virus genome. By using a multivariate statistical method and extending the analysis to the complete coding region,
fine details of the evolutionary history of GBV-C/HGV were further elucidated. By this approach, isolates from Southeast Asia
appeared to be the most closely related to those of African origin, consistent with a major route of ancient human migrations
from Africa to southeastern parts of the Asian continent.
Received: 26 October 2000 / Accepted: 28 February 2001 相似文献
19.
Yasuo Ina 《Journal of molecular evolution》1998,46(5):521-533
A simple method for estimating the transition/transversion ratio was developed. This method can be applied to not only two
sequences but also more than two sequences. The statistical properties of the method and some other methods were examined
by numerical computation and computer simulation. The results obtained showed that, in terms of bias and variance, the new
method gives a better estimate of the transition/transversion ratio than do the other examined methods. The new method was
applied to human and chimpanzee mitochondrial control region sequences.
Received: 22 September 1997 / Accepted: 1 November 1997 相似文献
20.
We consider a model of the origin of genetic code organization incorporating the biosynthetic relationships between amino acids and their physicochemical properties. We study the behavior of the genetic code in the set of codes subject both to biosynthetic constraints and to the constraint that the biosynthetic classes of amino acids must occupy only their own codon domain, as observed in the genetic code. Therefore, this set contains the smallest number of elements ever analyzed in similar studies. Under these conditions and if, as predicted by physicochemical postulates, the amino acid properties played a fundamental role in genetic code organization, it can be expected that the code must display an extremely high level of optimization. This prediction is not supported by our analysis, which indicates, for instance, a minimization percentage of only 80%. These observations can therefore be more easily explained by the coevolution theory of genetic code origin, which postulates a role that is important but not fundamental for the amino acid properties in the structuring of the code. We have also investigated the shape of the optimization landscape that might have arisen during genetic code origin. Here, too, the results seem to favor the coevolution theory because, for instance, the fact that only a few amino acid exchanges would have been sufficient to transform the genetic code (which is not a local minimum) into a much better optimized code, and that such exchanges did not actually take place, seems to suggest that, for instance, the reduction of translation errors was not the main adaptive theme structuring the genetic code. 相似文献