首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
R Nielsen  D M Weinreich 《Genetics》1999,153(1):497-506
McDonald/Kreitman tests performed on animal mtDNA consistently reveal significant deviations from strict neutrality in the direction of an excess number of polymorphic nonsynonymous sites, which is consistent with purifying selection acting on nonsynonymous sites. We show that under models of recurrent neutral and deleterious mutations, the mean age of segregating neutral mutations is greater than the mean age of segregating selected mutations, even in the absence of recombination. We develop a test of the hypothesis that the mean age of segregating synonymous mutations equals the mean age of segregating nonsynonymous mutations in a sample of DNA sequences. The power of this age-of-mutation test and the power of the McDonald/Kreitman test are explored by computer simulations. We apply the new test to 25 previously published mitochondrial data sets and find weak evidence for selection against nonsynonymous mutations.  相似文献   

2.
There is now a wealth of evidence that some of the most important regions of the genome are found outside those that encode proteins, and noncoding regions of the genome have been shown to be subject to substantial levels of selective constraint, particularly in Drosophila. Recent work has suggested that these regions may also have been subject to the action of positive selection, with large fractions of noncoding divergence having been driven to fixation by adaptive evolution. However, this work has focused on Drosophila melanogaster, which is thought to have experienced a reduction in effective population size (N(e)), and thus a reduction in the efficacy of selection, compared with its closest relative Drosophila simulans. Here, we examine patterns of evolution at several classes of noncoding DNA in D. simulans and find that all noncoding DNA is subject to the action of negative selection, indicated by reduced levels of polymorphism and divergence and a skew in the frequency spectrum toward rare variants. We find that the signature of negative selection on noncoding DNA and nonsynonymous sites is obscured to some extent by purifying selection acting on preferred to unpreferred synonymous codon mutations. We investigate the extent to which divergence in noncoding DNA is inferred to be the product of positive selection and to what extent these inferences depend on selection on synonymous sites and demography. Based on patterns of polymorphism and divergence for different classes of synonymous substitution, we find the divergence excess inferred in noncoding DNA and nonsynonymous sites in the D. simulans lineage difficult to reconcile with demographic explanations.  相似文献   

3.
Approximate methods for estimating the numbers of synonymous and nonsynonymous substitutions between two DNA sequences involve three steps: counting of synonymous and nonsynonymous sites in the two sequences, counting of synonymous and nonsynonymous differences between the two sequences, and correcting for multiple substitutions at the same site. We examine complexities involved in those steps and propose a new approximate method that takes into account two major features of DNA sequence evolution: transition/transversion rate bias and base/codon frequency bias. We compare the new method with maximum likelihood, as well as several other approximate methods, by examining infinitely long sequences, performing computer simulations, and analyzing a real data set. The results suggest that when there are transition/transversion rate biases and base/codon frequency biases, previously described approximate methods for estimating the nonsynonymous/synonymous rate ratio may involve serious biases, and the bias can be both positive and negative. The new method is, in general, superior to earlier approximate methods and may be useful for analyzing large data sets, although maximum likelihood appears to always be the method of choice.  相似文献   

4.
Within-patient HIV populations evolve rapidly because of a high mutation rate, short generation time, and strong positive selection pressures. Previous studies have identified "consistent patterns" of viral sequence evolution. Just before HIV infection progresses to AIDS, evolution seems to slow markedly, and the genetic diversity of the viral population drops. This evolutionary slowdown could be caused either by a reduction in the average viral replication rate or because selection pressures weaken with the collapse of the immune system. The former hypothesis (which we denote "cellular exhaustion") predicts a simultaneous reduction in both synonymous and nonsynonymous evolution, whereas the latter hypothesis (denoted "immune relaxation") predicts that only nonsynonymous evolution will slow. In this paper, we present a set of statistical procedures for distinguishing between these alternative hypotheses using DNA sequences sampled over the course of infection. The first component is a new method for estimating evolutionary rates that takes advantage of the temporal information in longitudinal DNA sequence samples. Second, we develop a set of probability models for the analysis of evolutionary rates in HIV populations in vivo. Application of these models to both synonymous and nonsynonymous evolution affords a comparison of the cellular-exhaustion and immune-relaxation hypotheses. We apply the procedures to longitudinal data sets in which sequences of the env gene were sampled over the entire course of infection. Our analyses (1) statistically confirm that an evolutionary slowdown occurs late in infection, (2) strongly support the immune-relaxation hypothesis, and (3) indicate that the cessation of nonsynonymous evolution is associated with disease progression.  相似文献   

5.
We examined rates of DNA sequence evolution in 12 populations of Escherichia coli propagated in a glucose minimal medium for 20,000 generations. Previous work saw mutations mediated by mobile elements in these populations, but the extent of other genomic changes was not investigated. Four of the populations evolved defects in DNA repair and became mutators. Some 500 bp was sequenced in each of 36 genes for 50 clones, including 2 ancestral variants, 2 clones from each population at generation 10,000, and 2 from each at generation 20,000. Ten mutations were found in total, all point mutations including mostly synonymous substitutions and nonsynonymous polymorphisms; all 10 were found in mutator populations. We compared the observed sequence evolution to predictions based on different scenarios. The number of synonymous substitutions is lower than predicted from measured mutation rates in E. coli, but the number is higher than rates based on comparing E. coli and Salmonella genomes. Extrapolating to the entire genome, these data predict about 250 synonymous substitutions on average per mutator population, but only about 3 synonymous substitutions per nonmutator population, during 20,000 generations. These data illustrate the challenge of finding sequence variation among bacterial isolates that share such a recent ancestor. However, this limited variation also provides a useful baseline for research aimed at finding the beneficial substitutions in these populations.  相似文献   

6.
Yang Z  Nielsen R  Goldman N  Pedersen AM 《Genetics》2000,155(1):431-449
Comparison of relative fixation rates of synonymous (silent) and nonsynonymous (amino acid-altering) mutations provides a means for understanding the mechanisms of molecular sequence evolution. The nonsynonymous/synonymous rate ratio (omega = d(N)d(S)) is an important indicator of selective pressure at the protein level, with omega = 1 meaning neutral mutations, omega < 1 purifying selection, and omega > 1 diversifying positive selection. Amino acid sites in a protein are expected to be under different selective pressures and have different underlying omega ratios. We develop models that account for heterogeneous omega ratios among amino acid sites and apply them to phylogenetic analyses of protein-coding DNA sequences. These models are useful for testing for adaptive molecular evolution and identifying amino acid sites under diversifying selection. Ten data sets of genes from nuclear, mitochondrial, and viral genomes are analyzed to estimate the distributions of omega among sites. In all data sets analyzed, the selective pressure indicated by the omega ratio is found to be highly heterogeneous among sites. Previously unsuspected Darwinian selection is detected in several genes in which the average omega ratio across sites is <1, but in which some sites are clearly under diversifying selection with omega > 1. Genes undergoing positive selection include the beta-globin gene from vertebrates, mitochondrial protein-coding genes from hominoids, the hemagglutinin (HA) gene from human influenza virus A, and HIV-1 env, vif, and pol genes. Tests for the presence of positively selected sites and their subsequent identification appear quite robust to the specific distributional form assumed for omega and can be achieved using any of several models we implement. However, we encountered difficulties in estimating the precise distribution of omega among sites from real data sets.  相似文献   

7.
The rate of molecular evolution can vary among lineages. Sources of this variation have differential effects on synonymous and nonsynonymous substitution rates. Changes in effective population size or patterns of natural selection will mainly alter nonsynonymous substitution rates. Changes in generation length or mutation rates are likely to have an impact on both synonymous and nonsynonymous substitution rates. By comparing changes in synonymous and nonsynonymous rates, the relative contributions of the driving forces of evolution can be better characterized. Here, we introduce a procedure for estimating the chronological rates of synonymous and nonsynonymous substitutions on the branches of an evolutionary tree. Because the widely used ratio of nonsynonymous and synonymous rates is not designed to detect simultaneous increases or simultaneous decreases in synonymous and nonsynonymous rates, the estimation of these rates rather than their ratio can improve characterization of the evolutionary process. With our Bayesian approach, we analyze cytochrome oxidase subunit I evolution in primates and infer that nonsynonymous rates have a greater tendency to change over time than do synonymous rates. Our analysis of these data also suggests that rates have been positively correlated.  相似文献   

8.
C A Wise  M Sraml  S Easteal 《Genetics》1998,148(1):409-421
To test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution, nucleotide sequences were determined for the 1041 bp of the NADH dehydrogenase subunit 2 (ND2) gene in 20 geographically diverse humans and 20 common chimpanzees. Contingency tests of neutrality were performed using four mutational categories for the ND2 molecule: synonymous and nonsynonymous mutations in the transmembrane regions, and synonymous and nonsynonymous mutations in the surface regions. The following three topological mutational categories were also used: intraspecific tips, intraspecific interiors, and interspecific fixed differences. The analyses reveal a significantly greater number of nonsynonymous polymorphisms within human transmembrane regions than expected based on interspecific comparisons, and they are inconsistent with a neutral equilibrium model. This pattern of excess nonsynonymous polymorphism is not seen within chimpanzees. Statistical tests of neutrality, such as TAJIMA''s D test, and the D and F tests proposed by FU and LI, indicate an excess of low frequency polymorphisms in the human data, but not in the chimpanzee data. This is consistent with recent directional selection, a population bottleneck or background selection of slightly deleterious mutations in human mtDNA samples. The analyses further support the idea that mitochondrial genome evolution is governed by selective forces that have the potential to affect its use as a "neutral" marker in evolutionary and population genetic studies.  相似文献   

9.
The use of codon substitution models to compare synonymous and nonsynonymous substitution rates is a widely used approach to detecting positive Darwinian selection affecting protein evolution. However, in several recent papers, Hughes and colleagues claim that codon-based likelihood-ratio tests (LRTs) are logically flawed as they lack prior hypotheses and fail to accommodate random fluctuations in synonymous and nonsynonymous substitutions Friedman and Hughes (2007) also used site-based LRTs to analyze 605 gene families consisting of human and mouse paralogues. They found that the outcome of the tests was largely determined by irrelevant factors such as the GC content at the third codon positions and the synonymous rate d(S), but not by the nonsynonymous rate d(N) or the d(N)/d(S) ratio, factors that should be related to selection. Here, we reanalyze those data. Contra Friedman and Hughes, we found that the test results are related to sequence length and the average d(N)/d(S) ratio. We examine the criticisms of Hughes and suggest that they are based on misunderstandings of the codon models and on statistical errors. Our analyses suggest that codon-based tests are useful tools for comparative analysis of genomic data sets.  相似文献   

10.
The role of selection in the evolution of human mitochondrial genomes   总被引:27,自引:0,他引:27  
High mutation rate in mammalian mitochondrial DNA generates a highly divergent pool of alleles even within species that have dispersed and expanded in size recently. Phylogenetic analysis of 277 human mitochondrial genomes revealed a significant (P < 0.01) excess of rRNA and nonsynonymous base substitutions among hotspots of recurrent mutation. Most hotspots involved transitions from guanine to adenine that, with thymine-to-cytosine transitions, illustrate the asymmetric bias in codon usage at synonymous sites on the heavy-strand DNA. The mitochondrion-encoded tRNAThr varied significantly more than any other tRNA gene. Threonine and valine codons were involved in 259 of the 414 amino acid replacements observed. The ratio of nonsynonymous changes from and to threonine and valine differed significantly (P = 0.003) between populations with neutral (22/58) and populations with significantly negative Tajima's D values (70/76), independent of their geographic location. In contrast to a recent suggestion that the excess of nonsilent mutations is characteristic of Arctic populations, implying their role in cold adaptation, we demonstrate that the surplus of nonsynonymous mutations is a general feature of the young branches of the phylogenetic tree, affecting also those that are found only in Africa. We introduce a new calibration method of the mutation rate of synonymous transitions to estimate the coalescent times of mtDNA haplogroups.  相似文献   

11.
We estimated the intensity of selection on preferred codons in Drosophila pseudoobscura and D. miranda at X-linked and autosomal loci, using a published data set on sequence variability at 67 loci, by means of an improved method that takes account of demographic effects. We found evidence for stronger selection at X-linked loci, consistent with their higher levels of codon usage bias. The estimates of the strength of selection and mutational bias in favor of unpreferred codons were similar to those found in other species, after taking into account the fact that D. pseudoobscura showed evidence for a recent expansion in population size. We examined correlates of synonymous and nonsynonymous diversity in these species and found no evidence for effects of recurrent selective sweeps on nonsynonymous mutations, which is probably because this set of genes have much higher than average levels of selective constraints. There was evidence for correlated effects of levels of selective constraints on protein sequences and on codon usage, as expected under models of selection for translational accuracy. Our analysis of a published data set on D. melanogaster provided evidence for the effects of selective sweeps of nonsynonymous mutations on linked synonymous diversity, but only in the subset of loci that experienced the highest rates of nonsynonymous substitutions (about one-quarter of the total) and not at more slowly evolving loci. Our correlational analysis of this data set suggested that both selective constraints on protein sequences and recurrent selective sweeps affect the overall level of codon usage.  相似文献   

12.
We consider three approaches for estimating the rates of nonsynonymous and synonymous changes at each site in a sequence alignment in order to identify sites under positive or negative selection: (1) a suite of fast likelihood-based "counting methods" that employ either a single most likely ancestral reconstruction, weighting across all possible ancestral reconstructions, or sampling from ancestral reconstructions; (2) a random effects likelihood (REL) approach, which models variation in nonsynonymous and synonymous rates across sites according to a predefined distribution, with the selection pressure at an individual site inferred using an empirical Bayes approach; and (3) a fixed effects likelihood (FEL) method that directly estimates nonsynonymous and synonymous substitution rates at each site. All three methods incorporate flexible models of nucleotide substitution bias and variation in both nonsynonymous and synonymous substitution rates across sites, facilitating the comparison between the methods. We demonstrate that the results obtained using these approaches show broad agreement in levels of Type I and Type II error and in estimates of substitution rates. Counting methods are well suited for large alignments, for which there is high power to detect positive and negative selection, but appear to underestimate the substitution rate. A REL approach, which is more computationally intensive than counting methods, has higher power than counting methods to detect selection in data sets of intermediate size but may suffer from higher rates of false positives for small data sets. A FEL approach appears to capture the pattern of rate variation better than counting methods or random effects models, does not suffer from as many false positives as random effects models for data sets comprising few sequences, and can be efficiently parallelized. Our results suggest that previously reported differences between results obtained by counting methods and random effects models arise due to a combination of the conservative nature of counting-based methods, the failure of current random effects models to allow for variation in synonymous substitution rates, and the naive application of random effects models to extremely sparse data sets. We demonstrate our methods on sequence data from the human immunodeficiency virus type 1 env and pol genes and simulated alignments.  相似文献   

13.
We develop a new model for studying the molecular evolution of protein-coding DNA sequences. In contrast to existing models, we incorporate the potential for site-to-site heterogeneity of both synonymous and nonsynonymous substitution rates. We demonstrate that within-gene heterogeneity of synonymous substitution rates appears to be common. Using the new family of models, we investigate the utility of a variety of new statistical inference procedures, and we pay particular attention to issues surrounding the detection of sites undergoing positive selection. We discuss how failure to model synonymous rate variation in the model can lead to misidentification of sites as positively selected.  相似文献   

14.
We present an approach for identifying genes under natural selection using polymorphism and divergence data from synonymous and non-synonymous sites within genes. A generalized linear mixed model is used to model the genome-wide variability among categories of mutations and estimate its functional consequence. We demonstrate how the model''s estimated fixed and random effects can be used to identify genes under selection. The parameter estimates from our generalized linear model can be transformed to yield population genetic parameter estimates for quantities including the average selection coefficient for new mutations at a locus, the synonymous and non-synynomous mutation rates, and species divergence times. Furthermore, our approach incorporates stochastic variation due to the evolutionary process and can be fit using standard statistical software. The model is fit in both the empirical Bayes and Bayesian settings using the lme4 package in R, and Markov chain Monte Carlo methods in WinBUGS. Using simulated data we compare our method to existing approaches for detecting genes under selection: the McDonald-Kreitman test, and two versions of the Poisson random field based method MKprf. Overall, we find our method universally outperforms existing methods for detecting genes subject to selection using polymorphism and divergence data.  相似文献   

15.
Many organisms exhibit biased codon usage in their genome, including the fungal model organism Neurospora crassa. The preferential use of subset of synonymous codons (optimal codons) at the macroevolutionary level is believed to result from a history of selection to promote translational efficiency. At present, few data are available about selection on optimal codons at the microevolutionary scale, that is, at the population level. Herein, we conducted a large-scale assessment of codon mutations at biallelic sites, spanning more than 5,100 genes, in 2 distinct populations of N. crassa: the Caribbean and Louisiana populations. Based on analysis of the frequency spectra of synonymous codon mutations at biallelic sites, we found that derived (nonancestral) optimal codon mutations segregate at a higher frequency than derived nonoptimal codon mutations in each population; this is consistent with natural selection favoring optimal codons. We also report that optimal codon variants were less frequent in longer genes and that the fixation of optimal codons was reduced in rapidly evolving long genes/proteins, trends suggestive of genetic hitchhiking (Hill-Robertson) altering codon usage variation. Notably, nonsynonymous codon mutations segregated at a lower frequency than synonymous nonoptimal codon mutations (which impair translational efficiency) in each N. crassa population, suggesting that changes in protein composition are more detrimental to fitness than mutations altering translation. Overall, the present data demonstrate that selection, and partly genetic interference, shapes codon variation across the genome in N. crassa populations.  相似文献   

16.
Evolutionary forces like Hill-Robertson interference and negative epistasis can lead to deleterious mutations being found on distinct haplotypes. However, the extent to which these forces depend on the selection and dominance coefficients of deleterious mutations and shape genome-wide patterns of linkage disequilibrium (LD) in natural populations with complex demographic histories has not been tested. In this study, we first used forward-in-time simulations to predict how negative selection impacts LD. Under models where deleterious mutations have additive effects on fitness, deleterious variants less than 10 kb apart tend to be carried on different haplotypes relative to pairs of synonymous SNPs. In contrast, for recessive mutations, there is no consistent ordering of how selection coefficients affect LD decay, due to the complex interplay of different evolutionary effects. We then examined empirical data of modern humans from the 1000 Genomes Project. LD between derived alleles at nonsynonymous SNPs is lower compared to pairs of derived synonymous variants, suggesting that nonsynonymous derived alleles tend to occur on different haplotypes more than synonymous variants. This result holds when controlling for potential confounding factors by matching SNPs for frequency in the sample (allele count), physical distance, magnitude of background selection, and genetic distance between pairs of variants. Lastly, we introduce a new statistic HR(j) which allows us to detect interference using unphased genotypes. Application of this approach to high-coverage human genome sequences confirms our finding that nonsynonymous derived alleles tend to be located on different haplotypes more often than are synonymous derived alleles. Our findings suggest that interference may play a pervasive role in shaping patterns of LD between deleterious variants in the human genome, and consequently influences genome-wide patterns of LD.  相似文献   

17.
In asexual lineages, both synonymous and nonsynonymous sequence polymorphism may be reduced due to severe founder effects when asexual lineages originate. However, mildly deleterious (nonsynonymous) mutations may accumulate after asexual lineages are formed, because the efficiency of purifying selection is reduced even in the nonrecombining mitochondrial genome. Here we examine patterns of synonymous and nonsynonymous mitochondrial sequence polymorphism in asexual and sexual lineages of the freshwater snail Campeloma. Using clade-specific estimates, we found that synonymous sequence polymorphism was significantly reduced by 75% in asexuals relative to sexuals, whereas nonsynonymous sequence polymorphism did not differ significantly between sexuals and asexuals. Two asexual clades had high negative values for Tajima's D statistic. Coalescent simulations confirmed that various bottleneck scenarios can account for this result. We also used branch-specific estimates of the ratio of amino acid to silent substitutions, K(a)/K(s). Our study revealed that K(a)/K(s) ratios are six times higher in terminal branches of independent asexual lineages compared to sexuals. Coalescent-based reconstruction of gene networks for all sexual and asexual clades indicated that nonsynonymous mutations occurred at a higher frequency in recently derived asexual haplotypes. These findings suggest that patterns of synonymous and nonsynonymous nucleotide polymorphism in asexual snail lineages may be shaped by both severe founder effect and relaxed purifying selection.  相似文献   

18.
Elevated rates of nonsynonymous substitution in island birds   总被引:4,自引:0,他引:4  
Slightly deleterious mutations are expected to fix at relatively higher rates in small populations than in large populations. Support for this prediction of the nearly-neutral theory of molecular evolution comes from many cases in which lineages inferred to differ in long-term average population size have different rates of nonsynonymous substitution. However, in most of these cases, the lineages differ in many other ways as well, leaving open the possibility that some factor other than population size might have caused the difference in substitution rates. We compared synonymous and nonsynonymous substitutions in the mitochondrial cyt b and ND2 genes of nine closely related island and mainland lineages of ducks and doves. We assumed that island taxa had smaller average population sizes than those of their mainland sister taxa for most of the time since they were established. In all nine cases, more nonsynonymous substitutions occurred on the island branch, but synonymous substitutions showed no significant bias. As in previous comparisons of this kind, the lineages with smaller populations might differ in other respects that tend to increase rates of nonsynonymous substitution, but here such differences are expected to be slight owing to the relatively recent origins of the island taxa. An examination of changes to apparently "preferred" and "unpreferred" synonymous codons revealed no consistent difference between island and mainland lineages.  相似文献   

19.
Recent analyses have shown that nonsynonymous variation in human mitochondrial DNA (mtDNA) contains nonneutral variants, suggesting the presence of mildly deleterious mutations. Many of the disease-causing mutations in mtDNA occur in the genes encoding the tRNAs. Nucleotide sequence variation in these genes has not been studied in human populations, nor have the structural consequences of nucleotide substitutions in tRNA molecules been examined. We therefore determined the nucleotide sequences of the 22 tRNA genes in the mtDNA of 477 Finns and, also, obtained 435 European sequences from the MitoKor database. No differences in population polymorphism indices were found between the two data sets. We assessed selective constraints against various tRNA domains by comparing allele frequencies between these domains and the synonymous and nonsynonymous sites, respectively. All tRNA domains except the variable loop were more conserved than synonymous sites, and T stem and D stem were more conserved than the respective loops. We also analyzed the energetic consequences of the 96 polymorphisms recovered in the two data sets or in the Mitomap database. The minimum free energy (ΔG) was calculated using the free energy rules as implemented in mfold version 3.1. The ΔG’s were normally distributed among the 22 wild-type tRNA genes, whereas the 96 polymorphic tRNAs departed significantly from a normal distribution. The largest differences in ΔG between the wild-type and the polymorphic tRNAs in the Finnish population tended to be in the polymorphisms that were present at low frequencies. Allele frequency distributions and minimum free energy calculations both suggested that some polymorphisms in tRNA genes are nonneutral.Reviewing Editor: Dr. Rüdiger Cerff  相似文献   

20.
Single nucleotide polymorphisms (SNPs) are believed to contain relevant information and have been therefore extensively used as genetic markers in population and conservation genetics, and molecular ecology studies. This study reports on the identification of potential SNPs in a diploid European sea bass Dicentrarchus labrax genome by using reference sequences from three assembled chromosomes and mapping all WGS datasets onto them (3× Sanger, 3× 454 and 20× SOLEXA). A total of 20,779 SNPs were identified over the 1469 gene loci and intergenic space analysed. Within chromosomes the occurrence of SNPs was the lowest in exons and higher in introns and intergenic regions, which may be explained by the fact, that coding regions are under strong selective pressure to maintain their biological function. The ratio of nonsynonymous to synonymous mutations was smaller than one for all the chromosomes, suggesting that most of deleterious nonsynonymous mutations were eliminated by negative selection. SNPs were not uniformly distributed over the chromosomes. Two chromosomes exhibited large regions with extremely low SNP density, which might represent homozygous regions in the diploid genome. The results of this study show how SNP detection can take profit from sequencing a single diploid individual, but also uncover the limits of such an approach. SNPs that have been identified will support marker development for genetic linkage mapping, population genetics and aquaculture related questions in general.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号