首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Molecular sequences obtained at different sampling times from populations of rapidly evolving pathogens and from ancient subfossil and fossil sources are increasingly available with modern sequencing technology. Here, we present a Bayesian statistical inference approach to the joint estimation of mutation rate and population size that incorporates the uncertainty in the genealogy of such temporally spaced sequences by using Markov chain Monte Carlo (MCMC) integration. The Kingman coalescent model is used to describe the time structure of the ancestral tree. We recover information about the unknown true ancestral coalescent tree, population size, and the overall mutation rate from temporally spaced data, that is, from nucleotide sequences gathered at different times, from different individuals, in an evolving haploid population. We briefly discuss the methodological implications and show what can be inferred, in various practically relevant states of prior knowledge. We develop extensions for exponentially growing population size and joint estimation of substitution model parameters. We illustrate some of the important features of this approach on a genealogy of HIV-1 envelope (env) partial sequences.  相似文献   

2.
Yang Z  Nielsen R  Goldman N  Pedersen AM 《Genetics》2000,155(1):431-449
Comparison of relative fixation rates of synonymous (silent) and nonsynonymous (amino acid-altering) mutations provides a means for understanding the mechanisms of molecular sequence evolution. The nonsynonymous/synonymous rate ratio (omega = d(N)d(S)) is an important indicator of selective pressure at the protein level, with omega = 1 meaning neutral mutations, omega < 1 purifying selection, and omega > 1 diversifying positive selection. Amino acid sites in a protein are expected to be under different selective pressures and have different underlying omega ratios. We develop models that account for heterogeneous omega ratios among amino acid sites and apply them to phylogenetic analyses of protein-coding DNA sequences. These models are useful for testing for adaptive molecular evolution and identifying amino acid sites under diversifying selection. Ten data sets of genes from nuclear, mitochondrial, and viral genomes are analyzed to estimate the distributions of omega among sites. In all data sets analyzed, the selective pressure indicated by the omega ratio is found to be highly heterogeneous among sites. Previously unsuspected Darwinian selection is detected in several genes in which the average omega ratio across sites is <1, but in which some sites are clearly under diversifying selection with omega > 1. Genes undergoing positive selection include the beta-globin gene from vertebrates, mitochondrial protein-coding genes from hominoids, the hemagglutinin (HA) gene from human influenza virus A, and HIV-1 env, vif, and pol genes. Tests for the presence of positively selected sites and their subsequent identification appear quite robust to the specific distributional form assumed for omega and can be achieved using any of several models we implement. However, we encountered difficulties in estimating the precise distribution of omega among sites from real data sets.  相似文献   

3.
Genetic similarities within and between human populations   总被引:2,自引:0,他引:2       下载免费PDF全文
The proportion of human genetic variation due to differences between populations is modest, and individuals from different populations can be genetically more similar than individuals from the same population. Yet sufficient genetic data can permit accurate classification of individuals into populations. Both findings can be obtained from the same data set, using the same number of polymorphic loci. This article explains why. Our analysis focuses on the frequency, omega, with which a pair of random individuals from two different populations is genetically more similar than a pair of individuals randomly selected from any single population. We compare omega to the error rates of several classification methods, using data sets that vary in number of loci, average allele frequency, populations sampled, and polymorphism ascertainment strategy. We demonstrate that classification methods achieve higher discriminatory power than omega because of their use of aggregate properties of populations. The number of loci analyzed is the most critical variable: with 100 polymorphisms, accurate classification is possible, but omega remains sizable, even when using populations as distinct as sub-Saharan Africans and Europeans. Phenotypes controlled by a dozen or fewer loci can therefore be expected to show substantial overlap between human populations. This provides empirical justification for caution when using population labels in biomedical settings, with broad implications for personalized medicine, pharmacogenetics, and the meaning of race.  相似文献   

4.
Intestinal aspects of lipid absorption: in review   总被引:2,自引:0,他引:2  
The rapidly evolving field of lipid absorption is reviewed with the thrust of new knowledge focused on the interpendency of the luminal and cellular phases of absorption. To date little attention has been paid to factors that regulate the phospholipid biosynthesis in the enterocyte. The availability of 20:4 omega 6 may be the rate-limiting factor for phospholipid synthesis. The source of 20:4 omega 6 is unknown, whether it be synthesized de novo the enterocyte or entirely originating from degradation of bile phospholipid. It has been established that dietary fat can modulate the enterocyte membrane lipid composition and transport properties. Specified fats such as as fish oils rich in 20:5 omega 3 and 22:6 omega 3 have been implicated as protective against hypercholesterolemia. However, the effects of these dietary fats on the transport of nutrients across the enterocyte are not yet known, nor are the mechanisms responsible for the adaptive responses of the brush border identified.  相似文献   

5.
The ratio of nonsynonymous (dN) to synonymous (dS) substitution rates, omega, provides a measure of selection at the protein level. Models have been developed that allow omega to vary among lineages. However, these models require the lineages in which differential selection has acted to be specified a priori. We propose a genetic algorithm approach to assign lineages in a phylogeny to a fixed number of different classes of omega, thus allowing variable selection pressure without a priori specification of particular lineages. This approach can identify models with a better fit than a single-ratio model, and with fits that are better than (in an information theoretic sense) a fully local model, in which all lineages are assumed to evolve under different values of omega, but with far fewer parameters. By averaging over models which explain the data reasonably well, we can assess the robustness of our conclusions to uncertainty in model estimation. Our approach can also be used to compare results from models in which branch classes are specified a priori with a wide range of credible models. We illustrate our methods on primate lysozyme sequences and compare them with previous methods applied to the same data sets.  相似文献   

6.
Site-specific selfish genes exploit host functions to copy themselves into a defined target DNA sequence, and include homing endonuclease genes, group II introns and some LINE-like transposable elements. If such genes can be engineered to target new host sequences, then they can be used to manipulate natural populations, even if the number of individuals released is a small fraction of the entire population. For example, a genetic load sufficient to eradicate a population can be imposed in fewer than 20 generations, if the target is an essential host gene, the knockout is recessive and the selfish gene has an appropriate promoter. There will be selection for resistance, but several strategies are available for reducing the likelihood of it evolving. These genes may also be used to genetically engineer natural populations, by means of population-wide gene knockouts, gene replacements and genetic transformations. By targeting sex-linked loci just prior to meiosis one may skew the population sex ratio, and by changing the promoter one may limit the spread of the gene to neighbouring populations. The proposed constructs are evolutionarily stable in the face of the mutations most likely to arise during their spread, and strategies are also available for reversing the manipulations.  相似文献   

7.
8.
Increasing evidence suggests that omega 3 fatty acids derived from fish and fish oils may play a protective role in coronary heart disease and its many complications, through a variety of actions, including effects on lipids, blood pressure, cardiac and vascular function, prostanoids, coagulation and immunological responses. Interesting differences between the effects of highly purified eicosapentaenoic acid and docosahexaenoic acid are emerging, which may be relevant in the choice of omega 3 fatty acid for incorporation into food products. On the basis of our current knowledge, we believe it is justified to recommend, particularly to high-risk populations, an increased dietary intake of omega 3 fatty acids through the consumption of fish.  相似文献   

9.
Summary The statistical properties of sample estimation and bootstrap estimation of phylogenetic variability from a sample of nucleotide sequences were studied by considering model trees of three taxa with an outgroup. The cases of constant and varying rates of nucleotide substitution were compared. From sequences obtained by simulation, phylogenetic trees were constructed by using the maximum parsimony (MP) and neighbor joining (NJ) methods. The effectiveness and consistency of the MP method were studied in terms of proportions of informative sites. The results of simulation showed that bootstrap estimation of the confidence level for an inferred phylogeny can be used even under unequal rates of evolution if the rate differences are not large so that the MP method is not misleading. The condition under which the MP method becomes misleading (inconsistent) is more stringent for slowly evolving sequences than for rapidly evolving ones, and it also depends on the length of the internal branch. If the rate differences are large so that the MP method becomes consistently misleading, then bootstrap estimation will reinforce an erroneous conclusion on topology. Similar conclusions apply to the NJ method with uncorrected distances. The NJ method with corrected distances performs poorly when the sequence length is short but can avoid the inconsistency problem if the sequence length is long and if the distances can be estimated accurately.Offprint requests to: W.-H. Li  相似文献   

10.
Deleterious mutations affecting biological function of proteins are constantly being rejected by purifying selection from the gene pool. The non-synonymous/synonymous substitution rate ratio (omega) is a measure of selective pressure on amino acid replacement mutations for protein-coding genes. Different methods have been developed in order to predict non-synonymous changes affecting gene function. However, none has considered the estimation of selective constraints acting on protein residues. Here, we have used codon-based maximum likelihood models in order to estimate the selective pressures on the individual amino acid residues of a well-known model protein: p53. We demonstrate that the number of residues under strong purifying selection in p53 is much higher than those that are strictly conserved during the evolution of the species. In agreement with theoretical expectations, residues that have been noted to be of structural relevance, or in direct association with DNA, were among those showing the highest signals of purifying selection. Conversely, those changing according to a neutral, or nearly neutral mode of evolution, were observed to be irrelevant for protein function. Finally, using more than 40 human disease genes, we demonstrate that residues evolving under strong selective pressures (omega<0.1) are significantly associated (p<0.01) with human disease. We hypothesize that non-synonymous change on amino acids showing omega<0.1 will most likely affect protein function. The application of this evolutionary prediction at a genomic scale will provide an a priori hypothesis of the phenotypic effect of non-synonymous coding single nucleotide polymorphisms (SNPs) in the human genome.  相似文献   

11.
Codon-based substitution models have been widely used to identify amino acid sites under positive selection in comparative analysis of protein-coding DNA sequences. The nonsynonymous-synonymous substitution rate ratio (d(N)/d(S), denoted omega) is used as a measure of selective pressure at the protein level, with omega > 1 indicating positive selection. Statistical distributions are used to model the variation in omega among sites, allowing a subset of sites to have omega > 1 while the rest of the sequence may be under purifying selection with omega < 1. An empirical Bayes (EB) approach is then used to calculate posterior probabilities that a site comes from the site class with omega > 1. Current implementations, however, use the naive EB (NEB) approach and fail to account for sampling errors in maximum likelihood estimates of model parameters, such as the proportions and omega ratios for the site classes. In small data sets lacking information, this approach may lead to unreliable posterior probability calculations. In this paper, we develop a Bayes empirical Bayes (BEB) approach to the problem, which assigns a prior to the model parameters and integrates over their uncertainties. We compare the new and old methods on real and simulated data sets. The results suggest that in small data sets the new BEB method does not generate false positives as did the old NEB approach, while in large data sets it retains the good power of the NEB approach for inferring positively selected sites.  相似文献   

12.
13.
A comparative approach was taken for identifying amino acid substitutions that may be under positive Darwinian selection and are correlated with spectral shifts among orthologous and paralogous lepidopteran long wavelength-sensitive (LW) opsins. Four novel LW opsin fragments were isolated, cloned, and sequenced from eye-specific cDNAs from two butterflies, Vanessa cardui (Nymphalidae) and Precis coenia (Nymphalidae), and two moths, Spodoptera exigua (Noctuidae) and Galleria mellonella (Pyralidae). These opsins were sampled because they encode visual pigments having a naturally occurring range of lambda(max) values (510-530 nm), which in combination with previously characterized lepidopteran opsins, provide a complete range of known spectral sensitivities (510-575 nm) among lepidopteran LW opsins. Two recent opsin gene duplication events were found within the papilionid but not within the nymphalid butterfly families through neighbor-joining, maximum parsimony, and maximum likelihood phylogenetic analyses of 13 lepidopteran opsin sequences. An elevated rate of evolution was detected in the red-shifted Papilio Rh3 branch following gene duplication, because of an increase in the amino acid substitution rate in the transmembrane domain of the protein, a region that forms the chromophore-binding pocket of the visual pigment. A maximum likelihood approach was used to estimate omega, the ratio of nonsynonymous to synonymous substitutions per site. Branch-specific tests of selection (free-ratio) identified one branch with omega = 2.1044, but the small number of substitutions involved was not significantly different from the expected number of changes under the neutral expectation of omega = 1. Ancestral sequences were reconstructed with a high degree of certainty from these data. Reconstructed ancestral sequences revealed several instances of convergence to the same amino acid between butterfly and vertebrate cone pigments, and between independent branches of the butterfly opsin tree that are correlated with spectral shifts.  相似文献   

14.
The morphology of some Hoplia species (Scarabaeoidea: Hopliinae) is so variable that parapatric populations have often been considered different species or subspecies. In this study we analyze the nucleotide sequences of a fragment of mitochondrial gene cytochrome c oxidase subunit I (COI) of six species and two subspecies of Palaearctic Hoplia to reexamine the species limits. Based on the analysis of sequences from COI and morphological and ecological observations, we consider Hoplia freyi Baraud to be a junior synonym of Hoplia chlorophana Erichson and H. philanthus ramburi Heyden to be a junior synonym of H. philanthus philanthus (Fuessly). However, complete resolution of relationships among H. philanthus subspecies requires the addition of sequences from genes evolving faster than COI. Phylogenetic relationships among the species studied are discussed.  相似文献   

15.
The phenomenological solute permeability (omega p) of a membrane measures the flux of solute across it when the concentrations of the solutions on the two sides of the membrane differ. The relationship between omega p and the the conventionally measured tracer permeability (omega T) is examined for homoporous and heteroporous (parallel path) membranes in nonideal, nondilute solutions and in the presence of boundary layers. In general, omega p and omega T are not equal; therefore, predictions of transmembrane solute flux based on omega T are always subject to error. For a homoporous membrane, the two permeabilities become equal as the solutions become ideal and dilute. For heteroporous membranes, omega p is always greater than omega T. An upper bound on omega p- omega T is derived to provide an estimate of the maximum error in predicted solute flux. This bound is also used to show that the difference between omega P and omega T demonstrated earlier for the sucrose-Cuprophan system can be explained if the membrane is heteroporous. The expressions for omega P developed here support the use of a modified osmotic driving force to describe membrane transport in nonideal, nondilute solutions.  相似文献   

16.
Mitochondrial DNA (mtDNA) control-region (CR) sequences were analysed to address three questions regarding the evolution of geographical variation in song sparrows. (i) Are mtDNA sequences more informative about phylogenetic relationships and population history than previously published restriction fragment (RFLP) data? (ii) Are song sparrow CR sequences evolving in a selectively neutral manner? (iii) What do the haplotype cladogram and geographical pattern of nucleotide diversity (π) suggest about the recent evolutionary history of song sparrow populations? Results from phylogenetic analyses of CR sequences corroborate RFLP results and reveal instances in which haplotypes do not group by locality. Neutrality tests ( 51 ) suggest that song sparrow mtDNA is evolving in a selectively neutral manner, although exceptions are noted. A novel geographical pattern of π suggests a model of song sparrow population history involving multiple Pleistocene refugia and colonization of some formerly glaciated regions from multiple sources. Moreover, application of coalescence theory to the haplotype cladogram suggests that two different haplotypes (48NF and 151HA) may have predominated in different parts of the song sparrow's range. This model provides insight into the current distribution of song sparrow mtDNA haplotypes and may explain the discordance between evolutionary history inferred from mtDNA and morphology in this species.  相似文献   

17.
Human genetic diseases have been successfully corrected by integration of functional copies of the defective genes into human cells, but in some cases integration of therapeutic vectors has activated proto-oncogenes and contributed to leukemia. For this reason, extensive efforts have focused on analyzing integration site populations from patient samples, but the most commonly used methods for recovering newly integrated DNA suffer from severe recovery biases. Here, we show that a new method based on phage Mu transposition in vitro allows convenient and consistent recovery of integration site sequences in a form that can be analyzed directly using DNA barcoding and pyrosequencing. The method also allows simple estimation of the relative abundance of gene-modified cells from human gene therapy subjects, which has previously been lacking but is crucial for detecting expansion of cell clones that may be a prelude to adverse events.  相似文献   

18.
19.
The vast majority of microorganisms in the environment remain uncultured, and their existence is known only from sequences retrieved by PCR. As a consequence, our understanding of the ecological function of dominant microbial populations in the environment is limited. We will review microbial diversity studies and show that these may have moved from an extreme underestimation to a potentially severe overestimation of diversity. The latter results from a simple PCR-generated artifact: the cloning of heteroduplex molecules followed by Escherichia coli mismatch repair, which may generate an exponential increase in observed sequence diversity. However, simple modifications to current PCR amplification protocols minimize such artifactual sequences and may bring within our reach estimation of bacterial diversity in environmental samples. Such estimates may spur new culture-independent approaches based on genomic and microarray technology, allowing correlation of phylogenetic identity with the ecological function of unculturable organisms. In particular, we are developing a DNA microarray that enables identification of individual populations active in utilization of specific organic substrates. The array consists of 16S and 23S rDNA-targeted oligonucleotides and is hybridized to RNA extracted from samples incubated with (14)C-labeled organic substrates. Populations that metabolize the substrate can be identified by the radiolabel incorporated in their rRNA after only one to two cell doublings, ensuring realistic preservation of community structure. Thus, the microarray approach may provide a powerful means to link microbial community structure with in situ function of individual populations.  相似文献   

20.
Swanson WJ  Wong A  Wolfner MF  Aquadro CF 《Genetics》2004,168(3):1457-1465
Genes whose products are involved in reproduction include some of the fastest-evolving genes found within the genomes of several organisms. Drosophila has long been used to study the function and evolutionary dynamics of genes thought to be involved in sperm competition and sexual conflict, two processes that have been hypothesized to drive the adaptive evolution of reproductive molecules. Several seminal fluid proteins (Acps) made in the Drosophila male reproductive tract show evidence of rapid adaptive evolution. To identify candidate genes in the female reproductive tract that may be involved in female-male interactions and that may thus have been subjected to adaptive evolution, we used an evolutionary bioinformatics approach to analyze sequences from a cDNA library that we have generated from Drosophila female reproductive tracts. We further demonstrate that several of these genes have been subjected to positive selection. Their expression in female reproductive tracts, presence of signal sequences/transmembrane domains, and rapid adaptive evolution indicate that they are prime candidates to encode female reproductive molecules that interact with rapidly evolving male Acps.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号