首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
It has long been recognized that the rates of molecular evolution vary amongst sites in proteins. The usual model for rate heterogeneity assumes independent rate variation according to a rate distribution. In such models the rate at a site, although random, is assumed fixed throughout the evolutionary tree. Recent work by several groups has suggested that rates at sites often vary across subtrees of the larger tree as well as across sites. This phenomenon is not captured by most phylogenetic models but instead is more similar to the covarion model of Fitch and coworkers. In this article we present methods that can be useful in detecting whether different rates occur in two different subtrees of the larger tree and where these differences occur. Parametric bootstrapping and orthogonal regression methodologies are used to test for rate differences and to make statements about the general differences in the rates at sites. Confidence intervals based on the conditional distributions of rates at sites are then used to detect where the rate differences occur. Such methods will be helpful in studying the phylogenetic, structural, and functional bases of changes in evolutionary rates at sites, a phenomenon that has important consequences for deep phylogenetic inference.  相似文献   

2.
The species rich butterfly family Nymphalidae has been used to study evolutionary interactions between plants and insects. Theories of insect-hostplant dynamics predict accelerated diversification due to key innovations. In evolutionary biology, analysis of maximum credibility trees in the software MEDUSA (modelling evolutionary diversity using stepwise AIC) is a popular method for estimation of shifts in diversification rates. We investigated whether phylogenetic uncertainty can produce different results by extending the method across a random sample of trees from the posterior distribution of a Bayesian run. Using the MultiMEDUSA approach, we found that phylogenetic uncertainty greatly affects diversification rate estimates. Different trees produced diversification rates ranging from high values to almost zero for the same clade, and both significant rate increase and decrease in some clades. Only four out of 18 significant shifts found on the maximum clade credibility tree were consistent across most of the sampled trees. Among these, we found accelerated diversification for Ithomiini butterflies. We used the binary speciation and extinction model (BiSSE) and found that a hostplant shift to Solanaceae is correlated with increased net diversification rates in Ithomiini, congruent with the diffuse cospeciation hypothesis. Our results show that taking phylogenetic uncertainty into account when estimating net diversification rate shifts is of great importance, as very different results can be obtained when using the maximum clade credibility tree and other trees from the posterior distribution.  相似文献   

3.
The neutral theory of molecular evolution predicts that rates of phenotypic change are largely independent from genotypic change. A recent study by Bromham et al. (2002) confirmed this expectation, finding no evidence for correlated phenotypic and molecular evolutionary rates in animals. We reevaluate this hypothesis, sampling at different taxonomic levels in plants and animals, using Bayesian inference to reconstruct phylogenetic trees and estimate rates of molecular evolution. We use independent contrasts in branch lengths to maximize the information extracted from each of the trees and nodal posterior probabilities to assess the influence of phylogenetic error. Our results indicate that in vascular plants between 2% and 11% of the variation in phenotypic rates of change can be explained by the rate of genotypic change. These results may be explained by the idea that processes that affect general evolutionary rates, such as body size, may also be expected to influence rates of morphological change.  相似文献   

4.
The evolutionary rate at an amino acid site is indicative of how conserved this site is and, in turn, allows evaluating the importance of this site in maintaining the structure/function of the protein. When evolutionary rates are estimated, one must reconstruct the phylogenetic tree describing the evolutionary relationship among the sequences under study. However, if the inferred phylogenetic tree is incorrect, it can lead to erroneous site-specific rate estimates. Here we describe a novel Bayesian method that uses Markov chain Monte Carlo methodology to integrate over the space of all possible trees and model parameters. By doing so, the method considers alternative evolutionary scenarios weighted by their posterior probabilities. We show that this comprehensive evolutionary approach is superior over methods that are based on only a single tree. We illustrate the potential of our algorithm by analyzing the conservation pattern of the potassium channel protein family.Itay Mayrose, Amir Mitchell contributed equal. Reviewing Editor : Dr. Nicolas Galtier  相似文献   

5.
Many evolutionary processes can lead to a change in the correlation between continuous characters over time or on different branches of a phylogenetic tree. Shifts in genetic or functional constraint, in the selective regime, or in some combination thereof can influence both the evolution of continuous traits and their relation to each other. These changes can often be mapped on a phylogenetic tree to examine their influence on multivariate phenotypic diversification. We propose a new likelihood method to fit multiple evolutionary rate matrices (also called evolutionary variance–covariance matrices) to species data for two or more continuous characters and a phylogeny. The evolutionary rate matrix is a matrix containing the evolutionary rates for individual characters on its diagonal, and the covariances between characters (of which the evolutionary correlations are a function) elsewhere. To illustrate our approach, we apply the method to an empirical dataset consisting of two features of feeding morphology sampled from 28 centrarchid fish species, as well as to data generated via phylogenetic numerical simulations. We find that the method has appropriate type I error, power, and parameter estimation. The approach presented herein is the first to allow for the explicit testing of how and when the evolutionary covariances between characters have changed in the history of a group.  相似文献   

6.
The extinction of species results in a permanent loss of evolutionary history. Recent theoretical studies show that this loss may be proportionally much smaller than the loss of species, but under some conditions can exceed it. Such conditions occur when the phylogenetic tree that describes the evolutionary relationships among species is highly imbalanced due to differences between lineages in past speciation and/or extinction rates. I used the taxonomy by C. G. Sibley and B. L. Monroe Jr to estimate the global loss of bird evolutionary history from historical and predicted extinctions, and to quantify the ensuing changes in balance of the bird phylogenetic tree. In the global bird fauna, evolutionary history is being lost at a high rate, similar to the rate of species extinction. The bird phylogenetic tree is highly imbalanced, and the imbalance is increased significantly by anthropogenic extinction. Historically, the elevated loss of bird evolutionary history has been fuelled mostly by phylogenetic non-randomness in the extinction of species, but the direct effect of tree imbalance is substantial and could dominate in the future.  相似文献   

7.
The tempo and mode of species diversification and phenotypic evolution vary widely across the tree of life, yet the relationship between these processes is poorly known. Previous tests of the relationship between rates of phenotypic evolution and rates of species diversification have assumed that species richness increases continuously through time. If this assumption is violated, simple phylogenetic estimates of net diversification rate may bear no relationship to processes that influence the distribution of species richness among clades. Here, we demonstrate that the variation in species richness among plethodontid salamander clades is unlikely to have resulted from simple time-dependent processes, leading to fundamentally different conclusions about the relationship between rates of phenotypic evolution and species diversification. Morphological evolutionary rates of both size and shape evolution are correlated with clade species richness, but are uncorrelated with simple estimators of net diversification that assume constancy of rates through time. This coupling between species diversification and phenotypic evolution is consistent with the hypothesis that clades with high rates of morphological trait evolution may diversify more than clades with low rates. Our results indicate that assumptions about underlying processes of diversity regulation have important consequences for interpreting macroevolutionary patterns.  相似文献   

8.
Phylogenetic comparative methods that incorporate intraspecific variability are relatively new and, so far, not especially widely used in empirical studies. In the present short article we will describe a new Bayesian method for fitting evolutionary models to comparative data that incorporates intraspecific variability. This method differs from an existing likelihood-based approach in that it requires no a priori inference about species means and variances; rather it takes phenotypic values from individuals and a phylogenetic tree as input, and then samples species means and variances, along with the parameters of the evolutionary model, from their joint posterior probability distribution. One of the most novel and intriguing attributes of this approach is that jointly sampling the species means with the evolutionary model parameters means that the model and tree can influence our estimates of species mean trait values, not just the reverse. In the present implementation, we first apply this method to the most widely used evolutionary model for continuously valued phenotypic trait data (Brownian motion). However, the general approach has broad applicability, which we illustrate by also fitting the λ model, another simple model for quantitative trait evolution on a phylogeny. We test our approach via simulation and by analyzing two empirical datasets obtained from the literature. Finally, we have implemented the methods described herein in a new function for the R statistical computing environment, and this function will be distributed as part of the 'phytools' R library.  相似文献   

9.
A compound poisson process for relaxing the molecular clock   总被引:18,自引:0,他引:18  
Huelsenbeck JP  Larget B  Swofford D 《Genetics》2000,154(4):1879-1892
The molecular clock hypothesis remains an important conceptual and analytical tool in evolutionary biology despite the repeated observation that the clock hypothesis does not perfectly explain observed DNA sequence variation. We introduce a parametric model that relaxes the molecular clock by allowing rates to vary across lineages according to a compound Poisson process. Events of substitution rate change are placed onto a phylogenetic tree according to a Poisson process. When an event of substitution rate change occurs, the current rate of substitution is modified by a gamma-distributed random variable. Parameters of the model can be estimated using Bayesian inference. We use Markov chain Monte Carlo integration to evaluate the posterior probability distribution because the posterior probability involves high dimensional integrals and summations. Specifically, we use the Metropolis-Hastings-Green algorithm with 11 different move types to evaluate the posterior distribution. We demonstrate the method by analyzing a complete mtDNA sequence data set from 23 mammals. The model presented here has several potential advantages over other models that have been proposed to relax the clock because it is parametric and does not assume that rates change only at speciation events. This model should prove useful for estimating divergence times when substitution rates vary across lineages.  相似文献   

10.
This article generalizes previous models for codon substitution and rate variation in molecular phylogeny. Particular attention is paid to (1) reversibility, (2) acceptance and rejection of proposed codon changes, (3) varying rates of evolution among codon sites, and (4) the interaction of these sites in determining evolutionary rates. To accommodate spatial variation in rates, Markov random fields rather than Markov chains are introduced. Because these innovations complicate maximum likelihood estimation in phylogeny reconstruction, it is necessary to formulate new algorithms for the evaluation of the likelihood and its derivatives with respect to the underlying kinetic, acceptance, and spatial parameters. To derive the most from maximum likelihood analysis of sequence data, it is useful to compute posterior probabilities assigning residues to internal nodes and evolutionary rate classes to codon sites. It is also helpful to search through tree space in a way that respects accepted phylogenetic relationships. Our phylogeny program LINNAEUS implements algorithms realizing these goals. Readers may consult our companion article in this issue for several examples.  相似文献   

11.
Simultaneous molecular dating of population and species divergences is essential in many biological investigations, including phylogeography, phylodynamics and species delimitation studies. In these investigations, multiple sequence alignments consist of both intra‐ and interspecies samples (mixed samples). As a result, the phylogenetic trees contain interspecies, interpopulation and within‐population divergences. Bayesian relaxed clock methods are often employed in these analyses, but they assume the same tree prior for both inter‐ and intraspecies branching processes and require specification of a clock model for branch rates (independent vs. autocorrelated rates models). We evaluated the impact of a single tree prior on Bayesian divergence time estimates by analysing computer‐simulated data sets. We also examined the effect of the assumption of independence of evolutionary rate variation among branches when the branch rates are autocorrelated. Bayesian approach with coalescent tree priors generally produced excellent molecular dates and highest posterior densities with high coverage probabilities. We also evaluated the performance of a non‐Bayesian method, RelTime, which does not require the specification of a tree prior or a clock model. RelTime's performance was similar to that of the Bayesian approach, suggesting that it is also suitable to analyse data sets containing both populations and species variation when its computational efficiency is needed.  相似文献   

12.
We propose a Bayesian method for testing molecular clock hypotheses for use with aligned sequence data from multiple taxa. Our method utilizes a nonreversible nucleotide substitution model to avoid the necessity of specifying either a known tree relating the taxa or an outgroup for rooting the tree. We employ reversible jump Markov chain Monte Carlo to sample from the posterior distribution of the phylogenetic model parameters and conduct hypothesis testing using Bayes factors, the ratio of the posterior to prior odds of competing models. Here, the Bayes factors reflect the relative support of the sequence data for equal rates of evolutionary change between taxa versus unequal rates, averaged over all possible phylogenetic parameters, including the tree and root position. As the molecular clock model is a restriction of the more general unequal rates model, we use the Savage-Dickey ratio to estimate the Bayes factors. The Savage-Dickey ratio provides a convenient approach to calculating Bayes factors in favor of sharp hypotheses. Critical to calculating the Savage-Dickey ratio is a determination of the prior induced on the modeling restrictions. We demonstrate our method on a well-studied mtDNA sequence data set consisting of nine primates. We find strong support against a global molecular clock, but do find support for a local clock among the anthropoids. We provide mathematical derivations of the induced priors on branch length restrictions assuming equally likely trees. These derivations also have more general applicability to the examination of prior assumptions in Bayesian phylogenetics.  相似文献   

13.
Evolutionary biologists since Darwin have been fascinated by differences in the rate of trait-evolutionary change across lineages. Despite this continued interest, we still lack methods for identifying shifts in evolutionary rates on the growing tree of life while accommodating uncertainty in the evolutionary process. Here we introduce a Bayesian approach for identifying complex patterns in the evolution of continuous traits. The method (auteur) uses reversible-jump Markov chain Monte Carlo sampling to more fully characterize the complexity of trait evolution, considering models that range in complexity from those with a single global rate to potentially ones in which each branch in the tree has its own independent rate. This newly introduced approach performs well in recovering simulated rate shifts and simulated rates for datasets nearing the size typical for comparative phylogenetic study (i.e., ≥64 tips). Analysis of two large empirical datasets of vertebrate body size reveal overwhelming support for multiple-rate models of evolution, and we observe exceptionally high rates of body-size evolution in a group of emydid turtles relative to their evolutionary background. auteur will facilitate identification of exceptional evolutionary dynamics, essential to the study of both adaptive radiation and stasis.  相似文献   

14.
The HSSP (Homology-Derived Secondary Structure of Proteins) database provides multiple sequence alignments (MSAs) for proteins of known three-dimensional (3D) structure in the Protein Data Bank (PDB). The database also contains an estimate of the degree of evolutionary conservation at each amino acid position. This estimate, which is based on the relative entropy, correlates with the functional importance of the position; evolutionarily conserved positions (i.e., positions with limited variability and low entropy) are occasionally important to maintain the 3D structure and biological function(s) of the protein. We recently developed the Rate4Site algorithm for scoring amino acid conservation based on their calculated evolutionary rate. This algorithm takes into account the phylogenetic relationships between the homologs and the stochastic nature of the evolutionary process. Here we present the ConSurf-HSSP database of Rate4Site estimates of the evolutionary rates of the amino acid positions, calculated using HSSP's MSAs. The database provides precalculated evolutionary rates for nearly all of the PDB. These rates are projected, using a color code, onto the protein structure, and can be viewed online using the ConSurf server interface. To exemplify the database, we analyzed in detail the conservation pattern obtained for pyruvate kinase and compared the results with those observed using the relative entropy scores of the HSSP database. It is reassuring to know that the main functional region of the enzyme is detectable using both conservation scores. Interestingly, the ConSurf-HSSP calculations mapped additional functionally important regions, which are moderately conserved and were overlooked by the original HSSP estimate. The ConSurf-HSSP database is available online (http://consurf-hssp.tau.ac.il).  相似文献   

15.
Abstract

Molecular sequence data have become prominent tools for phylogenetic relationship inference, particularly useful in the analysis of highly diverse taxonomic orders. Ribosomal RNA sequences provide markers that can be used in the study of phylogeny, because their function and structure have been conserved to a large extent throughout the evolutionary history of organisms. These sequences are inferred from cloned or enzymatically amplified gene sequences, or determined by direct RNA sequencing. The first step of the phylogenetic interpretation of nucleic acid sequence variations implies proper alignment of corresponding sequences from various organisms. Best alignment based on similarity criteria is greatly reinforced, in the case of ribosomal RNAs, by secondary structure homologies. Distance matrix methods to infer evolutionary trees are based on the assumption that the phylogenetic distance between each pair of organisms is proportional to the number of nucleotide substitution events. Computed tree inference methods usually take into consideration the possibility of unequal mutation rates among lineages. Divergence times can be estimated on the tree, provided that at least one lineage has been dated by fossil records. We have utilized this approach based on ribosomal RNA sequence comparison to investigate the phylogenetic relationship between dinoflagellated and other eukaryote protists, and to refine controverse phylogenies of the class Dinophycae.  相似文献   

16.
Rates of recombination vary considerably between species. Despite the significance of this observation for evolutionary biology and genetics, the evolutionary mechanisms that contribute to these interspecific differences are unclear. On fine physical scales, recombination rates appear to evolve rapidly between closely related species, but the mode and tempo of recombination rate evolution on the broader scale is poorly understood. Here, we use phylogenetic comparative methods to begin to characterize the evolutionary processes underlying average genomic recombination rates in mammals. We document a strong phylogenetic effect in recombination rates, indicating that more closely related species tend to have more similar average rates of recombination. We demonstrate that this phylogenetic signal is not an artifact of errors in recombination rate estimation and show that it is robust to uncertainty in the mammalian phylogeny. Neutral evolutionary models present good fits to the data and we find no evidence for heterogeneity in the rate of evolution in recombination across the mammalian tree. These results suggest that observed interspecific variation in average genomic rates of recombination is largely attributable to the steady accumulation of neutral mutations over evolutionary time. Although single recombination hotspots may live and die on short evolutionary time scales, the strong phylogenetic signal in genomic recombination rates indicates that the pace of evolution on this scale may be considerably slower.  相似文献   

17.
To what extent the speed of mutational production of phenotypic variation determines the rate of long-term phenotypic evolution is a central question. Houle et al. recently addressed this question by studying the mutational variances, additive genetic variances, and macroevolution of locations of vein intersections on fly wings, reporting very slow phenotypic evolution relative to the rates of mutational input, high phylogenetic signals, and a strong, linear relationship between the mutational variance of a trait and its rate of evolution. Houle et al. found no existing model of phenotypic evolution to be consistent with all these observations, and proposed the improbable scenario of equal influence of mutational pleiotropy on all traits. Here, we demonstrate that the purported linear relationship between mutational variance and evolutionary divergence is artifactual. We further show that the data are explainable by a simple model in which the wing traits are effectively neutral at least within a range of phenotypic values but their evolutionary rates are differentially reduced because mutations affecting these traits are purged owing to their different pleiotropic effects on other traits that are under stabilizing selection. Thus, the evolutionary patterns of fly wing morphologies are explainable under the existing theoretical framework of phenotypic evolution.  相似文献   

18.
Concerns have been raised that posterior probabilities on phylogenetic trees can be unreliable when the true tree is unresolved or has very short internal branches, because existing methods for Bayesian phylogenetic analysis do not explicitly evaluate unresolved trees. Two recent papers have proposed that evaluating only resolved trees results in a "star tree paradox": when the true tree is unresolved or close to it, posterior probabilities were predicted to become increasingly unpredictable as sequence length grows, resulting in inflated confidence in one resolved tree or another and an increasing risk of false-positive inferences. Here we show that this is not the case; existing Bayesian methods do not lead to an inflation of statistical confidence, provided the evolutionary model is correct and uninformative priors are assumed. Posterior probabilities do not become increasingly unpredictable with increasing sequence length, and they exhibit conservative type I error rates, leading to a low rate of false-positive inferences. With infinite data, posterior probabilities give equal support for all resolved trees, and the rate of false inferences falls to zero. We conclude that there is no star tree paradox caused by not sampling unresolved trees.  相似文献   

19.
Aspects of phylogenetic tree shape, and in particular tree balance, provide clues to the workings of the macroevolutionary process. I use a simulation approach to explore patterns in tree balance for several models of the evolutionary process under which speciation rates vary through the history of diversifying clades. I demonstrate that when speciation rates depend on an evolving trait of individuals, and are therefore “heritable” along evolutionary lineages, the resulting phylogenies become imbalanced. However, imbalance also results from some (but not all) models of “nonheritable” speciation rate variation. The degree of imbalance increases with the magnitude of speciation rate variation, and then for gradual evolution (but not punctuated equilibria) reaches an asymptote short of the theoretical maximum. Very high levels of rate variation are required to produce imbalance matching that found in real data (estimated phylogenies from the systematic literature). I discuss implications of the simulation results for our understanding of macroevolution.  相似文献   

20.
The rate at which a given site in a gene sequence alignment evolves over time may vary. This phenomenon--known as heterotachy--can bias or distort phylogenetic trees inferred from models of sequence evolution that assume rates of evolution are constant. Here, we describe a phylogenetic mixture model designed to accommodate heterotachy. The method sums the likelihood of the data at each site over more than one set of branch lengths on the same tree topology. A branch-length set that is best for one site may differ from the branch-length set that is best for some other site, thereby allowing different sites to have different rates of change throughout the tree. Because rate variation may not be present in all branches, we use a reversible-jump Markov chain Monte Carlo algorithm to identify those branches in which reliable amounts of heterotachy occur. We implement the method in combination with our 'pattern-heterogeneity' mixture model, applying it to simulated data and five published datasets. We find that complex evolutionary signals of heterotachy are routinely present over and above variation in the rate or pattern of evolution across sites, that the reversible-jump method requires far fewer parameters than conventional mixture models to describe it, and serves to identify the regions of the tree in which heterotachy is most pronounced. The reversible-jump procedure also removes the need for a posteriori tests of 'significance' such as the Akaike or Bayesian information criterion tests, or Bayes factors. Heterotachy has important consequences for the correct reconstruction of phylogenies as well as for tests of hypotheses that rely on accurate branch-length information. These include molecular clocks, analyses of tempo and mode of evolution, comparative studies and ancestral state reconstruction. The model is available from the authors' website, and can be used for the analysis of both nucleotide and morphological data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号