首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Abstract New inference techniques, such as supertrees, have improved the construction of large phylogenies, helping to reveal the tree of life. In addition, these large phylogenies have enhanced the study of other evolutionary questions, such as whether traits have evolved in a neutral or adaptive way, or what factors have influenced diversification. However, supertrees usually lack branch lengths, which are necessary for all these issues to be investigated. Here, divergence times within the largest family of flowering plants, namely the Asteraceae, are reviewed to estimate time‐calibrated branch lengths in the supertree of this lineage. An inconsistency between estimated dates of basal branching events and the earliest asteraceous fossil pollen record was detected. In addition, the impact of different methods of branch length assignment on the total number of transitions between states in the reconstruction of sexual system evolution in Asteraceae was investigated. At least for this dataset, different branch length assignation approaches influenced maximum likelihood (ML) reconstructions only and not Bayesian ones. Therefore, the selection of different branch length information is not arbitrary and should be carefully assessed, at least when ML approaches are being used. The reviewed divergence times and the estimated time‐calibrated branch lengths provide a useful tool for future phylogenetic comparative and macroevolutionary studies of Asteraceae.  相似文献   

2.
Recently, dated phylogenies have been increasingly used for ecological studies on community structure and conservation planning. There is, however, a major impediment to a systematic application of phylogenetic methods in ecology: reliable phylogenies with time-calibrated branch lengths are lacking for a large number of taxonomic groups and this condition is likely to continue for a long time. A solution for this problem consists in using undated phylogenies or taxonomic hierarchies as proxies for dated phylogenies. Nonetheless, little is known on the potential loss of information of these approaches compared to studies using dated phylogenies with time-calibrated branch lengths. The aim of this study is to ask how the use of undated phylogenies and taxonomic hierarchies biases a very simple measure of diversity, the mean pairwise phylogenetic distance between community species, compared to the diversity of dated phylogenies derived from the freely available software Phylomatic. This is illustrated with three sets of data on plant species sampled at different scales. Our results show that: (1) surprisingly, the diversity computed from dated phylogenies derived from Phylomatic is more strongly related to the diversity computed from taxonomic hierarchies than to the diversity computed from undated phylogenies, while (2) less surprisingly, the strength of this relationship increases if we consider only angiosperm species.  相似文献   

3.
Recent years have seen an increasing effort to incorporate phylogenetic hypotheses to the study of community assembly processes. The incorporation of such evolutionary information has been eased by the emergence of specialized software for the automatic estimation of partially resolved supertrees based on published phylogenies. Despite this growing interest in the use of phylogenies in ecological research, very few studies have attempted to quantify the potential biases related to the use of partially resolved phylogenies and to branch length accuracy, and no work has examined how tree shape may affect inference of community phylogenetic metrics. In this study, we tested the influence of phylogenetic resolution and branch length information on the quantification of phylogenetic structure, and also explored the impact of tree shape (stemminess) on the loss of accuracy in phylogenetic structure quantification due to phylogenetic resolution. For this purpose, we used 9 sets of phylogenetic hypotheses of varying resolution and branch lengths to calculate three indices of phylogenetic structure: the mean phylogenetic distance (NRI), the mean nearest taxon distance (NTI) and phylogenetic diversity (stdPD) metrics. The NRI metric was the less sensitive to phylogenetic resolution, stdPD showed an intermediate sensitivity, and NTI was the most sensitive one; NRI was also less sensitive to branch length accuracy than NTI and stdPD, the degree of sensitivity being strongly dependent on the dating method and the sample size. Directional biases were generally towards type II errors. Interestingly, we detected that tree shape influenced the accuracy loss derived from the lack of phylogenetic resolution, particularly for NRI and stdPD. We conclude that well‐resolved molecular phylogenies with accurate branch length information are needed to identify the underlying phylogenetic structure of communities, and also that sensitivity of phylogenetic structure measures to low phylogenetic resolution can strongly vary depending on phylogenetic tree shape.  相似文献   

4.

Background  

Phylogenetic comparative methods are often improved by complete phylogenies with meaningful branch lengths (e.g., divergence dates). This study presents a dated molecular supertree for all 34 world pinniped species derived from a weighted matrix representation with parsimony (MRP) supertree analysis of 50 gene trees, each determined under a maximum likelihood (ML) framework. Divergence times were determined by mapping the same sequence data (plus two additional genes) on to the supertree topology and calibrating the ML branch lengths against a range of fossil calibrations. We assessed the sensitivity of our supertree topology in two ways: 1) a second supertree with all mtDNA genes combined into a single source tree, and 2) likelihood-based supermatrix analyses. Divergence dates were also calculated using a Bayesian relaxed molecular clock with rate autocorrelation to test the sensitivity of our supertree results further.  相似文献   

5.
Aim When hypotheses of historical biogeography are evaluated, age estimates of individual nodes in a phylogeny often have a direct impact on what explanation is concluded to be most likely. Confidence intervals of estimated divergence times obtained in molecular dating analyses are usually very large, but the uncertainty is rarely incorporated in biogeographical analyses. The aim of this study is to use the group Urophylleae, which has a disjunct pantropical distribution, to explore how the uncertainty in estimated divergence times affects conclusions in biogeographical analysis. Two hypotheses are evaluated: (1) long‐distance dispersal from Africa to Asia and the Neotropics, and (2) a continuous distribution in the boreotropics, probably involving migration across the North Atlantic Land Bridge, followed by isolation in equatorial refugia. Location Tropical and subtropical Asia, tropical Africa, and central and southern tropical America. Methods This study uses parsimony and Bayesian phylogenetic analyses of chloroplast DNA and nuclear ribosomal DNA data from 56 ingroup species, beast molecular dating and a Bayesian approach to dispersal–vicariance analysis (Bayes‐DIVA) to reconstruct the ancestral area of the group, and the dispersal–extinction–cladogenesis method to test biogeographical hypotheses. Results When the two models of geographic range evolution were compared using the maximum likelihood (ML) tree with mean estimates of divergence times, boreotropical migration was indicated to be much more likely than long‐distance dispersal. Analyses of a large sample of dated phylogenies did, however, show that this result was not consistent. The age estimate of one specific node had a major impact on likelihood values and on which model performed best. The results show that boreotropical migration provides a slightly better explanation of the geographical distribution patterns of extant Urophylleae than long‐distance dispersal. Main conclusions This study shows that results from biogeographical analyses based on single phylogenetic trees, such as a ML or consensus tree, can be misleading, and that it may be very important to take the uncertainty in age estimates into account. Methods that account for the uncertainty in topology, branch lengths and estimated divergence times are not commonly used in biogeographical inference today but should definitely be preferred in order to avoid unwarranted conclusions.  相似文献   

6.
The evolution of body size in tetrapods is assessed using a database that includes 107 early stegocephalian species ranging in time from the Frasnian (Upper Devonian) to the Tatarian (Upper Permian). All analyses use methods that incorporate phylogenetic information (topology and branch lengths). In all tests, the impact of alternative topologies and branch lengths are assessed. Previous reports that raised doubts about the accuracy of squared-change parsimony assessment of ancestral character value appear to have used datasets in which there was no phylogenetic signal. Hence, squared-change parsimony may be more reliable than suggested in recent studies, at least when a phylogenetic signal is present in the datasets of interest. Analysis using random taxon reshuffling on three reference phylogenies shows that cranial and presacral length include a strong phylogenetic signal. Character optimization of body size in stegocephalians using squared-change parsimony on a time-calibrated phylogeny incorporating branch length information is used to test a previously published scenario on the origin of amniotes and of the amniotic egg that implies that the ancestors of amniotes were small (no more than 10 cm in snout-vent length), and that their size increased subsequent to the appearance of the amniotic egg. The optimization suggests that first amniotes were somewhat larger than previously hypothesized; the estimated snout-vent length is about 24 cm, and the lower end of the 95% confidence interval of the phylogeny that yields the smallest inferred size suggests that no ancestor of amniotes measured less than 12 cm in snout-vent length. Character optimization, permutational multiple linear regressions, and independent contrast analyses show that Cope's rule of phyletic size increase applies to early reptiliomorphs but that it does not apply to early stegocephalians globally.  相似文献   

7.
Phylogenetic trees inferred from sequence data often have branch lengths measured in the expected number of substitutions and therefore, do not have divergence times estimated. These trees give an incomplete view of evolutionary histories since many applications of phylogenies require time trees. Many methods have been developed to convert the inferred branch lengths from substitution unit to time unit using calibration points, but none is universally accepted as they are challenged in both scalability and accuracy under complex models. Here, we introduce a new method that formulates dating as a nonconvex optimization problem where the variance of log-transformed rate multipliers is minimized across the tree. On simulated and real data, we show that our method, wLogDate, is often more accurate than alternatives and is more robust to various model assumptions.  相似文献   

8.

Background  

Estimates of divergence dates between species improve our understanding of processes ranging from nucleotide substitution to speciation. Such estimates are frequently based on molecular genetic differences between species; therefore, they rely on accurate estimates of the number of such differences (i.e. substitutions per site, measured as branch length on phylogenies). We used simulations to determine the effects of dataset size, branch length heterogeneity, branch depth, and analytical framework on branch length estimation across a range of branch lengths. We then reanalyzed an empirical dataset for plethodontid salamanders to determine how inaccurate branch length estimation can affect estimates of divergence dates.  相似文献   

9.
Eastern North America is the location of the world's most species-rich temperate freshwater fish fauna. Hypotheses regarding the geographic and temporal scale of teleost diversification in this region have not been broadly investigated using absolute divergence time estimates among the constituent lineages. This study used time-calibrated molecular phylogenies estimated from mitochondrial and nuclear genes to investigate the temporal and geographic signatures of diversification within barcheek darters, a clade of allopatrically distributed species endemic to the Eastern Highlands. Results from divergence time estimates using an uncorrelated lognormal model suggest that the barcheek darters are an ancient group with a crown node estimate of 16.3 mya, 95% highest posterior density (HPD): [12.4, 20.5], and the clade is characterized by substantial intraspecific divergence times within several species. In particular, the Caney Fork endemic Etheostoma basilare comprises five strongly supported and deeply divergent clades with a most recent common ancestor estimated at 8.0 mya, 95% HPD: [5.6, 10.7]. These results are concordant with the hypothesis that geologically stable areas of eastern North America have facilitated both the generation and preservation of lineages across a substantial breadth of evolutionary time, and that allopatric speciation in darters has occurred at much smaller spatial scales than previously realized.  相似文献   

10.

Background  

In recent years there has been a trend of leaving the strict molecular clock in order to infer dating of speciations and other evolutionary events. Explicit modeling of substitution rates and divergence times makes formulation of informative prior distributions for branch lengths possible. Models with birth-death priors on tree branching and auto-correlated or iid substitution rates among lineages have been proposed, enabling simultaneous inference of substitution rates and divergence times. This problem has, however, mainly been analysed in the Markov chain Monte Carlo (MCMC) framework, an approach requiring computation times of hours or days when applied to large phylogenies.  相似文献   

11.
An important challenge in evolutionary biology is to understand how major changes in body form arise. The dramatic transition from a lizard-like to snake-like body form in squamate reptiles offers an exciting system for such research because this change is replicated dozens of times. Here, we use morphometric data for 258 species and a time-calibrated phylogeny to explore rates and patterns of body-form evolution across squamates. We also demonstrate how time-calibrated phylogenies may be used to make inferences about the time frame over which major morphological transitions occur. Using the morphometric data, we find that the transition from lizard-like to snake-like body form involves concerted evolution of limb reduction, digit loss, and body elongation. These correlations are similar across squamate clades, despite very different ecologies and >180 million years (My) of divergence. Using the time-calibrated phylogeny and ancestral reconstructions, we find that the dramatic transition between these body forms can occur in 20 My or less, but that seemingly intermediate morphologies can also persist for tens of millions of years. Finally, although loss of digits is common, we find statistically significant support for at least six examples of the re-evolution of lost digits in the forelimb and hind limb.  相似文献   

12.
The ancestral distance test is introduced to detect correlated evolution between two binary traits in large phylogenies that may lack resolved subclades, branch lengths, and/or comparative data. We define the ancestral distance as the time separating a randomly sampled taxon from its most recent ancestor (MRA) with extant descendants that have an independent trait. The sampled taxon either has (target sample) or lacks (nontarget sample) a dependent trait. Modeled as a Markov process, we show that the distribution of ancestral distances for the target sample is identical to that of the nontarget sample when characters are uncorrelated, whereas ancestral distances are smaller on average for the target sample when characters are correlated. Simulations suggest that the ancestral distance can be estimated using the time, total branch length, taxonomic rank, or number of speciation events between a sampled taxon and the MRA. These results are shown to be robust to deviations from Markov assumptions. A Monte Carlo technique estimates P-values when fully resolved phylogenies with branch lengths are available, and we evaluate the Monte Carlo approach using a data set with known correlation. Measures of relatedness were found to provide a robust means to test hypotheses of correlated character evolution.  相似文献   

13.
We examined Type I error rates of Felsenstein's (1985; Am. Nat. 125:1-15) comparative method of phylogenetically independent contrasts when branch lengths are in error and the model of evolution is not Brownian motion. We used seven evolutionary models, six of which depart strongly from Brownian motion, to simulate the evolution of two continuously valued characters along two different phylogenies (15 and 49 species). First, we examined the performance of independent contrasts when branch lengths are distorted systematically, for example, by taking the square root of each branch segment. These distortions often caused inflated Type I error rates, but performance was almost always restored when branch length transformations were used. Next, we investigated effects of random errors in branch lengths. After the data were simulated, we added errors to the branch lengths and then used the altered phylogenies to estimate character correlations. Errors in the branches could be of two types: fixed, where branch lengths are either shortened or lengthened by a fixed fraction; or variable, where the error is a normal variate with mean zero and the variance is scaled to the length of the branch (so that expected error relative to branch length is constant for the whole tree). Thus, the error added is unrelated to the microevolutionary model. Without branch length checks and transformations, independent contrasts tended to yield extremely inflated and highly variable Type I error rates. Type I error rates were reduced, however, when branch lengths were checked and transformed as proposed by Garland et al. (1992; Syst. Biol. 41:18-32), and almost never exceeded twice the nominal P-value at alpha = 0.05. Our results also indicate that, if branch length transformations are applied, then the appropriate degrees of freedom for testing the significance of a correlation coefficient should, in general, be reduced to account for estimation of the best branch length transformation. These results extend those reported in Díaz-Uriarte and Garland (1996; Syst. Biol. 45:27-47), and show that, even with errors in branch lengths and evolutionary models different from Brownian motion, independent contrasts are a robust method for testing hypotheses of correlated evolution.  相似文献   

14.
Branch length estimates play a central role in maximum-likelihood (ML) and minimum-evolution (ME) methods of phylogenetic inference. For various reasons, branch length estimates are not statistically independent under ML or ME. We studied the response of correlations among branch length estimates to the degree of among-branch length heterogeneity (BLH) in the model (true) tree. The frequency and magnitude of (especially negative) correlations among branch length estimates were both shown to increase as BLH increases under simulation and analytically. For ML, we used the correct model (Jukes–Cantor). For ME, we employed ordinary least-squares (OLS) branch lengths estimated under both simple p-distances and Jukes–Cantor distances, analyzed with and without an among-site rate heterogeneity parameter. The efficiency of ME and ML was also shown to decrease in response to increased BLH. We note that the shape of the true tree will in part determine BLH and represents a critical factor in the probability of recovering the correct topology. An important finding suggests that researchers cannot expect that different branches that were in fact the same length will have the same probability of being accurately reconstructed when BLH exists in the overall tree. We conclude that methods designed to minimize the interdependencies of branch length estimates (BLEs) may (1) reduce both the variance and the covariance associated with the estimates and (2) increase the efficiency of model-based optimality criteria. We speculate on possible ways to reduce the nonindependence of BLEs under OLS and ML. Received: 9 March 1999 / Accepted: 4 May 1999  相似文献   

15.
The relative efficiencies of the maximum-likelihood (ML), neighbor- joining (NJ), and maximum-parsimony (MP) methods in obtaining the correct topology and in estimating the branch lengths for the case of four DNA sequences were studied by computer simulation, under the assumption either that there is variation in substitution rate among different nucleotide sites or that there is no variation. For the NJ method, several different distance measures (Jukes-Cantor, Kimura two- parameter, and gamma distances) were used, whereas for the ML method three different transition/transversion ratios (R) were used. For the MP method, both the standard unweighted parsimony and the dynamically weighted parsimony methods were used. The results obtained are as follows: (1) When the R value is high, dynamically weighted parsimony is more efficient than unweighted parsimony in obtaining the correct topology. (2) However, both weighted and unweighted parsimony methods are generally less efficient than the NJ and ML methods even in the case where the MP method gives a consistent tree. (3) When all the assumptions of the ML method are satisfied, this method is slightly more efficient than the NJ method. However, when the assumptions are not satisfied, the NJ method with gamma distances is slightly better in obtaining the correct topology than is the ML method. In general, the two methods show more or less the same performance. The NJ method may give a correct topology even when the distance measures used are not unbiased estimators of nucleotide substitutions. (4) Branch length estimates of a tree with the correct topology are affected more easily than topology by violation of the assumptions of the mathematical model used, for both the ML and the NJ methods. Under certain conditions, branch lengths are seriously overestimated or underestimated. The MP method often gives serious underestimates for certain branches. (5) Distance measures that generate the correct topology, with high probability, do not necessarily give good estimates of branch lengths. (6) The likelihood-ratio test and the confidence-limit test, in Felsenstein's DNAML, for examining the statistical of branch length estimates are quite sensitive to violation of the assumptions and are generally too liberal to be used for actual data. Rzhetsky and Nei's branch length test is less sensitive to violation of the assumptions than is Felsenstein's test. (7) When the extent of sequence divergence is < or = 5% and when > or = 1,000 nucleotides are used, all three methods show essentially the same efficiency in obtaining the correct topology and in estimating branch lengths.(ABSTRACT TRUNCATED AT 400 WORDS)   相似文献   

16.

Background  

Biology has increasingly recognized the necessity to build and utilize larger phylogenies to address broad evolutionary questions. Large phylogenies have facilitated the discovery of differential rates of molecular evolution between trees and herbs. They have helped us understand the diversification patterns of mammals as well as the patterns of seed evolution. In addition to these broad evolutionary questions there is increasing awareness of the importance of large phylogenies for addressing conservation issues such as biodiversity hotspots and response to global change. Two major classes of methods have been employed to accomplish the large tree-building task: supertrees and supermatrices. Although these methods are continually being developed, they have yet to be made fully accessible to comparative biologists making extremely large trees rare.  相似文献   

17.
The last decades have seen an upsurge in ecological studies incorporating phylogenetic information with increasing species samples, motivated by the common conjecture that species with common ancestors should share some ecological characteristics due to niche conservatism. This has been carried out using various methods of increasing complexity and reliability: using only taxonomical classification; constructing supertrees that incorporate only topological information from previously published phylogenies; or building supermatrices of molecular data that are used to estimate phylogenies with evolutionary meaningful branch lengths. Although the latter option is more informative than the others, it remains under‐used in ecology because ecologists are generally unaware of or unfamiliar with modern molecular phylogenetic methods. However, a solid phylogenetic hypothesis is necessary to conduct reliable ecological analysis integrating evolutive aspects. Our aim here is to clarify the concepts and methodological issues associated with the reconstruction of dated megaphylogenies, and to show that it is nowadays possible to obtain accurate and well sampled megaphylogenies with informative branch‐lengths on large species samples. This is possible thanks to improved phylogenetic methods, vast amounts of molecular data available from databases such as Genbank, and consensus knowledge on deep phylogenetic relationships for an increasing number of groups of organisms. Finally, we include a detailed step‐by‐step workflow pipeline (Supplementary material), from data acquisition to phylogenetic inference, mainly based on the R environment (widely used by ecologists) and the use of free web‐servers, that has been applied to the reconstruction of a species‐level phylogeny of all breeding birds of Europe.  相似文献   

18.
We have investigated the effects of different among-site rate variation models on the estimation of substitution model parameters, branch lengths, topology, and bootstrap proportions under minimum evolution (ME) and maximum likelihood (ML). Specifically, we examined equal rates, invariable sites, gamma-distributed rates, and site-specific rates (SSR) models, using mitochondrial DNA sequence data from three protein-coding genes and one tRNA gene from species of the New Zealand cicada genus Maoricicada. Estimates of topology were relatively insensitive to the substitution model used; however, estimates of bootstrap support, branch lengths, and R-matrices (underlying relative substitution rate matrix) were strongly influenced by the assumptions of the substitution model. We identified one situation where ME and ML tree building became inaccurate when implemented with an inappropriate among-site rate variation model. Despite the fact the SSR models often have a better fit to the data than do invariable sites and gamma rates models, SSR models have some serious weaknesses. First, SSR rate parameters are not comparable across data sets, unlike the proportion of invariable sites or the alpha shape parameter of the gamma distribution. Second, the extreme among-site rate variation within codon positions is problematic for SSR models, which explicitly assume rate homogeneity within each rate class. Third, the SSR models appear to give severe underestimates of R-matrices and branch lengths relative to invariable sites and gamma rates models in this example. We recommend performing phylogenetic analyses under a range of substitution models to test the effects of model assumptions not only on estimates of topology but also on estimates of branch length and nodal support.  相似文献   

19.
A central challenge facing the temporal calibration of molecular phylogenies is finding a quantitative method for estimating maximum age constraints on lineage divergence times. Here, I provide such a method. This method requires an ultrametric tree generated without reference to the fossil record. Exploiting the fact that the relative branch lengths on the ultrametric tree are proportional to time, this method identifies the lineage with the greatest proportion of its true temporal range covered by the fossil record. The oldest fossil of this calibration lineage is used as the minimum age constraint. The maximum age constraint is obtained by adding a confidence interval onto the end point of the calibration lineage, thus making it possible to bracket the true divergence times of all lineages on the tree. The approach can also identify fossils that have been grossly misdated or misassigned to the phylogeny. The method assumes that the relative branch lengths on the ultrametric tree are accurate and that fossilization is random. The effect of violations of these assumptions is assessed. This method is simple to use and is illustrated with a reanalysis of Near et al.'s turtle data.  相似文献   

20.
Since branch lengths provide important information about the timing and the extent of evolutionary divergence among taxa, accurate resolution of evolutionary history depends as much on branch length estimates as on recovery of the correct topology. However, the empirical relationship between the choice of genes to sequence and the quality of branch length estimation remains ill defined. To address this issue, we evaluated the accuracy of branch lengths estimated from subsets of the mitochondrial genome for a mammalian phylogeny with known subordinal relationships. Using maximum-likelihood methods, we estimated branch lengths from an 11-kb sequence of all 13 protein-coding genes and compared them with estimates from single genes (0.2-1.8 kb) and from 7 different combinations of genes (2-3.5 kb). For each sequence, we separated the component of the log-likelihood deviation due to branch length differences associated with alternative topologies from that due to those that are independent of the topology. Even among the sequences that recovered the same tree topology, some produced significantly better branch length estimates than others did. The combination of correct topology and significantly better branch length estimation suggests that these gene combinations may prove useful in estimating phylogenetic relationships for mammalian divergences below the ordinal level. Thus, the proper choice of genes to sequence is a critical factor for reliable estimation of evolutionary history from molecular data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号