首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Estimating phylogenetic relationships among closely related species can be extremely difficult when there is incongruence among gene trees and between the gene trees and the species tree. Here we show that incorporating a model of the stochastic loss of gene lineages by genetic drift into the phylogenetic estimation procedure can provide a robust estimate of species relationships, despite widespread incomplete sorting of ancestral polymorphism. This approach is applied to a group of montane Melanoplus grasshoppers for which genealogical discordance among loci and incomplete lineage sorting obscures any obvious phylogenetic relationships among species. Unlike traditional treatments where gene trees estimated using standard phylogenetic methods are implicitly equated with the species tree, with the coalescent-based approach the species tree is modeled probabilistically from the estimated gene trees. The estimated species phylogeny (the ESP) is calculated for the grasshoppers from multiple gene trees reconstructed for nuclear loci and a mitochondrial gene. This empirical application is coupled with a simulation study to explore the performance of the coalescent-based approach. Specifically, we test the accuracy of the ESP given the data based on analyses of simulated data matching the multilocus data collected in Melanoplus (i.e., data were simulated for each locus with the same number of base pairs and locus-specific mutational models). The results of the study show that ESPs can be computed using the coalescent-based approach long before reciprocal monophyly has been achieved, and that these statistical estimates are accurate. This contrasts with analyses of the empirical data collected in Melanoplus and simulated data based on concatenation of multiple loci, for which the incomplete lineage sorting of recently diverged species posed significant problems. The strengths and potential challenges associated with incorporating an explicit model of gene-lineage coalescence into the phylogenetic procedure to obtain an ESP, as illustrated by application to Melanoplus, versus concatenation and consensus approaches are discussed. This study represents a fundamental shift in how species relationships are estimated - the relationship between the gene trees and the species phylogeny is modeled probabilistically rather than equating gene trees with a species tree.  相似文献   

2.
Species complexes undergoing rapid radiation present a challenge in molecular systematics because of the possibility that ancestral polymorphism is retained in component gene trees. Coalescent theory has demonstrated that gene trees often fail to match lineage trees when taxon divergence times are less than the ancestral effective population sizes. Suggestions to increase the number of loci and the number of individuals per taxon have been proposed; however, phylogenetic methods to adequately analyze these data in a coalescent framework are scarce. We compare two approaches to estimating lineage (species) trees using multiple individuals and multiple loci: the commonly used partitioned Bayesian analysis of concatenated sequences and a modification of a newly developed hierarchical Bayesian method (BEST) that simultaneously estimates gene trees and species trees from multilocus data. We test these approaches on a phylogeny of rapidly radiating species wherein divergence times are likely to be smaller than effective population sizes, and incomplete lineage sorting is known, in the rodent genus, Thomomys. We use seven independent noncoding nuclear sequence loci (total approximately 4300 bp) and between 1 and 12 individuals per taxon to construct a phylogenetic hypothesis for eight Thomomys species. The majority-rule consensus tree from the partitioned concatenated analysis included 14 strongly supported bipartitions, corroborating monophyletic species status of five of the eight named species. The BEST tree strongly supported only the split between the two subgenera and showed very low support for any other clade. Comparison of both lineage trees to individual gene trees revealed that the concatenation method appears to ignore conflicting signals among gene trees, whereas the BEST tree considers conflicting signals and downweights support for those nodes. Bayes factor analysis of posterior tree distributions from both analyses strongly favor the model underlying the BEST analysis. This comparison underscores the risks of overreliance on results from concatenation, and ignoring the properties of coalescence, especially in cases of recent, rapid radiations.  相似文献   

3.
Interest in methods that estimate speciation and extinction rates from molecular phylogenies has increased over the last decade. The application of such methods requires reliable estimates of tree topology and node ages, which are frequently obtained using standard phylogenetic inference combining concatenated loci and molecular dating. However, this practice disregards population‐level processes that generate gene tree/species tree discordance. We evaluated the impact of employing concatenation and coalescent‐based phylogeny inference in recovering the correct macroevolutionary regime using simulated data based on the well‐established diversification rate shift of delphinids in Cetacea. We found that under scenarios of strong incomplete lineage sorting, macroevolutionary analysis of phylogenies inferred by concatenating loci failed to recover the delphinid diversification shift, while the coalescent‐based tree consistently retrieved the correct rate regime. We suggest that ignoring microevolutionary processes reduces the power of methods that estimate macroevolutionary regimes from molecular data.  相似文献   

4.
The phylogenetic relationships for rapid species radiations are difficult to disentangle. Here we study one such case, namely the genus Triturus, which is composed of the marbled and crested newts. We analyze data for 38 genetic markers, positioned in 3-prime untranslated regions of protein-coding genes, obtained with 454 sequencing. Our dataset includes twenty Triturus newts and represents all nine species. Bayesian analysis of population structure allocates all individuals to their respective species. The branching patterns obtained by data concatenation, Bayesian concordance analysis and coalescent-based estimations of the species tree differ from one another. The data concatenation based species tree shows high branch support but branching order is considerably affected by allele choice in the case of heterozygotes in the concatenation process. Bayesian concordance analysis expresses the conflict between individual gene trees for part of the Triturus species tree as low concordance factors. The coalescent-based species tree is relatively similar to a previously published species tree based upon morphology and full mtDNA and any conflicting internal branches are not highly supported. Our findings reflect high gene tree discordance due to incomplete lineage sorting (possibly aggravated by hybridization) in combination with low information content of the markers employed (as can be expected for relatively recent species radiations). This case study highlights the complexity of resolving rapid radiations and we acknowledge that to convincingly resolve the Triturus species tree even more genes will have to be consulted.  相似文献   

5.
The use of diverse data sets in phylogenetic studies aiming for understanding evolutionary histories of species can yield conflicting inference. Phylogenetic conflicts observed in animal and plant systems have often been explained by hybridization, incomplete lineage sorting (ILS), or horizontal gene transfer. Here, we used target enrichment data, species tree, and species network approaches to infer the backbone phylogeny of the family Caprifoliaceae, while distinguishing among sources of incongruence. We used 713 nuclear loci and 46 complete plastome sequence data from 43 samples representing 38 species from all major clades to reconstruct the phylogeny of the family using concatenation and coalescence approaches. We found significant nuclear gene tree conflict as well as cytonuclear discordance. Additionally, coalescent simulations and phylogenetic species network analyses suggested putative ancient hybridization among subfamilies of Caprifoliaceae, which seems to be the main source of phylogenetic discordance. Ancestral state reconstruction of six morphological characters revealed some homoplasy for each character examined. By dating the branching events, we inferred the origin of Caprifoliaceae at approximately 66.65 Ma in the late Cretaceous. By integrating evidence from molecular phylogeny, divergence times, and morphology, we here recognize Zabelioideae as a new subfamily in Caprifoliaceae. This work shows the necessity of using a combination of multiple approaches to identify the sources of gene tree discordance. Our study also highlights the importance of using data from both nuclear and plastid genomes to reconstruct deep and shallow phylogenies of plants.  相似文献   

6.
Recent computational advances provide novel opportunities to infer species trees based on multiple independent loci. Thus, single gene trees no longer need suffice as proxies for species phylogenies. Several methods have been developed to deal with the challenges posed by incomplete and stochastic lineage sorting. In this study, we employed four Bayesian methods to infer the phylogeny of a clade of 11 recently diverged oriole species within the genus Icterus. We obtained well-resolved and mostly congruent phylogenies using a set of seven unlinked nuclear intron loci and sampling multiple individuals per species. Most notably, Bayesian concordance analysis generally agreed well with concatenation; the two methods agreed fully on eight of nine nodes. The coalescent-based method BEAST further supported six of these eight nodes. The fourth method used, BEST, failed to converge despite exhaustive efforts to optimize the tree search. Overall, the results obtained by new species tree methods and concatenation generally corroborate our findings from previous analyses and data sets. However, we found striking disagreement between mitochondrial and nuclear DNA involving relationships within the northern oriole group. Our results highlight the danger of reliance on mtDNA alone for phylogenetic inference. We demonstrate that in spite of low variability and incomplete lineage sorting, multiple nuclear loci can produce largely congruent phylogenies based on multiple species tree methods, even for very closely-related species.  相似文献   

7.
8.
Species tree methods have provided improvements for estimating species relationships and the timing of diversification in recent radiations by allowing for gene tree discordance. Although gene tree discordance is often observed, most discordance is attributed to incomplete lineage sorting rather than other biological phenomena, and the causes of discordance are rarely investigated. We use species trees from multi-locus data to estimate the species relationships, evolutionary history and timing of diversification among Australian Gehyra—a group renowned for taxonomic uncertainty and showing a large degree of gene tree discordance. We find support for a recent Asian origin and two major clades: a tropically adapted clade and an arid adapted clade, with some exceptions, but no support for allopatric speciation driven by chromosomal rearrangement in the group. Bayesian concordance analysis revealed high gene tree discordance and comparisons of Robinson–Foulds distances showed that discordance between gene trees was significantly higher than that generated by topological uncertainty within each gene. Analysis of gene tree discordance and incomplete taxon sampling revealed that gene tree discordance was high whether terminal taxon or gene sampling was maximized, indicating discordance is due to biological processes, which may be important in contributing to gene tree discordance in many recently diversified organisms.  相似文献   

9.
Population genetic theory predicts discordance in the true phylogeny of different genomic regions when studying recently diverged species. Despite this expectation, genome-wide discordance in young species groups has rarely been statistically quantified. The house mouse subspecies group provides a model system for examining phylogenetic discordance. House mouse subspecies are recently derived, suggesting that even if there has been a simple tree-like population history, gene trees could disagree with the population history due to incomplete lineage sorting. Subspecies of house mice also hybridize in nature, raising the possibility that recent introgression might lead to additional phylogenetic discordance. Single-locus approaches have revealed support for conflicting topologies, resulting in a subspecies tree often summarized as a polytomy. To analyze phylogenetic histories on a genomic scale, we applied a recently developed method, Bayesian concordance analysis, to dense SNP data from three closely related subspecies of house mice: Mus musculus musculus, M. m. castaneus, and M. m. domesticus. We documented substantial variation in phylogenetic history across the genome. Although each of the three possible topologies was strongly supported by a large number of loci, there was statistical evidence for a primary phylogenetic history in which M. m. musculus and M. m. castaneus are sister subspecies. These results underscore the importance of measuring phylogenetic discordance in other recently diverged groups using methods such as Bayesian concordance analysis, which are designed for this purpose.  相似文献   

10.
The application of phylogenetic inference methods, to data for a set of independent genes sampled randomly throughout the genome, often results in substantial incongruence in the single-gene phylogenetic estimates. Among the processes known to produce discord between single-gene phylogenies, two of the best studied in a phylogenetic context are hybridization and incomplete lineage sorting. Much recent attention has focused on the development of methods for estimating species phylogenies in the presence of incomplete lineage sorting, but phylogenetic models that allow for hybridization have been more limited. Here we propose a model that allows incongruence in single-gene phylogenies to be due to both hybridization and incomplete lineage sorting, with the goal of determining the contribution of hybridization to observed gene tree incongruence in the presence of incomplete lineage sorting. Using our model, we propose methods for estimating the extent of the role of hybridization in both a likelihood and a Bayesian framework. The performance of our methods is examined using both simulated and empirical data.  相似文献   

11.
Clades that have undergone episodes of rapid cladogenesis are challenging from a phylogenetic point of view. They are generally characterised by short or missing internal branches in phylogenetic trees and by conflicting topologies among individual gene trees. This may be the case of the subfamily Trematominae, a group of marine teleosts of coastal Antarctic waters, which is considered to have passed through a period of rapid diversification. Despite much phylogenetic attention, the relationships among Trematominae species remain unclear. In contrast to previous studies that were mostly based on concatenated datasets of mitochondrial and/or single nuclear loci, we applied various single-locus and multilocus phylogenetic approaches to sequences from 11 loci (eight nuclear) and we also used several methods to assess the hypothesis of a radiation event in Trematominae evolution. Diversification rate analyses support the hypothesis of a period of rapid diversification during Trematominae history and only a few nodes in the hypothetical species tree were consistently resolved with various phylogenetic methods. We detected significant discrepancies among trees from individual genes of these species, most probably resulting from incomplete lineage sorting, suggesting that concatenation of loci is not the most appropriate way to investigate Trematominae species interrelationships. These data also provide information about the possible effects of historic climate changes on the diversification rate of this group of fish.  相似文献   

12.
Genome-scale data have greatly facilitated the resolution of recalcitrant nodes that Sanger-based datasets have been unable to resolve. However, phylogenomic studies continue to use traditional methods such as bootstrapping to estimate branch support; and high bootstrap values are still interpreted as providing strong support for the correct topology. Furthermore, relatively little attention has been given to assessing discordances between gene and species trees, and the underlying processes that produce phylogenetic conflict. We generated novel genomic datasets to characterize and determine the causes of discordance in Old World treefrogs (Family: Rhacophoridae)—a group that is fraught with conflicting and poorly supported topologies among major clades. Additionally, a suite of data filtering strategies and analytical methods were applied to assess their impact on phylogenetic inference. We showed that incomplete lineage sorting was detected at all nodes that exhibited high levels of discordance. Those nodes were also associated with extremely short internal branches. We also clearly demonstrate that bootstrap values do not reflect uncertainty or confidence for the correct topology and, hence, should not be used as a measure of branch support in phylogenomic datasets. Overall, we showed that phylogenetic discordances in Old World treefrogs resulted from incomplete lineage sorting and that species tree inference can be improved using a multi-faceted, total-evidence approach, which uses the most amount of data and considers results from different analytical methods and datasets.  相似文献   

13.
Estimating species trees using multiple-allele DNA sequence data   总被引:3,自引:0,他引:3  
Several techniques, such as concatenation and consensus methods, are available for combining data from multiple loci to produce a single statement of phylogenetic relationships. However, when multiple alleles are sampled from individual species, it becomes more challenging to estimate relationships at the level of species, either because concatenation becomes inappropriate due to conflicts among individual gene trees, or because the species from which multiple alleles have been sampled may not form monophyletic groups in the estimated tree. We propose a Bayesian hierarchical model to reconstruct species trees from multiple-allele, multilocus sequence data, building on a recently proposed method for estimating species trees from single allele multilocus data. A two-step Markov Chain Monte Carlo (MCMC) algorithm is adopted to estimate the posterior distribution of the species tree. The model is applied to estimate the posterior distribution of species trees for two multiple-allele datasets--yeast (Saccharomyces) and birds (Manacus-manakins). The estimates of the species trees using our method are consistent with those inferred from other methods and genetic markers, but in contrast to other species tree methods, it provides credible regions for the species tree. The Bayesian approach described here provides a powerful framework for statistical testing and integration of population genetics and phylogenetics.  相似文献   

14.
Gene trees are often assumed to be equivalent to species trees, but processes such as incomplete lineage sorting can generate incongruence among gene topologies and analyzing multilocus data in concatenated matrices can be prone to systematic errors. Accordingly, a variety of new methods have been developed to estimate species trees using multilocus data sets. Here, we apply some of these methods to reconstruct the phylogeny of Buarremon and near relatives, a group in which phylogenetic analyses of mitochondrial DNA sequences produced results that were inconsistent with relationships implied by a taxonomy based on variation in external phenotype. Gene genealogies obtained for seven loci (one mitochondrial, six nuclear) were varied, with some supporting and some rejecting the monophyly of Buarremon. Overall, our species-tree analyses tended to support a monophyletic Buarremon, but due to lack of congruence between methodologies, resolution of the phylogeny of this group remains uncertain. More generally, our study indicates that the number of individuals sampled can have an important effect on phylogenetic reconstruction, that the use of seven markers does not guarantee obtaining a strongly-supported species tree, and that methods for species-tree reconstruction can produce different results using the same data; these are important considerations for researchers using these new phylogenetic approaches in other systems.  相似文献   

15.
With the continued adoption of genome‐scale data in evolutionary biology comes the challenge of adequately harnessing the information to make accurate phylogenetic inferences. Coalescent‐based methods of species tree inference have become common, and concatenation has been shown in simulation to perform well, particularly when levels of incomplete lineage sorting are low. However, simulation conditions are often overly simplistic, leaving empiricists with uncertainty regarding analytical tools. We use a large ultraconserved element data set (>3,000 loci) from rattlesnakes of the Crotalus triseriatus group to delimit lineages and estimate species trees using concatenation and several coalescent‐based methods. Unpartitioned and partitioned maximum likelihood and Bayesian analysis of the concatenated matrix yield a topology identical to coalescent analysis of a subset of the data in bpp . ASTRAL analysis on a subset of the more variable loci also results in a tree consistent with concatenation and bpp , whereas the SVDquartets phylogeny differs at additional nodes. The size of the concatenated matrix has a strong effect on species tree inference using SVDquartets , warranting additional investigation on optimal data characteristics for this method. Species delimitation analyses suggest up to 16 unique lineages may be present within the C. triseriatus group, with divergences occurring during the Neogene and Quaternary. Network analyses suggest hybridization within the group is relatively rare. Altogether, our results reaffirm the Mexican highlands as a biodiversity hotspot and suggest that coalescent‐based species tree inference on data subsets can provide a strongly supported species tree consistent with concatenation of all loci with a large amount of missing data.  相似文献   

16.
Mouse lemurs (genus Microcebus) are nocturnal primates endemic to the island of Madagascar. Until recently, they were classified as two species, one from eastern and one from western Madagascar. Previously published analyses of morphometric and mitochondrial DNA data show strong support for the recognition of more than eight species, however. Here, we test the eight-species hypothesis with DNA data derived from four independently segregating nuclear loci. We find many areas of congruence between the mitochondrial and nuclear data, but incomplete lineage sorting and low mutation rates limit the phylogenetic resolution of the nuclear data. Even so, the nuclear loci unanimously find evidence for three deeply diverged lineages within the mouse lemur radiation: one that is congruent with the mtDNA "southern clade", another that is congruent with the mtDNA "northern clade", and one monospecific branch comprised of the species Microcebus ravelobensis. The latter result in particular emphasizes the need for careful biological study of this species.  相似文献   

17.
Despite the broad adoption of multispecies coalescent (MSC) methods for nuclear phylogenomics, they have yet to be applied to mitochondrial (mt) genomic data. As the potential sources of phylogenomic bias that MSC methods can address, such as incomplete lineage sorting, horizontal gene transfer and gene tree heterogeneity, have been found in mt genomic data, these approaches may improve the accuracy of phylogenetic inference with these data. In the present study, we examined the behaviour of MSC methods in reconstructing the phylogeny of Lepidoptera (butterflies and moths), a group for which mt genomic data are known to have strong resolving power. Traditional concatenation methods of analysing mt genomes for Lepidoptera infer topologies highly congruent with those generated from independent nuclear datasets. Individual mt gene trees performed poorly in recovering consensus relationships at deep levels (i.e. superfamily monophyly and inter-relationships) and only moderately well for shallow relationships (i.e. within Papilionoidea). In contrast, MSC analyses with ASTRAL performed strongly with almost complete concordance to both concatenated mt genome analyses and independent nuclear analyses at both deep and shallow phylogenetic scales. Outgroup choice had a limited impact on tree accuracy, with even phylogenetically distant outgroups still resulting in topologies highly congruent with results from nuclear datasets, although MSC analyses appeared to be marginally more affected by outgroup choice than concatenation analyses. In general, discordance between concatenation and MSC analyses was found at nodes whose resolution varied between previous nuclear phylogenomic studies. The sensitivity of individual relationships to analysis with MSC vs concatenation can thus be used to test the robustness of phylogenetic hypotheses. For insect phylogenetics, MSC is a reliable inference method for mt genomic data and is thus a useful complement to the already widely used concatenation approaches.  相似文献   

18.
Gene trees will often differ from the true species history, the species tree, as a result of processes such as incomplete lineage sorting. New methods such as Bayesian Estimation of the Species Tree (BEST) use the multispecies coalescent to model lineage sorting, and directly infer the species tree from multilocus DNA sequence data. The Sulidae (Aves: Pelecaniformes) is a family of ten booby and gannet species with a global distribution. We sequenced five nuclear intron loci and one mitochondrial locus to estimate a species tree for the Sulidae using both BEST and by concatenating nuclear loci. We also used fossil calibrated strict and relaxed molecular clocks in BEAST to estimate divergence times for major nodes in the sulid phylogeny. Individual gene trees showed little phylogenetic conflict but varied in resolution. With the exception of the mitochondrial gene tree, no gene tree was completely resolved. On the other hand, both the BEST and concatenated species trees were highly resolved, strongly supported, and topologically consistent with each other. The three sulid genera (Morus, Sula, Papasula) were monophyletic and the relationships within genera were mostly consistent with both a previously estimated mtDNA gene tree and the mtDNA gene tree estimated here. However, our species trees conflicted with the mtDNA gene trees in the relationships among the three genera. Most notably, we find that the endemic and endangered Abbott's booby (Papasula abbotti) is likely basal to all other members of the Sulidae and diverged from them approximately 22 million years ago.  相似文献   

19.
Analyses of the increasingly available genomic data continue to reveal the extent of hybridization and its role in the evolutionary diversification of various groups of species. We show, through extensive coalescent-based simulations of multilocus data sets on phylogenetic networks, how divergence times before and after hybridization events can result in incomplete lineage sorting with gene tree incongruence signatures identical to those exhibited by hybridization. Evolutionary analysis of such data under the assumption of a species tree model can miss all hybridization events, whereas analysis under the assumption of a species network model would grossly overestimate hybridization events. These issues necessitate a paradigm shift in evolutionary analysis under these scenarios, from a model that assumes a priori a single source of gene tree incongruence to one that integrates multiple sources in a unifying framework. We propose a framework of coalescence within the branches of a phylogenetic network and show how this framework can be used to detect hybridization despite incomplete lineage sorting. We apply the model to simulated data and show that the signature of hybridization can be revealed as long as the interval between the divergence times of the species involved in hybridization is not too small. We reanalyze a data set of 106 loci from 7 in-group Saccharomyces species for which a species tree with no hybridization has been reported in the literature. Our analysis supports the hypothesis that hybridization occurred during the evolution of this group, explaining a large amount of the incongruence in the data. Our findings show that an integrative approach to gene tree incongruence and its reconciliation is needed. Our framework will help in systematically analyzing genomic data for the occurrence of hybridization and elucidating its evolutionary role.  相似文献   

20.
Discordance between mitochondrial and nuclear phylogenies is being increasingly recognized in animals and may confound DNA‐based taxonomy. This is especially relevant for taxa whose microscopic size often challenges any effort to distinguish between cryptic species without the assistance of molecular data. Regarding mitonuclear discordance, two strikingly contrasting scenarios have been recently demonstrated in the monogonont rotifers of the genus Brachionus. While strict mitonuclear concordance was observed in the marine B. plicatilis species complex, widespread hybridization‐driven mitonuclear discordance was revealed in the freshwater B. calyciflorus species complex. Here, we investigated the frequency of occurrence and the potential drivers of mitonuclear discordance in three additional freshwater monogonont rotifer taxa, and assessed its potential impact on the reliability of DNA taxonomy results based on commonly used single markers. We studied the cryptic species complexes of Keratella cochlearis, Polyarthra dolichoptera and Synchaeta pectinata. Phylogenetic reconstructions were based on the mitochondrial barcoding marker cytochrome c oxidase subunit I gene and the nuclear internal transcribed spacer 1 locus, which currently represent the two most typical genetic markers used in rotifer DNA taxonomy. Species were delimited according to each marker separately using a combination of tree‐based coalescent, distance‐based and allele‐sharing‐based approaches. Mitonuclear discordance was observed in all species complexes with incomplete lineage sorting and unresolved phylogenetic reconstructions recognized as the likely drivers. Evidence from additional sources, such as morphology and ecology, is thus advisable for deciding between often contrasting mitochondrial and nuclear species scenarios in these organisms.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号