首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 500 毫秒
1.
The use of diverse data sets in phylogenetic studies aiming for understanding evolutionary histories of species can yield conflicting inference. Phylogenetic conflicts observed in animal and plant systems have often been explained by hybridization, incomplete lineage sorting (ILS), or horizontal gene transfer. Here, we used target enrichment data, species tree, and species network approaches to infer the backbone phylogeny of the family Caprifoliaceae, while distinguishing among sources of incongruence. We used 713 nuclear loci and 46 complete plastome sequence data from 43 samples representing 38 species from all major clades to reconstruct the phylogeny of the family using concatenation and coalescence approaches. We found significant nuclear gene tree conflict as well as cytonuclear discordance. Additionally, coalescent simulations and phylogenetic species network analyses suggested putative ancient hybridization among subfamilies of Caprifoliaceae, which seems to be the main source of phylogenetic discordance. Ancestral state reconstruction of six morphological characters revealed some homoplasy for each character examined. By dating the branching events, we inferred the origin of Caprifoliaceae at approximately 66.65 Ma in the late Cretaceous. By integrating evidence from molecular phylogeny, divergence times, and morphology, we here recognize Zabelioideae as a new subfamily in Caprifoliaceae. This work shows the necessity of using a combination of multiple approaches to identify the sources of gene tree discordance. Our study also highlights the importance of using data from both nuclear and plastid genomes to reconstruct deep and shallow phylogenies of plants.  相似文献   

2.
With the continued adoption of genome‐scale data in evolutionary biology comes the challenge of adequately harnessing the information to make accurate phylogenetic inferences. Coalescent‐based methods of species tree inference have become common, and concatenation has been shown in simulation to perform well, particularly when levels of incomplete lineage sorting are low. However, simulation conditions are often overly simplistic, leaving empiricists with uncertainty regarding analytical tools. We use a large ultraconserved element data set (>3,000 loci) from rattlesnakes of the Crotalus triseriatus group to delimit lineages and estimate species trees using concatenation and several coalescent‐based methods. Unpartitioned and partitioned maximum likelihood and Bayesian analysis of the concatenated matrix yield a topology identical to coalescent analysis of a subset of the data in bpp . ASTRAL analysis on a subset of the more variable loci also results in a tree consistent with concatenation and bpp , whereas the SVDquartets phylogeny differs at additional nodes. The size of the concatenated matrix has a strong effect on species tree inference using SVDquartets , warranting additional investigation on optimal data characteristics for this method. Species delimitation analyses suggest up to 16 unique lineages may be present within the C. triseriatus group, with divergences occurring during the Neogene and Quaternary. Network analyses suggest hybridization within the group is relatively rare. Altogether, our results reaffirm the Mexican highlands as a biodiversity hotspot and suggest that coalescent‐based species tree inference on data subsets can provide a strongly supported species tree consistent with concatenation of all loci with a large amount of missing data.  相似文献   

3.
Despite the broad adoption of multispecies coalescent (MSC) methods for nuclear phylogenomics, they have yet to be applied to mitochondrial (mt) genomic data. As the potential sources of phylogenomic bias that MSC methods can address, such as incomplete lineage sorting, horizontal gene transfer and gene tree heterogeneity, have been found in mt genomic data, these approaches may improve the accuracy of phylogenetic inference with these data. In the present study, we examined the behaviour of MSC methods in reconstructing the phylogeny of Lepidoptera (butterflies and moths), a group for which mt genomic data are known to have strong resolving power. Traditional concatenation methods of analysing mt genomes for Lepidoptera infer topologies highly congruent with those generated from independent nuclear datasets. Individual mt gene trees performed poorly in recovering consensus relationships at deep levels (i.e. superfamily monophyly and inter-relationships) and only moderately well for shallow relationships (i.e. within Papilionoidea). In contrast, MSC analyses with ASTRAL performed strongly with almost complete concordance to both concatenated mt genome analyses and independent nuclear analyses at both deep and shallow phylogenetic scales. Outgroup choice had a limited impact on tree accuracy, with even phylogenetically distant outgroups still resulting in topologies highly congruent with results from nuclear datasets, although MSC analyses appeared to be marginally more affected by outgroup choice than concatenation analyses. In general, discordance between concatenation and MSC analyses was found at nodes whose resolution varied between previous nuclear phylogenomic studies. The sensitivity of individual relationships to analysis with MSC vs concatenation can thus be used to test the robustness of phylogenetic hypotheses. For insect phylogenetics, MSC is a reliable inference method for mt genomic data and is thus a useful complement to the already widely used concatenation approaches.  相似文献   

4.
We explored the phylogenetic utility and limits of the individual and concatenated mitochondrial genes for reconstructing the higher-level relationships of teleosts, using the complete (or nearly complete) mitochondrial DNA sequences of eight teleosts (including three newly determined sequences), whose relative phylogenetic positions were noncontroversial. Maximum-parsimony analyses of the nucleotide and amino acid sequences of 13 protein-coding genes from the above eight teleosts, plus two outgroups (bichir and shark), indicated that all of the individual protein-coding genes, with the exception of ND5, failed to recover the expected phylogeny, although unambiguously aligned sequences from 22 concatenated transfer RNA (tRNA) genes (stem regions only) recovered the expected phylogeny successfully with moderate statistical support. The phylogenetic performance of the 13 protein-coding genes in recovering the expected phylogeny was roughly classified into five groups, viz. very good (ND5, ND4, COIII, COI), good (COII, cyt b), medium (ND3, ND2), poor (ND1, ATPase 6), and very poor (ND4L, ND6, ATPase 8). Although the universality of this observation was unclear, analysis of successive concatenation of the 13 protein-coding genes in the same ranking order revealed that the combined data sets comprising nucleotide sequences from the several top-ranked protein-coding genes (no 3rd codon positions) plus the 22 concatenated tRNA genes (stem regions only) best recovered the expected phylogeny, with all internal branches being supported by bootstrap values >90%. We conclude that judicious choice of mitochondrial genes and appropriate data weighting, in conjunction with purposeful taxonomic sampling, are prerequisites for resolving higher-level relationships in teleosts under the maximum-parsimony optimality criterion.  相似文献   

5.
Species complexes undergoing rapid radiation present a challenge in molecular systematics because of the possibility that ancestral polymorphism is retained in component gene trees. Coalescent theory has demonstrated that gene trees often fail to match lineage trees when taxon divergence times are less than the ancestral effective population sizes. Suggestions to increase the number of loci and the number of individuals per taxon have been proposed; however, phylogenetic methods to adequately analyze these data in a coalescent framework are scarce. We compare two approaches to estimating lineage (species) trees using multiple individuals and multiple loci: the commonly used partitioned Bayesian analysis of concatenated sequences and a modification of a newly developed hierarchical Bayesian method (BEST) that simultaneously estimates gene trees and species trees from multilocus data. We test these approaches on a phylogeny of rapidly radiating species wherein divergence times are likely to be smaller than effective population sizes, and incomplete lineage sorting is known, in the rodent genus, Thomomys. We use seven independent noncoding nuclear sequence loci (total approximately 4300 bp) and between 1 and 12 individuals per taxon to construct a phylogenetic hypothesis for eight Thomomys species. The majority-rule consensus tree from the partitioned concatenated analysis included 14 strongly supported bipartitions, corroborating monophyletic species status of five of the eight named species. The BEST tree strongly supported only the split between the two subgenera and showed very low support for any other clade. Comparison of both lineage trees to individual gene trees revealed that the concatenation method appears to ignore conflicting signals among gene trees, whereas the BEST tree considers conflicting signals and downweights support for those nodes. Bayes factor analysis of posterior tree distributions from both analyses strongly favor the model underlying the BEST analysis. This comparison underscores the risks of overreliance on results from concatenation, and ignoring the properties of coalescence, especially in cases of recent, rapid radiations.  相似文献   

6.
Testing congruence in phylogenomic analysis   总被引:1,自引:0,他引:1  
Phylogenomic analyses of large sets of genes or proteins have the potential to revolutionize our understanding of the tree of life. However, problems arise because estimated phylogenies from individual loci often differ because of different histories, systematic bias, or stochastic error. We have developed Concaterpillar, a hierarchical clustering method based on likelihood-ratio testing that identifies congruent loci for phylogenomic analysis. Concaterpillar also includes a test for shared relative evolutionary rates between genes indicating whether they should be analyzed separately or by concatenation. In simulation studies, the performance of this method is excellent when a multiple comparison correction is applied. We analyzed a phylogenomic data set of 60 translational protein sequences from the major supergroups of eukaryotes and identified three congruent subsets of proteins. Analysis of the largest set indicates improved congruence relative to the full data set and produced a phylogeny with stronger support for five eukaryote supergroups including the Opisthokonts, the Plantae, the stramenopiles + Apicomplexa (chromalveolates), the Amoebozoa, and the Excavata. In contrast, the phylogeny of the second largest set indicates a close relationship between stramenopiles and red algae, to the exclusion of alveolates, suggesting gene transfer from the red algal secondary symbiont to the ancestral stramenopile host nucleus during the origin of their chloroplast. Investigating phylogenomic data sets for conflicting signals has the potential to both improve phylogenetic accuracy and inform our understanding of genome evolution.  相似文献   

7.
Interest in methods that estimate speciation and extinction rates from molecular phylogenies has increased over the last decade. The application of such methods requires reliable estimates of tree topology and node ages, which are frequently obtained using standard phylogenetic inference combining concatenated loci and molecular dating. However, this practice disregards population‐level processes that generate gene tree/species tree discordance. We evaluated the impact of employing concatenation and coalescent‐based phylogeny inference in recovering the correct macroevolutionary regime using simulated data based on the well‐established diversification rate shift of delphinids in Cetacea. We found that under scenarios of strong incomplete lineage sorting, macroevolutionary analysis of phylogenies inferred by concatenating loci failed to recover the delphinid diversification shift, while the coalescent‐based tree consistently retrieved the correct rate regime. We suggest that ignoring microevolutionary processes reduces the power of methods that estimate macroevolutionary regimes from molecular data.  相似文献   

8.
Extant gars represent the remaining members of a formerly diverse assemblage of ancient ray-finned fishes and have been the subject of multiple phylogenetic analyses using morphological data. Here, we present the first hypothesis of phylogenetic relationships among living gar species based on molecular data, through the examination of gene tree heterogeneity and coalescent species tree analyses of a portion of one mitochondrial (COI) and seven nuclear (ENC1, myh6, plagl2, S7 ribosomal protein intron 1, sreb2, tbr1, and zic1) genes. Individual gene trees displayed varying degrees of resolution with regards to species-level relationships, and the gene trees inferred from COI and the S7 intron were the only two that were completely resolved. Coalescent species tree analyses of nuclear genes resulted in a well-resolved and strongly supported phylogenetic tree of living gar species, for which Bayesian posterior node support was further improved by the inclusion of the mitochondrial gene. Species-level relationships among gars inferred from our molecular data set were highly congruent with previously published morphological phylogenies, with the exception of the placement of two species, Lepisosteus osseus and L. platostomus. Re-examination of the character coding used by previous authors provided partial resolution of this topological discordance, resulting in broad concordance in the phylogenies inferred from individual genes, the coalescent species tree analysis, and morphology. The completely resolved phylogeny inferred from the molecular data set with strong Bayesian posterior support at all nodes provided insights into the potential for introgressive hybridization and patterns of allopatric speciation in the evolutionary history of living gars, as well as a solid foundation for future examinations of functional diversification and evolutionary stasis in a "living fossil" lineage.  相似文献   

9.
The causes and consequences of rapid radiations are major unresolved issues in evolutionary biology. This is in part because phylogeny estimation is confounded by processes such as stochastic lineage sorting and hybridization. Because these processes are expected to be heterogeneous across the genome, comparison among marker classes may provide a means of disentangling these elements. Here we use introns from nuclear-encoded reproductive protein genes expected to be resistant to introgression to estimate the phylogeny of the western chipmunks (Tamias: subgenus: Neotamias), a rapid radiation that has experienced introgressive hybridization of mitochondrial DNA (mtDNA). We analyze the nuclear loci using coalescent-based species-tree estimation methods and concatenation to estimate a species tree and we use parametric bootstraps and coalescent simulations to differentiate between phylogenetic error, coalescent stochasticity and introgressive hybridization. Results indicate that the mtDNA gene tree reflects several introgression events that have occurred between taxa of varying levels of divergence and at different time points in the tree. T. panamintinus and T. speciosus appear to be fixed for ancient mitochondrial introgressions from T. minimus. A southern Rocky Mountains clade appears well sorted (i.e., species are largely monophyletic) at multiple nuclear loci, while five of six taxa are nonmonophyletic based on cytochrome b. Our simulations reject phylogenetic error and coalescent stochasticity as causes. The results represent an advance in our understanding of the processes at work during the radiation of Tamias and suggest that sampling reproductive-protein genes may be a viable strategy for phylogeny estimation of rapid radiations in which reproductive isolation is incomplete. However, a genome-scale survey that can statistically compare heterogeneity of genealogical process at many more loci will be necessary to test this conclusion.  相似文献   

10.
The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group.  相似文献   

11.
Two different methods of using paralogous genes for phylogenetic inference have been proposed: reconciled trees (or gene tree parsimony) and uninode coding. Gene tree parsimony suffers from 10 serious problems, including differential weighting of nucleotide and gap characters, undersampling which can be misinterpreted as synapomorphy, all of the characters not being allowed to interact, and conflict between gene trees being given equal weight, regardless of branch support. These problems are largely avoided by using uninode coding. The uninode coding method is elaborated to address multiple gene duplications within a single gene tree family and handle problems caused by lack of gene tree resolution. An example of vertebrate phylogeny inferred from nine genes is reanalyzed using uninode coding. We suggest that uninode coding be used instead of gene tree parsimony for phylogenetic inference from paralogous genes.  相似文献   

12.
The construction and interpretation of gene trees is fundamental in molecular systematics. If the gene is defined in a historical (coalescent) sense, there can be multiple gene trees within the single contiguous set of nucleotides, and attempts to construct a single tree for such a sequence must deal with homoplasy created by conflict among divergent histories. On a larger scale, incongruence is expected among gene tree topologies at different loci of individuals within sexually reproducing species, and it has been suggested that this discordance can be used to delimit species. A practical concern for such topological methods is that polymorphisms may be maintained through numerous cladogenic events; this polymorphism problem is less of a concern for nontopological approaches to species delimitation using molecular data. Although a central theoretical concern in molecular systematics is discordance between a given gene tree and the true "species tree," the primary empirical problem faced in reconstructing taxic phylogeny is incongruence among the trees inferred from different sequences. Linkage relationships limit character independence and thus have important implications for handling multiple data sets in phylogenetic analysis, particularly at the species level, where incongruence among different historically associated loci is expected. Gene trees can also be reconstructed for loci that influence phenotypic characters, but there is at best a tenuous relationship between phenotypic homoplasy and homoplasy in such gene trees. Nevertheless, expression patterns and orthology relationships of genes involved in the expression of phenotypes can in theory provide criteria for homology assessment of morphological characters.  相似文献   

13.
Although nuclear protein-coding genes have proven broadly useful for phylogenetic inference, relatively few such genes are regularly employed in studies of Coleoptera, the most diverse insect order. We increase the number of loci available for beetle systematics by developing protocols for three genes previously unused in beetles (alpha-spectrin, RNA polymerase II and topoisomerase I) and by refining protocols for five genes already in use (arginine kinase, CAD, enolase, PEPCK and wingless). We evaluate the phylogenetic performance of each gene in a Bayesian framework against a presumably known test phylogeny. The test phylogeny covers 31 beetle specimens and two outgroup taxa of varying age, including three of the four extant beetle suborders and a denser sampling in Adephaga and in the carabid genus Bembidion. All eight genes perform well for Cenozoic divergences and accurately separate closely related species within Bembidion, but individual genes differ markedly in accuracy over the older Mesozoic and Permian divergences. The concatenated data reconstruct the test phylogeny with high support in both Bayesian and parsimony analyses, indicating that combining data from multiple nuclear loci will be a fruitful approach for assembling the beetle tree of life.  相似文献   

14.
Xanthophyceae are a group of heterokontophyte algae. Few molecular studies have investigated the evolutionary history and phylogenetic relationships of this class. We sequenced the nuclear-encoded SSU rDNA and chloroplast-encoded rbcL genes of several xanthophycean species from different orders, families, and genera. Neither SSU rDNA nor rbcL genes show intraspecific sequence variation and are good diagnostic markers for characterization of problematic species. New sequences, combined with those previously available, were used to create different multiple alignments. Analyses included sequences from 26 species of Xanthophyceae plus three Phaeothamniophyceae and two Phaeophyceae taxa used as outgroups. Phylogenetic analyses were performed according to Bayesian inference, maximum likelihood, and maximum parsimony methods. We explored effects produced on the phylogenetic outcomes by both taxon sampling as well as selected genes. Congruent results were obtained from analyses performed on single gene multiple alignments as well as on a data set including both SSU rDNA and rbcL sequences. Trees obtained in this study show that several currently recognized xanthophycean taxa do not form monophyletic groups. The order Mischococcales is paraphyletic, while Tribonematales and Botrydiales are polyphyletic even if evidence for the second order is not conclusive. Botrydiales and Vaucheriales, both including siphonous taxa, do not form a clade. The families Botrydiopsidaceae, Botryochloridaceae, and Pleurochloridaceae as well as the genera Botrydiopsis and Chlorellidium are polyphyletic. The Centritractaceae and the genus Bumilleriopsis also appear to be polyphyletic but their monophyly cannot be completely rejected with current evidence. Our results support morphological convergence at any taxonomic rank in the evolution of the Xanthophyceae. Finally, our phylogenetic analyses exclude an origin of the Xanthophyceae from a Vaucheria-like ancestor and favor a single early origin of the coccoid cell form.  相似文献   

15.
The ability to generate large molecular datasets for phylogenetic studies benefits biologists, but such data expansion introduces numerous analytical problems. A typical molecular phylogenetic study implicitly assumes that sequences evolve under stationary, reversible and homogeneous conditions, but this assumption is often violated in real datasets. When an analysis of large molecular datasets results in unexpected relationships, it often reflects violation of phylogenetic assumptions, rather than a correct phylogeny. Molecular evolutionary phenomena such as base compositional heterogeneity and among‐site rate variation are known to affect phylogenetic inference, resulting in incorrect phylogenetic relationships. The ability of methods to overcome such bias has not been measured on real and complex datasets. We investigated how base compositional heterogeneity and among‐site rate variation affect phylogenetic inference in the context of a mitochondrial genome phylogeny of the insect order Coleoptera. We show statistically that our dataset is affected by base compositional heterogeneity regardless of how the data are partitioned or recoded. Among‐site rate variation is shown by comparing topologies generated using models of evolution with and without a rate variation parameter in a Bayesian framework. When compared for their effectiveness in dealing with systematic bias, standard phylogenetic methods tend to perform poorly, and parsimony without any data transformation performs worst. Two methods designed specifically to overcome systematic bias, LogDet and a Bayesian method implementing variable composition vectors, can overcome some level of base compositional heterogeneity, but are still affected by among‐site rate variation. A large degree of variation in both noise and phylogenetic signal among all three codon positions is observed. We caution and argue that more data exploration is imperative, especially when many genes are included in an analysis.  相似文献   

16.
The mitochondrial genome is one of the most frequently used loci in phylogenetic and phylogeographic analyses, and it is becoming increasingly possible to sequence and analyze this genome in its entirety from diverse taxa. However, sequencing the entire genome is not always desirable or feasible. Which genes should be selected to best infer the evolutionary history of the mitochondria within a group of organisms, and what properties of a gene determine its phylogenetic performance? The current study addresses these questions in a Bayesian phylogenetic framework with reference to a phylogeny of plethodontid and related salamanders derived from 27 complete mitochondrial genomes; this topology is corroborated by nuclear DNA and morphological data. Evolutionary rates for each mitochondrial gene and divergence dates for all nodes in the plethodontid mitochondrial genome phylogeny were estimated in both Bayesian and maximum likelihood frameworks using multiple fossil calibrations, multiple data partitions, and a clock-independent approach. Bayesian analyses of individual genes were performed, and the resulting trees compared against the reference topology. Ordinal logistic regression analysis of molecular evolution rate, gene length, and the G-shape parameter a demonstrated that slower rate of evolution and longer gene length both increased the probability that a gene would perform well phylogenetically. Estimated rates of molecular evolution vary 84-fold among different mitochondrial genes and different salamander lineages, and mean rates among genes vary 15-fold. Despite having conserved amino acid sequences, cox1, cox2, cox3, and cob have the fastest mean rates of nucleotide substitution, and the greatest variation in rates, whereas rrnS and rrnL have the slowest rates. Reasons underlying this rate variation are discussed, as is the extensive rate variation in cox1 in light of its proposed role in DNA barcoding.  相似文献   

17.
The structural genes for nitrogenase, nifK, nifD, and nifH, are crucial for nitrogen fixation. Previous phylogenetic analysis of the amino acid sequence of nifH suggested that this gene had been horizontally transferred from a proteobacterium to the gram-positive/cyanobacterial clade, although the confounding effects of paralogous comparisons made interpretation of the data difficult. An additional test of nif gene horizontal transfer using nifD was made, but the NifD phylogeny lacked resolution. Here nif gene phylogeny is addressed with a phylogenetic analysis of a third and longer nif gene, nifK. As part of the study, the nifK gene of the key taxon Frankia was sequenced. Parsimony and some distance analyses of the nifK amino acid sequences provide support for vertical descent of nifK, but other distance trees provide support for the lateral transfer of the gene. Bootstrap support was found for both hypotheses in all trees; the nifK data do not definitively favor one or the other hypothesis. A parsimony analysis of NifH provides support for horizontal transfer in accord with previous reports, although bootstrap analysis also shows some support for vertical descent of the orthologous nifH genes. A wider sampling of taxa and more sophisticated methods of phylogenetic inference are needed to understand the evolution of nif genes. The nif genes may also be powerful phylogenetic tools. If nifK evolved by vertical descent, it provides strong evidence that the cyanobacteria and proteobacteria are sister groups to the exclusion of the firmicutes, whereas 16S rRNA sequences are unable to resolve the relationships of these three major eubacterial lineages.   相似文献   

18.
The increasing availability of complete genome sequences and the development of new, faster methods for phylogenetic reconstruction allow the exploration of the set of evolutionary trees for each gene in the genome of any species. This has led to the development of new phylogenomic methods. Here, we have compared different phylogenetic and phylogenomic methods in the analysis of the monophyletic origin of insect endosymbionts from the gamma-Proteobacteria, a hotly debated issue with several recent, conflicting reports. We have obtained the phylogenetic tree for each of the 579 identified protein-coding genes in the genome of the primary endosymbiont of carpenter ants, Blochmannia floridanus, after determining their presumed orthologs in 20 additional Proteobacteria genomes. A reference phylogeny reflecting the monophyletic origin of insect endosymbionts was further confirmed with different approaches, which led us to consider it as the presumed species tree. Remarkably, only 43 individual genes produced exactly the same topology as this presumed species tree. Most discrepancies between this tree and those obtained from individual genes or by concatenation of different genes were due to the grouping of Xanthomonadales with beta-Proteobacteria and not to uncertainties over the monophyly of insect endosymbionts. As previously noted, operational genes were more prone to reject the presumed species tree than those included in information-processing categories, but caution should be exerted when selecting genes for phylogenetic inference on the basis of their functional category assignment. We have obtained strong evidence in support of the monophyletic origin of gamma-Proteobacteria insect endosymbionts by a combination of phylogenetic and phylogenomic methods. In our analysis, the use of concatenated genes has shown to be a valuable tool for analyzing primary phylogenetic signals coded in the genomes. Nevertheless, other phylogenomic methods such as supertree approaches were useful in revealing alternative phylogenetic signals and should be included in comprehensive phylogenomic studies.  相似文献   

19.
The coalescent with recombination describes the distribution of genealogical histories and resulting patterns of genetic variation in samples of DNA sequences from natural populations. However, using the model as the basis for inference is currently severely restricted by the computational challenge of estimating the likelihood. We discuss why the coalescent with recombination is so challenging to work with and explore whether simpler models, under which inference is more tractable, may prove useful for genealogy-based inference. We introduce a simplification of the coalescent process in which coalescence between lineages with no overlapping ancestral material is banned. The resulting process has a simple Markovian structure when generating genealogies sequentially along a sequence, yet has very similar properties to the full model, both in terms of describing patterns of genetic variation and as the basis for statistical inference.  相似文献   

20.
Neotropical rivers are home to the largest assemblage of freshwater fishes, but little is known about the phylogeny of these fishes at the species level using multi-locus molecular markers. Here, we present a phylogeny for all known species of the genus Satanoperca, a widespread group of Neotropical cichlid fishes, based on analysis of six unlinked genetic loci. To test nominal and proposed species limits for this group, we surveyed mtDNA sequence variation among 320 individuals representing all know species. Most nominal species were supported by this approach but we determined that populations in the Xingu, Tapajós, and Araguaia+Paraná Rivers are likely undescribed species, while S. jurupari and S. mapiritensis did not show clear genetic distinction. To infer a phylogeny of these putative species, we conducted maximum likelihood and Bayesian non-clock and relaxed clock analyses of concatenated data from three genes (one mitochondrial, two nuclear). We also used a multi-species coalescent model to estimate a species tree from six unlinked loci (one mitochondrial, five nuclear). The topologies obtained were congruent with other results, but showed only minimal to moderate support for some nodes, suggesting that more loci will be needed to satisfactorily estimate the distribution of coalescent histories within Satanoperca. We determined that this variation results from topological discordance among separate gene trees, likely due to differential sorting of ancestral polymorphisms.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号