首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace , combines tree metrics and multivariate analysis to provide low‐dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group‐specific consensus phylogenies. treespace also provides a user‐friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results.  相似文献   

2.
Phylogenetic comparative methods have long considered phylogenetic signal as a source of statistical bias in the correlative analysis of biological traits. However, the main life-history strategies existing in a set of taxa are often combinations of life history traits that are inherently phylogenetically structured. In this paper, we present a method for identifying evolutionary strategies from large sets of biological traits, using phylogeny as a source of meaningful historical and ecological information. Our methodology extends a multivariate method developed for the analysis of spatial patterns, and relies on finding combinations of traits that are phylogenetically autocorrelated. Using extensive simulations, we show that our method efficiently uncovers phylogenetic structures with respect to various tree topologies, and remains powerful in cases where a large majority of traits are not phylogenetically structured. Our methodology is illustrated using empirical data, and implemented in the adephylo package for the free software R.  相似文献   

3.
APE: Analyses of Phylogenetics and Evolution in R language   总被引:9,自引:0,他引:9  
Analysis of Phylogenetics and Evolution (APE) is a package written in the R language for use in molecular evolution and phylogenetics. APE provides both utility functions for reading and writing data and manipulating phylogenetic trees, as well as several advanced methods for phylogenetic and evolutionary analysis (e.g. comparative and population genetic methods). APE takes advantage of the many R functions for statistics and graphics, and also provides a flexible framework for developing and implementing further statistical methods for the analysis of evolutionary processes. AVAILABILITY: The program is free and available from the official R package archive at http://cran.r-project.org/src/contrib/PACKAGES.html#ape. APE is licensed under the GNU General Public License.  相似文献   

4.
The assumption that maximum parsimony in a phylogenetic tree is achieved by the reduction of the number of overall homoplasies to a minimum is questioned and an alternative approach of using the most parsimonius distribution of taxa at each dichotomy suggested. This approach can be put into practice using Hennigian logic to determine relationships, DNA sequences data as the character set and standard statistical techniques to determine the significance to be placed on the resultipng phylogenetic tree. The logic and robustness of such an approach is described. A software package, SYNAPO, is availabe to assist such analyses.  相似文献   

5.
Ollier S  Couteron P  Chessel D 《Biometrics》2006,62(2):471-477
In recent years, there has been an increased interest in studying the variability of a quantitative life-history trait across a set of species sharing a common phylogeny. However, such studies have suffered from an insufficient development of statistical methods aimed at decomposing the trait variance with respect to the topological structure of the tree. Here we propose a new and generic approach that expresses the topological properties of the phylogenetic tree via an orthonormal basis, which is further used to decompose the trait variance. Such a decomposition provides a structure function, referred to as an "orthogram," which is relevant to characterize in both graphical and statistical aspects the dependence of trait values on the topology of the tree ("phylogenetic dependence"). We also propose four complementary test statistics to be computed from orthogram values that help to diagnose both the intensity and the nature of phylogenetic dependence. The relevance of the method is illustrated by the analysis of three phylogenetic data sets, drawn from the literature and typifying contrasted levels and aspects of phylogenetic dependence. Freely available routines which have been programmed in the R framework are also proposed.  相似文献   

6.
T-REX (tree and reticulogram reconstruction) is an application to reconstruct phylogenetic trees and reticulation networks from distance matrices. The application includes a number of tree fitting methods like NJ, UNJ or ADDTREE which have been very popular in phylogenetic analysis. At the same time, the software comprises several new methods of phylogenetic analysis such as: tree reconstruction using weights, tree inference from incomplete distance matrices or modeling a reticulation network for a collection of objects or species. T-REX also allows the user to visualize obtained tree or network structures using Hierarchical, Radial or Axial types of tree drawing and manipulate them interactively. AVAILABILITY: T-REX is a freeware package available online at: http://www.fas.umontreal.ca/biol/casgrain/en/labo/t-rex  相似文献   

7.
The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation.  相似文献   

8.
We introduce in this paper the dendroTools R package for studying the statistical relationships between tree-ring parameters and daily environmental data. The core function of the package is daily_response(), which works by sliding a moving window through daily environmental data and calculating statistical metrics with one or more tree ring proxies. Possible metrics are correlation coefficient, coefficient of determination and adjusted coefficient of determination. In addition to linear regression, it is possible to use a nonlinear artificial neural network with the Bayesian regularization training algorithm (brnn). dendroTools provides the opportunity to use daily climate data and robust nonlinear functions for the analysis of climate-growth relationships. Models should thus be better adapted to the real (continuous) growth of trees and should gain in predictive capabilities. The dendroTools R package is freely available in the CRAN repository. The functionality of the package is demonstrated on two examples, one using a mean vessel area (MVA) chronology and one a traditional tree-ring width (TRW).  相似文献   

9.
10.
Phylogenetic signal is the tendency for closely related species to display similar trait values as a consequence of their phylogenetic proximity. Ecologists and evolutionary biologists are becoming increasingly interested in studying the phylogenetic signal and the processes which drive patterns of trait values in the phylogeny. Here, we present a new R package, phylosignal which provides a collection of tools to explore the phylogenetic signal for continuous biological traits. These tools are mainly based on the concept of autocorrelation and have been first developed in the field of spatial statistics. To illustrate the use of the package, we analyze the phylogenetic signal in pollution sensitivity for 17 species of diatoms.  相似文献   

11.
12.
《Dendrochronologia》2014,32(2):107-112
Studies using tree-rings to reconstruct forest disturbance dynamics are common and their number has been increasing in the recent years. Despite the evident need for a common set of tools for verification, replication and comparison across studies, only a few DOS programmes for disturbance detection exist and they are for limited purposes only. Currently, the ideal statistical environment for the task is R, which is becoming the primary tool for various types of tree-ring analyses. This has led to the development of TRADER (Tree Ring Analysis of Disturbance Events in R), an open-source software package for R that provides an analysis of tree growth history for disturbance reconstructions. We have implemented four methods, which are commonly used for the detection of disturbance events: radial-growth averaging criteria developed by Nowacki and Abrams, 1997, the boundary-line method (Black and Abrams, 2003), the absolute-increase method (Fraver and White, 2005), and the combination of radial-growth averaging and boundary-line techniques (Splechtna et al., 2005). TRADER, however, enables the analysis of disturbance history by a total of 24 published methods. Furthermore, functions for the detection of tree recruitment and growth trends were also included. The main features of the presented package are described and their application is shown on a real tree-ring datasets. The package requires little knowledge of the R environment giving straightforward analyses with suitable parameters, but at the same time it is easily modifiable by the more experienced user. The package improves research efficiency and facilitates replication of previous studies. One of its major advantages is that it offers the possibility for comparison between different methods of disturbance history reconstruction.  相似文献   

13.
Pvclust: an R package for assessing the uncertainty in hierarchical clustering   总被引:11,自引:0,他引:11  
SUMMARY: Pvclust is an add-on package for a statistical software R to assess the uncertainty in hierarchical cluster analysis. Pvclust can be used easily for general statistical problems, such as DNA microarray analysis, to perform the bootstrap analysis of clustering, which has been popular in phylogenetic analysis. Pvclust calculates probability values (p-values) for each cluster using bootstrap resampling techniques. Two types of p-values are available: approximately unbiased (AU) p-value and bootstrap probability (BP) value. Multiscale bootstrap resampling is used for the calculation of AU p-value, which has superiority in bias over BP value calculated by the ordinary bootstrap resampling. In addition the computation time can be enormously decreased with parallel computing option.  相似文献   

14.
Sample size calculations are an essential component of the design and evaluation of scientific studies. However, there is a lack of clear guidance for determining the sample size needed for phylogenetic studies, which are becoming an essential part of studying pathogen transmission. We introduce a statistical framework for determining the number of true infector-infectee transmission pairs identified by a phylogenetic study, given the size and population coverage of that study. We then show how characteristics of the criteria used to determine linkage and aspects of the study design can influence our ability to correctly identify transmission links, in sometimes counterintuitive ways. We test the overall approach using outbreak simulations and provide guidance for calculating the sensitivity and specificity of the linkage criteria, the key inputs to our approach. The framework is freely available as the R package phylosamp, and is broadly applicable to designing and evaluating a wide array of pathogen phylogenetic studies.  相似文献   

15.
Dendrometers are vital tools for studying the response of trees to intra-annual environmental changes in different temporal resolutions, ranging from hourly, daily to weekly time resolution. Dendrometers are increasingly used in forest management and tree physiological studies. Besides the data analysis, data processing is also challenging, time-consuming and potentially error-prone due to the immense number of measurements generated by self-registering electronic dendrometers. We present the package ‘dendRoAnalyst’ based on R statistical software to process and analyse dendrometer data using various approaches. This package offers algorithms for handling and pre-cleaning of dendrometer data before the application of subsequent data analytical steps. This includes identifying and erasing artefacts in dendrometer datasets not related to actual stem circumference change, identifying data gaps within records, and the possibility of change(s) in temporal resolution. Furthermore, the package can calculate different daily statistics of dendrometer data, including the daily amplitude of tree growth. The package dendRoAnalyst is therefore intended to facilitate researchers with a collection of functions for handling and analysing dendrometer data.  相似文献   

16.
Genealogies estimated from haplotypic genetic data play a prominent role in various biological disciplines in general and in phylogenetics, population genetics and phylogeography in particular. Several software packages have specifically been developed for the purpose of reconstructing genealogies from closely related, and hence, highly similar haplotype sequence data. Here, we use simulated data sets to test the performance of traditional phylogenetic algorithms, neighbour-joining, maximum parsimony and maximum likelihood in estimating genealogies from nonrecombining haplotypic genetic data. We demonstrate that these methods are suitable for constructing genealogies from sets of closely related DNA sequences with or without migration. As genealogies based on phylogenetic reconstructions are fully resolved, but not necessarily bifurcating, and without reticulations, these approaches outperform widespread 'network' constructing methods. In our simulations of coalescent scenarios involving panmictic, symmetric and asymmetric migration, we found that phylogenetic reconstruction methods performed well, while the statistical parsimony approach as implemented in TCS performed poorly. Overall, parsimony as implemented in the PHYLIP package performed slightly better than other methods. We further point out that we are not making the case that widespread 'network' constructing methods are bad, but that traditional phylogenetic tree finding methods are applicable to haplotypic data and exhibit reasonable performance with respect to accuracy and robustness. We also discuss some of the problems of converting a tree to a haplotype genealogy, in particular that it is nonunique.  相似文献   

17.
phangorn: phylogenetic analysis in R   总被引:4,自引:0,他引:4  
SUMMARY: phangorn is a package for phylogenetic reconstruction and analysis in the R language. Previously it was only possible to estimate phylogenetic trees with distance methods in R. phangorn, now offers the possibility of reconstructing phylogenies with distance based methods, maximum parsimony or maximum likelihood (ML) and performing Hadamard conjugation. Extending the general ML framework, this package provides the possibility of estimating mixture and partition models. Furthermore, phangorn offers several functions for comparing trees, phylogenetic models or splits, simulating character data and performing congruence analyses. AVAILABILITY: phangorn can be obtained through the CRAN homepage http://cran.r-project.org/web/packages/phangorn/index.html. phangorn is licensed under GPL 2.  相似文献   

18.
The PHylogenetic Analysis with Space/Time models (PHAST) software package consists of a collection of command-line programs and supporting libraries for comparative genomics. PHAST is best known as the engine behind the Conservation tracks in the University of California, Santa Cruz (UCSC) Genome Browser. However, it also includes several other tools for phylogenetic modeling and functional element identification, as well as utilities for manipulating alignments, trees and genomic annotations. PHAST has been in development since 2002 and has now been downloaded more than 1000 times, but so far it has been released only as provisional ('beta') software. Here, we describe the first official release (v1.0) of PHAST, with improved stability, portability and documentation and several new features. We outline the components of the package and detail recent improvements. In addition, we introduce a new interface to the PHAST libraries from the R statistical computing environment, called RPHAST, and illustrate its use in a series of vignettes. We demonstrate that RPHAST can be particularly useful in applications involving both large-scale phylogenomics and complex statistical analyses. The R interface also makes the PHAST libraries acccessible to non-C programmers, and is useful for rapid prototyping. PHAST v1.0 and RPHAST v1.0 are available for download at http://compgen.bscb.cornell.edu/phast, under the terms of an unrestrictive BSD-style license. RPHAST can also be obtained from the Comprehensive R Archive Network (CRAN; http://cran.r-project.org).  相似文献   

19.
Ecologists and biogeographers usually rely on a single phylogenetic tree to study evolutionary processes that affect macroecological patterns. This approach ignores the fact that each phylogenetic tree is a hypothesis about the evolutionary history of a clade, and cannot be directly observed in nature. Also, trees often leave out many extant species, or include missing species as polytomies because of a lack of information on the relationship among taxa. Still, researchers usually do not quantify the effects of phylogenetic uncertainty in ecological analyses. We propose here a novel analytical strategy to maximize the use of incomplete phylogenetic information, while simultaneously accounting for several sources of phylogenetic uncertainty that may distort statistical inferences about evolutionary processes. We illustrate the approach using a clade‐wide analysis of the hummingbirds, evaluating how different sources of uncertainty affect several phylogenetic comparative analyses of trait evolution and biogeographic patterns. Although no statistical approximation can fully substitute for a complete and robust phylogeny, the method we describe and illustrate enables researchers to broaden the number of clades for which studies informed by evolutionary relationships are possible, while allowing the estimation and control of statistical error that arises from phylogenetic uncertainty. Software tools to carry out the necessary computations are offered.  相似文献   

20.
We present the ggtreeExtra package for visualizing heterogeneous data with a phylogenetic tree in a circular or rectangular layout (https://www.bioconductor.org/packages/ggtreeExtra). The package supports more data types and visualization methods than other tools. It supports using the grammar of graphics syntax to present data on a tree with richly annotated layers and allows evolutionary statistics inferred by commonly used software to be integrated and visualized with external data. GgtreeExtra is a universal tool for tree data visualization. It extends the applications of the phylogenetic tree in different disciplines by making more domain-specific data to be available to visualize and interpret in the evolutionary context.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号