首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In macroevolutionary studies, different approaches are commonly used to measure phylogenetic signal-the tendency of related taxa to resemble one another-including the K statistic and the Mantel test. The latter was recently criticized for lacking statistical power. Using new simulations, we show that the power of the Mantel test depends on the metrics used to define trait distances and phylogenetic distances between species. Increasing power is obtained by lowering variance and increasing negative skewness in interspecific distances, as obtained using Euclidean trait distances and the complement of Abouheif proximity as a phylogenetic distance. We show realistic situations involving "measurement error" due to intraspecific variability where the Mantel test is more powerful to detect a phylogenetic signal than a permutation test based on the K statistic. We highlight limitations of the K-statistic (univariate measure) and show that its application should take into account measurement errors using repeated measures per species to avoid estimation bias. Finally, we argue that phylogenetic distograms representing Euclidean trait distance as a function of the square root of patristic distance provide an insightful representation of the phylogenetic signal that can be used to assess both the impact of measurement error and the departure from a Brownian evolution model.  相似文献   

2.
To evaluate rates of evolution, to establish tests of correlation between two traits, or to investigate to what degree the phylogeny of a species assemblage is predictive of a trait value so‐called tests for phylogenetic signal are used. Being based on different approaches, these tests are generally thought to possess quite different statistical performances. In this article, we show that the Blomberg et al. K and K*, the Abouheif index, the Moran's I, and the Mantel correlation are all based on a cross‐product statistic, and are thus all related to each other when they are associated to a permutation test of phylogenetic signal. What changes is only the way phylogenetic and trait similarities (or dissimilarities) among the tips of a phylogeny are computed. The definitions of the phylogenetic and trait‐based (dis)similarities among tips thus determines the performance of the tests. We shortly discuss the biological and statistical consequences (in terms of power and type I error of the tests) of the observed relatedness among the statistics that allow tests for phylogenetic signal. Blomberg et al. K* statistic appears as one on the most efficient approaches to test for phylogenetic signal. When branch lengths are not available or not accurate, Abouheif's Cmean statistic is a powerful alternative to K*.  相似文献   

3.
Phylogenies are fundamental to comparative biology as they help to identify independent events on which statistical tests rely. Two groups of phylogenetic comparative methods (PCMs) can be distinguished: those that take phylogenies into account by introducing explicit models of evolution and those that only consider phylogenies as a statistical constraint and aim at partitioning trait values into a phylogenetic component (phylogenetic inertia) and one or multiple specific components related to adaptive evolution. The way phylogenetic information is incorporated into the PCMs depends on the method used. For the first group of methods, phylogenies are converted into variance-covariance matrices of traits following a given model of evolution such as Brownian motion (BM). For the second group of methods, phylogenies are converted into distance matrices that are subsequently transformed into Euclidean distances to perform principal coordinate analyses. Here, we show that simply taking the elementwise square root of a distance matrix extracted from a phylogenetic tree ensures having a Euclidean distance matrix. This is true for any type of distances between species (patristic or nodal) and also for trees harboring multifurcating nodes. Moreover, we illustrate that this simple transformation using the square root imposes less geometric distortion than more complex transformations classically used in the literature such as the Cailliez method. Given the Euclidean nature of the elementwise square root of phylogenetic distance matrices, the positive semidefinitiveness of the phylogenetic variance-covariance matrix of a trait following a BM model, or related models of trait evolution, can be established. In that way, we build a bridge between the two groups of statistical methods widely used in comparative analysis. These results should be of great interest for ecologists and evolutionary biologists performing statistical analyses incorporating phylogenies.  相似文献   

4.
Aim Despite the importance of the niche concept in ecological and evolutionary theory, there are still many discussions about its definition and operational evaluation, especially when dealing with niche divergence and conservatism in an explicit phylogenetic context. Here we evaluate patterns of niche evolution in 67 New World Carnivora species, measured using Hellinger distances based on MAXENT models of species distribution. We show how inferences on niche conservatism or divergence depend on the way phylogenetic patterns are analysed using matrix comparison techniques. Innovation Initially we used the simplest approach of Mantel tests to compare Hellinger distances ( N ) derived from MAXENT and phylogenetic distances ( P ) among species. Then we extended the Mantel test to generate a multivariate correlogram, in which phylogenetic patterns are analysed at multiple levels in the phylogeny and can reveal nonlinearity in the relationship between divergence and time. Finally, we proposed a new approach to generate ‘local’ (or ‘specific’) leverages of components for Mantel correlation, evaluating the non‐stationarity in the relationship between N and P for each species. This new approach was used to show if some lineages are more prone to niche shift or conservatism than others. Main conclusions Standard Mantel tests indicated a poor correspondence between N and P matrices, discarding the idea of niche conservatism for Carnivora, but the correlogram supports that closely related species tend to be more similar than expected by chance. Moreover, the variance among Hellinger distances between pairs of closely phylogenetically related species is much larger than for the entire clade. Phylogenetic non‐stationarity analysis shows that in some Carnivora families the niche tends to divergence (Mustelidae and Canidae), whereas in others it tends to conservatism (Procyonidae and Mustelidae) at short phylogenetic distances. Our analyses clearly show that misleading results may appear if niche divergence is analysed only by simple matrix correlations not taking into account complex patterns of phylogenetic nonlinearity and non‐stationarity.  相似文献   

5.
Prediction of protein functional sites from sequence-derived data remains an open bioinformatics problem. We have developed a phylogenetic motif (PM) functional site prediction approach that identifies functional sites from alignment fragments that parallel the evolutionary patterns of the family. In our approach, PMs are identified by comparing tree topologies of each alignment fragment to that of the complete phylogeny. Herein, we bypass the phylogenetic reconstruction step and identify PMs directly from distance matrix comparisons. In order to optimize the new algorithm, we consider three different distance matrices and 13 different matrix similarity scores. We assess the performance of the various approaches on a structurally nonredundant data set that includes three types of functional site definitions. Without exception, the predictive power of the original approach outperforms the distance matrix variants. While the distance matrix methods fail to improve upon the original approach, our results are important because they clearly demonstrate that the improved predictive power is based on the topological comparisons. Meaning that phylogenetic trees are a straightforward, yet powerful way to improve functional site prediction accuracy. While complementary studies have shown that topology improves predictions of protein-protein interactions, this report represents the first demonstration that trees improve functional site predictions as well.  相似文献   

6.
In this study, we attempt to highlight part of the adaptive and phylogenetic constraints in mycobacterial pathogenicity. For this purpose, we first provide a phylogeny of Mycobacteria based on cladistic analyses of 64 different taxa. We then performed a comparative analysis, taking into account both ecological factors and phylogenetic relationships. The GLIM modelling analysis showed that different ecological and phylogenetic factors might be invoked to explain the variation in pathogenicity levels. Interestingly, the most harmful species were shown to be connected with the most diversified habitats. However, the independent contrast analysis revealed that once phylogeny was taken into account, none of the relationships between ecological factors and pathogenicity remained significant, and the pathogenicity appeared to be phylogenetically inherited among mycobacteria. The most pathogen were found in the slow-growing/long helix 18 group, and within this group in the most derived taxa. This revised version was published online in June 2006 with corrections to the Cover Date.  相似文献   

7.
It is fundamentally important to assess the fit of data to model in phylogenetic and evolutionary studies. Phylogenetic methods using molecular sequences typically start with a multiple alignment. It is possible to measure the fit of data to model expectations of data, for example, via the likelihood-ratio (G) test or the X(2) test, if all sites in all sequences have an unambiguous residue. However, nearly all alignments of interest contain sites (columns of the alignment) with missing data, that is, ambiguous nucleotides, gaps, or unsequenced regions, which must presently be removed before using the above tests. Unfortunately, this is often either undesirable or impractical, as it will discard much of the data. Here, we show how iterative ML estimators may directly estimate the site-pattern probabilities for columns with missing data, given only standard i.i.d. assumptions. The optimization may use an EM or Newton algorithm, or any other hill-climbing approach. The resulting optimal likelihood under the unconstrained or multinomial model may be compared directly with the likelihood of the data coming from the model (a G statistic). Alternatively the modified observed and the expected frequencies of site patterns may be compared using a X(2) test. The distribution of such statistics is best assessed using appropriate simulations. The new method is applicable to models using codons or paired sites. The methods are also useful with Hadamard conjugations (spectral analysis) and are illustrated with these and with ML evolutionary models that allow site-rate variability.  相似文献   

8.
Summary Increasing data onDrosophila alcohol dehydrogenase (ADH) sequences have made it possible to calculate the rate of amino acid replacement per year, which is 1.7×10–9. This value makes this protein suitable for reconstructing phylogenetic relationships within the genus for those species for which no molecular data are available such asScaptodrosophila. The amino acid sequence ofDrosophila lebanonensis is compared to all of the already knownDrosophila ADHs, stressing the unique characteristic features of this protein such as the conservation of an initiating methionine at the N-terminus, the unique replacement of a glycine by an alanine at a very conserved position in the NAD domain of all dehydrogenases, the lack of a slowmigrating peptide, and the total conservation of the maximally hydrophilic peptide. The functional significance of these features is discussed.Although the percent amino acid identity of the ADH molecule inDrosophila decreases as the number of sequences compared increases, the conservation of residue type in terms of size and hydrophobocity for the ADH molecule is shown to be very high throughout the genusDrosophila. The distance matrix and parsimony methods used to establish the phylogenetic relationships ofD. lebanonensis show that the three subgenera,Scaptodrosophila, Drosophila, andSophophora separated at approximately the same time.  相似文献   

9.
A broad multilocus phylogenetic analysis (MLPA) of the representative diversity of a genus offers the opportunity to incorporate concatenated inter-species phylogenies into bacterial systematics. Recent analyses based on single housekeeping genes have provided coherent phylogenies of Aeromonas. However, to date, a multi-gene phylogenetic analysis has never been tackled. In the present study, the intra- and inter-species phylogenetic relationships of 115 strains representing all Aeromonas species described to date were investigated by MLPA. The study included the independent analysis of seven single gene fragments (gyrB, rpoD, recA, dnaJ, gyrA, dnaX, and atpD), and the tree resulting from the concatenated 4705 bp sequence. The phylogenies obtained were consistent with each other, and clustering agreed with the Aeromonas taxonomy recognized to date. The highest clustering robustness was found for the concatenated tree (i.e. all Aeromonas species split into 100% bootstrap clusters). Both possible chronometric distortions and poor resolution encountered when using single-gene analysis were buffered in the concatenated MLPA tree. However, reliable phylogenetic species delineation required an MLPA including several “bona fide” strains representing all described species.  相似文献   

10.
BACKGROUND AND AIMS: A new infrageneric rearrangement for Veronica has been proposed based on the most recent evidence from DNA sequence data, morphological evidence, and biogeographical considerations. Looking for morphological synapomorphies for each monophyletic subgenus has been problematic, due to difficulties arising from widespread homoplasy (mainly parallel evolution). In an attempt to overcome these difficulties, previously underexplored morphological characters are starting to be studied in more depth. METHODS: A molecular phylogenetic hypothesis was used based on sequences of ITS (nuclear ribosomal DNA) and plastid trnL-F regions, as a framework to test the use of seed coat ultrastructure (studied under scanning electron microscope) in the systematics of the genus. A sample of 132 taxa representing ten of the 13 subgenera in Veronica, excluding the species of the southern hemisphere Hebe complex and the exclusively North American subgenus Synthyris, was studied. KEY RESULTS AND CONCLUSIONS: The results demonstrate that, in many cases, the ultrastucture of the testa can be employed to assess relationships of taxa within the genus, and the character provides additional support for molecular trees. Further characters relevant for the classification of Veronica, i.e. base chromosome number, iridoid chemical data, life cycle, inflorescence position, have been taken into consideration in a discussion where an attempt is made to highlight the best traits to characterize each subgenus investigated.  相似文献   

11.
The identification of the conventionally accepted species of Clavulina (Cantharellales, Basidiomycota) in Europe (Clavulina amethystina, Clavulina cinerea, Clavulina cristata, and Clavulina rugosa) is often difficult and many specimens are not straightforwardly assignable to any of those four species, which is why some authors have questioned their identity. In order to assess the status of those species, a morphological examination was combined with the molecular analysis of the ITS region. The same six major clades were obtained in the Bayesian and parsimony phylogenetic analyses, and all six clades were well-supported at least by one of the analyses. Morphological characters, such as the overall branching pattern, the presence and intensity of grey colour, the cristation of the apices, and basidiospore size and shape were to various extents correlated with the phylogenetic signal obtained from the ITS region. The congruence between the molecular analyses and morphology, rather than geographical origin, suggests the existence of several species that can be delimited using a combined phylogenetic and morphological species recognition. The analyses revealed that C. cristata and C. rugosa are well-delimited species. In contrast, more than one taxa could be subsumed under the names C. amethystina and C. cinerea, the taxonomical complexity of which is discussed. The ITS region is proved to be adequate to separate phylogenetic species of Clavulina.  相似文献   

12.
Given a distance matrix M that specifies the pairwise evolutionary distances between n species, the phylogenetic tree reconstruction problem asks for an edge-weighted phylogenetic tree that satisfies M, if one exists. We study some extensions of this problem to rooted phylogenetic networks. Our main result is an O(n(2) log n)-time algorithm for determining whether there is an ultrametric galled network that satisfies M, and if so, constructing one. In fact, if such an ultrametric galled network exists, our algorithm is guaranteed to construct one containing the minimum possible number of nodes with more than one parent (hybrid nodes). We also prove that finding a largest possible submatrix M' of M such that there exists an ultrametric galled network that satisfies M' is NP-hard. Furthermore, we show that given an incomplete distance matrix (i.e. where some matrix entries are missing), it is also NP-hard to determine whether there exists an ultrametric galled network which satisfies it.  相似文献   

13.
Ollier S  Couteron P  Chessel D 《Biometrics》2006,62(2):471-477
In recent years, there has been an increased interest in studying the variability of a quantitative life-history trait across a set of species sharing a common phylogeny. However, such studies have suffered from an insufficient development of statistical methods aimed at decomposing the trait variance with respect to the topological structure of the tree. Here we propose a new and generic approach that expresses the topological properties of the phylogenetic tree via an orthonormal basis, which is further used to decompose the trait variance. Such a decomposition provides a structure function, referred to as an "orthogram," which is relevant to characterize in both graphical and statistical aspects the dependence of trait values on the topology of the tree ("phylogenetic dependence"). We also propose four complementary test statistics to be computed from orthogram values that help to diagnose both the intensity and the nature of phylogenetic dependence. The relevance of the method is illustrated by the analysis of three phylogenetic data sets, drawn from the literature and typifying contrasted levels and aspects of phylogenetic dependence. Freely available routines which have been programmed in the R framework are also proposed.  相似文献   

14.
We develop a new approach to estimate a matrix of pairwise evolutionary distances from a codon-based alignment based on a codon evolutionary model. The method first computes a standard distance matrix for each of the three codon positions. Then these three distance matrices are weighted according to an estimate of the global evolutionary rate of each codon position and averaged into a unique distance matrix. Using a large set of both real and simulated codon-based alignments of nucleotide sequences, we show that this approach leads to distance matrices that have a significantly better treelikeness compared to those obtained by standard nucleotide evolutionary distances. We also propose an alternative weighting to eliminate the part of the noise often associated with some codon positions, particularly the third position, which is known to induce a fast evolutionary rate. Simulation results show that fast distance-based tree reconstruction algorithms on distance matrices based on this codon position weighting can lead to phylogenetic trees that are at least as accurate as, if not better, than those inferred by maximum likelihood. Finally, a well-known multigene dataset composed of eight yeast species and 106 codon-based alignments is reanalyzed and shows that our codon evolutionary distances allow building a phylogenetic tree which is similar to those obtained by non-distance-based methods (e.g., maximum parsimony and maximum likelihood) and also significantly improved compared to standard nucleotide evolutionary distance estimates.  相似文献   

15.
16.
17.

Background

Brucellosis is an important zoonotic disease that affects both humans and animals. We sequenced the full genome and characterised the genetic diversity of two Brucella melitensis isolates from Malaysia and the Philippines. In addition, we performed a comparative whole-genome single nucleotide polymorphism (SNP) analysis of B. melitensis strains collected from around the world, to investigate the potential origin and the history of the global spread of B. melitensis.

Results

Single sequencing runs of each genome resulted in draft genome sequences of MY1483/09 and Phil1136/12, which covered 99.85% and 99.92% of the complete genome sequences, respectively. The B. melitensis genome sequences, and two B. abortus strains used as the outgroup strains, yielded a total of 13,728 SNP sites. Phylogenetic analysis using whole-genome SNPs and geographical distribution of the isolates revealed spatial clustering of the B. melitensis isolates into five genotypes, I, II, III, IV and V. The Mediterranean strains, identified as genotype I, occupied the basal node of the phylogenetic tree, suggesting that B. melitensis may have originated from the Mediterranean regions. All of the Asian B. melitensis strains clustered into genotype II with the SEA strains, including the two isolates sequenced in this study, forming a distinct clade denoted here as genotype IId. Genotypes III, IV and V of B. melitensis demonstrated a restricted geographical distribution, with genotype III representing the African lineage, genotype IV representing the European lineage and genotype V representing the American lineage.

Conclusion

We showed that SNPs retrieved from the B. melitensis draft full genomes were sufficient to resolve the interspecies relationships between B. melitensis strains and to discriminate between the vaccine and endemic strains. Phylogeographic reconstruction of the history of B. melitensis global spread at a finer scale by using whole-genome SNP analyses supported the origin of all B. melitensis strains from the Mediterranean region. The possible global distribution of B. melitensis following the ancient trade routes was also consistent with whole-genome SNP phylogeny. The whole genome SNP phylogenetics analysis, hence is a powerful tool for intraspecies discrimination of closely related species.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1294-x) contains supplementary material, which is available to authorized users.  相似文献   

18.
T-REX (tree and reticulogram reconstruction) is an application to reconstruct phylogenetic trees and reticulation networks from distance matrices. The application includes a number of tree fitting methods like NJ, UNJ or ADDTREE which have been very popular in phylogenetic analysis. At the same time, the software comprises several new methods of phylogenetic analysis such as: tree reconstruction using weights, tree inference from incomplete distance matrices or modeling a reticulation network for a collection of objects or species. T-REX also allows the user to visualize obtained tree or network structures using Hierarchical, Radial or Axial types of tree drawing and manipulate them interactively. AVAILABILITY: T-REX is a freeware package available online at: http://www.fas.umontreal.ca/biol/casgrain/en/labo/t-rex  相似文献   

19.
Tarka and Tarkadectes are Middle Eocene mammals known only from the Rocky Mountains region of North America. Previous work has suggested that they are members of the Plagiomenidae, an extinct family often included in the order Dermoptera. Here we describe a new primate, Tarkops mckennai gen. et sp. nov., from the early Middle Eocene Irdinmanha Formation of Inner Mongolia, China. The new taxon is particularly similar to Tarka and Tarkadectes, but it also displays many features observed in omomyids. A phylogenetic analysis based on a data matrix including 59 taxa and 444 dental characters suggests that Tarkops, Tarka and Tarkadectes form a monophyletic group—the Tarkadectinae—that is nested within the omomyid clade. Within Omomyidae, tarkadectines appear to be closely related to Macrotarsius. Dermoptera, including extant and extinct flying lemurs and plagiomenids, is recognized as a clade nesting within the polyphyletic group of plesiadapiforms, therefore supporting the previous suggestion that the relationship between dermopterans and primates is as close as that between plesiadapiforms and primates. The distribution of tarkadectine primates on both sides of the Pacific Ocean basin suggests that palaeoenvironmental conditions appropriate to sustain primates occurred across a vast expanse of Asia and North America during the Middle Eocene.  相似文献   

20.
We develop a new method for testing a portion of a tree (called a clade) based on multiple tests of many 4-taxon trees in this paper. This is particularly useful when the phylogenetic tree constructed by other methods have a clade that is difficult to explain from a biological point of view. The statement about the test of the clade can be made through the multiple P values from these individual tests. By controlling the familywise error rate or the false discovery rate (FDR), 4 different tree test methods are evaluated through simulation methods. It shows that the combination of the approximately unbiased (AU) test and the FDR-controlling procedure provides strong power along with reasonable type I error rate and less heavy computation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号