期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Properties of phylogenetic trees generated by Yule-type speciation models

Steel M McKenzie A 《Mathematical biosciences》2001,170(1):91-112

We investigate some discrete structural properties of evolutionary trees generated under simple null models of speciation, such as the Yule model. These models have been used as priors in Bayesian approaches to phylogenetic analysis, and also to test hypotheses concerning the speciation process. In this paper we describe new results for three properties of trees generated under such models. Firstly, for a rooted tree generated by the Yule model we describe the probability distribution on the depth (number of edges from the root) of the most recent common ancestor of a random subset of k species. Next we show that, for trees generated under the Yule model, the approximate position of the root can be estimated from the associated unrooted tree, even for trees with a large number of leaves. Finally, we analyse a biologically motivated extension of the Yule model and describe its distribution on tree shapes when speciation occurs in rapid bursts. 相似文献

2.

Statistical Properties of Pairwise Distances between Leaves on a Random Yule Tree

Michael Sheinman Florian Massip Peter F. Arndt 《PloS one》2015,10(3)

A Yule tree is the result of a branching process with constant birth and death rates. Such a process serves as an instructive null model of many empirical systems, for instance, the evolution of species leading to a phylogenetic tree. However, often in phylogeny the only available information is the pairwise distances between a small fraction of extant species representing the leaves of the tree. In this article we study statistical properties of the pairwise distances in a Yule tree. Using a method based on a recursion, we derive an exact, analytic and compact formula for the expected number of pairs separated by a certain time distance. This number turns out to follow a increasing exponential function. This property of a Yule tree can serve as a simple test for empirical data to be well described by a Yule process. We further use this recursive method to calculate the expected number of the n-most closely related pairs of leaves and the number of cherries separated by a certain time distance. To make our results more useful for realistic scenarios, we explicitly take into account that the leaves of a tree may be incompletely sampled and derive a criterion for poorly sampled phylogenies. We show that our result can account for empirical data, using two families of birds species. 相似文献

3.

Screening for partial conjunction hypotheses

Benjamini Y Heller R 《Biometrics》2008,64(4):1215-1222

SUMMARY: We consider the problem of testing for partial conjunction of hypothesis, which argues that at least u out of n tested hypotheses are false. It offers an in-between approach to the testing of the conjunction of null hypotheses against the alternative that at least one is not, and the testing of the disjunction of null hypotheses against the alternative that all hypotheses are not null. We suggest powerful test statistics for testing such a partial conjunction hypothesis that are valid under dependence between the test statistics as well as under independence. We then address the problem of testing many partial conjunction hypotheses simultaneously using the false discovery rate (FDR) approach. We prove that if the FDR controlling procedure in Benjamini and Hochberg (1995, Journal of the Royal Statistical Society, Series B 57, 289-300) is used for this purpose the FDR is controlled under various dependency structures. Moreover, we can screen at all levels simultaneously in order to display the findings on a superimposed map and still control an appropriate FDR measure. We apply the method to examples from microarray analysis and functional magnetic resonance imaging (fMRI), two application areas where the need for partial conjunction analysis has been identified. 相似文献

4.

Stochastic properties of generalised Yule models, with biodiversity applications

Gernhard T Hartmann K Steel M 《Journal of mathematical biology》2008,57(5):713-735

The Yule model is a widely used speciation model in evolutionary biology. Despite its simplicity many aspects of the Yule model have not been explored mathematically. In this paper, we formalise two analytic approaches for obtaining probability densities of individual branch lengths of phylogenetic trees generated by the Yule model. These methods are flexible and permit various aspects of the trees produced by Yule models to be investigated. One of our methods is applicable to a broader class of evolutionary processes, namely the Bellman-Harris models. Our methods have many practical applications including biodiversity and conservation related problems. In this setting the methods can be used to characterise the expected rate of biodiversity loss for Yule trees, as well as the expected gain of including the phylogeny in conservation management. We briefly explore these applications. 相似文献

5.

On the power of some binomial modifications of the Bonferroni multiple test

Teriokhin AT de Meeûs T Guégan JF 《Zhurnal obshche? biologii》2007,68(5):332-340

Widely used in testing statistical hypotheses, the Bonferroni multiple test has a rather low power that entails a high risk to accept falsely the overall null hypothesis and therefore to not detect really existing effects. We suggest that when the partial test statistics are statistically independent, it is possible to reduce this risk by using binomial modifications of the Bonferroni test. Instead of rejecting the null hypothesis when at least one of n partial null hypotheses is rejected at a very high level of significance (say, 0.005 in the case of n = 10), as it is prescribed by the Bonferroni test, the binomial tests recommend to reject the null hypothesis when at least k partial null hypotheses (say, k = [n/2]) are rejected at much lower level (up to 30-50%). We show that the power of such binomial tests is essentially higher as compared with the power of the original Bonferroni and some modified Bonferroni tests. In addition, such an approach allows us to combine tests for which the results are known only for a fixed significance level. The paper contains tables and a computer program which allow to determine (retrieve from a table or to compute) the necessary binomial test parameters, i.e. either the partial significance level (when k is fixed) or the value of k (when the partial significance level is fixed). 相似文献

6.

Experiments in ecology and management: Their logics,functions and interpretations

A. J. UNDERWOOD 《Austral ecology》1990,15(4):365-389

Experimental tests of clearly articulated hypotheses are an increasingly widespread feature of modern marine ecology. Increased use of experiments has not, however, been accompanied by increased understanding of the logical structure of falsificationist tests. Most observations can be explained by several different models or theories. To distinguish among these requires demonstration of the falsity of the consequences or predictions of incorrect models. This is best achieved by deriving from each model one or more hypotheses (predictions) about the type, form or nature of observations that should occur in some not-yet-examined set of circumstances. Because of logical constraints on the possibility of proving the correctness of such hypotheses, they must be inverted to form logical null hypotheses which comprise all alternative possibilities to those predicted in the hypotheses. Correctness or not of null hypotheses can then be ascertained by an appropriately designed experiment (or test), leading to unambiguous rejection or retention of the null hypotheses. The former corroborates the hypotheses and provides support for the correctness of the explanatory model for the original observations. In contrast, retention of a null hypothesis identifies an incorrect model. The growth of knowledge is thus the elimination of false models, theories and explanations. Ecological experiments usually require statistical procedures for determining whether or not null hypotheses should be retained. Construction of statistical null hypotheses (i.e. definitions of parameters of frequency distributions of test statistics) sometimes requires that these be identical to logical hypotheses (and not to the logical nulls). This leads to irrational acceptance of hypotheses and the models or theories from which they were derived. It also poses immense problems for determinations of statistical power of experiments. Ecological experiments are analysed to reveal the nature of, and linkages between, their components in relation to falsificationism, statistical procedures and the logical properties and interpretations of ecological theories. 相似文献

7.

On statistical tests of phylogenetic tree imbalance: the Sackin and other indices revisited

Blum MG François O 《Mathematical biosciences》2005,195(2):141-153

相似文献

8.

Partitioning species diversity across landscapes and regions: a hierarchical analysis of alpha, beta, and gamma diversity 总被引：1，自引：0，他引：1

Crist TO Veech JA Gering JC Summerville KS 《The American naturalist》2003,162(6):734-743

Species diversity may be additively partitioned within and among samples (alpha and beta diversity) from hierarchically scaled studies to assess the proportion of the total diversity (gamma) found in different habitats, landscapes, or regions. We developed a statistical approach for testing null hypotheses that observed partitions of species richness or diversity indices differed from those expected by chance, and we illustrate these tests using data from a hierarchical study of forest-canopy beetles. Two null hypotheses were implemented using individual- and sample-based randomization tests to generate null distributions for alpha and beta components of diversity at multiple sampling scales. The two tests differed in their null distributions and power to detect statistically significant diversity components. Individual-based randomization was more powerful at all hierarchical levels and was sensitive to departures between observed and null partitions due to intraspecific aggregation of individuals. Sample-based randomization had less power but still may be useful for determining whether different habitats show a higher degree of differentiation in species diversity compared with random samples from the landscape. Null hypothesis tests provide a basis for inferences on partitions of species richness or diversity indices at multiple sampling levels, thereby increasing our understanding of how alpha and beta diversity change across spatial scales. 相似文献

9.

When does diversity fit null model predictions? Scale and range size mediate the mid‐domain effect 总被引：1，自引：0，他引：1

Robert R. Dunn Christy M. McCain Nathan J. Sanders 《Global Ecology and Biogeography》2007,16(3):305-312

Aim Recently, a flurry of studies have focused on the extent to which geographical patterns of diversity fit mid-domain effect (MDE) null models. While some studies find strong support for MDE null models, others find little. We test two hypotheses that might explain this variation among studies: small-ranged groups of species are less likely than large-ranged species to show mid-domain peaks in species richness, and mid-domain null model predictions are less robust for smaller spatial extents than for larger spatial extents.
Location We analyse data sets from elevational, riverine, continental and other domains from around the world.
Methods We use a combination of Spearman rank correlations and binomial tests to examine whether differences within and among studies and domains in the predictive power of MDE null models vary with spatial scale and range size.
Results Small-ranged groups of species are less likely to fit mid-domain predictions than large-ranged groups of species. At large spatial extents, diversity patterns of taxonomic groups with large mean range sizes fit MDE null model predictions better than did diversity patterns of groups with small mean range sizes. MDE predictions were more explanatory at larger spatial extents than at smaller extents. Diversity patterns at smaller spatial extents fit MDE predictions poorly across all range sizes. Thus, MDE predictions should be expected to explain patterns of species richness when ranges and the scale of analysis are both large.
Main conclusions Taken together, the support for these hypotheses offers a more sophisticated model of when MDE predictions should be expected to explain patterns of species richness, namely when ranges and the scale of analysis are both large. Thus the circumstances in which the MDE is important are finite and apparently predictable. 相似文献

10.

An exact test for the association between the disease and alleles at highly polymorphic loci with particular interest in the haplotype analysis

Hirotsu C Aoki S Inada T Kitao Y 《Biometrics》2001,57(3):769-778

The association analysis between the disease and genetic alleles is one of the simple methods for localizing the susceptibility locus in the genes. For revealing the association, several statistical tests have been proposed without discussing explicitly the alternative hypotheses. We therefore specify two types of alternative hypotheses (i.e., there is only one susceptibility allele in the locus, and there is an extension or shortening of alleles associated with the disease) and derive exact tests for the respective hypotheses. We also propose to combine these two tests when the prior knowledge is not sufficient enough to specify one of these two hypotheses. In particular, these ideas are extended to the haplotype analysis of three-way association between the disease and bivariate allele frequencies at two closely linked loci. As a by-product, a factorization of the probability distribution of the three-way cell frequencies under the null hypothesis of no three-way interaction is obtained. 相似文献

11.

Modelling p-value distributions to improve theme-driven survival analysis of cancer transcriptome datasets

Esteban Czwan Benedikt Brors David Kipling 《BMC bioinformatics》2010,11(1):19

Background

Theme-driven cancer survival studies address whether the expression signature of genes related to a biological process can predict patient survival time. Although this should ideally be achieved by testing two separate null hypotheses, current methods treat both hypotheses as one. The first test should assess whether a geneset, independent of its composition, is associated with prognosis (frequently done with a survival test). The second test then verifies whether the theme of the geneset is relevant (usually done with an empirical test that compares the geneset of interest with random genesets). Current methods do not test this second null hypothesis because it has been assumed that the distribution of p-values for random genesets (when tested against the first null hypothesis) is uniform. Here we demonstrate that such an assumption is generally incorrect and consequently, such methods may erroneously associate the biology of a particular geneset with cancer prognosis. 相似文献

12.

Significance analysis of groups of genes in expression profiling studies 总被引：1，自引：0，他引：1

Chen JJ Lee T Delongchamp RR Chen T Tsai CA 《Bioinformatics (Oxford, England)》2007,23(16):2104-2112

MOTIVATION: Gene class testing (GCT) is a statistical approach to determine whether some functionally predefined classes of genes express differently under two experimental conditions. GCT computes the P-value of each gene class based on the null distribution and the gene classes are ranked for importance in accordance with their P-values. Currently, two null hypotheses have been considered: the Q1 hypothesis tests the relative strength of association with the phenotypes among the gene classes, and the Q2 hypothesis assesses the statistical significance. These two hypotheses are related but not equivalent. METHOD: We investigate three one-sided and two two-sided test statistics under Q1 and Q2. The null distributions of gene classes under Q1 are generated by permuting gene labels and the null distributions under Q2 are generated by permuting samples. RESULTS: We applied the five statistics to a diabetes dataset with 143 gene classes and to a breast cancer dataset with 508 GO (Gene Ontology) terms. In each statistic, the null distributions of the gene classes under Q1 are different from those under Q2 in both datasets, and their rankings can be different too. We clarify the one-sided and two-sided hypotheses, and discuss some issues regarding the Q1 and Q2 hypotheses for gene class ranking in the GCT. Because Q1 does not deal with correlations among genes, we prefer test based on Q2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. 相似文献

13.

A broad symmetry criterion for nonparametric validity of parametrically based tests in randomized trials

Shinohara RT Frangakis CE Lyketsos CG 《Biometrics》2012,68(1):85-91

相似文献

14.

ConReg-R: Extrapolative recalibration of the empirical distribution of p-values to improve false discovery rate estimates

Juntao Li Puteri Paramita Kwok Pui Choi R Krishna Murthy Karuturi 《Biology direct》2011,6(1):1-14

相似文献

15.

Avoiding misinterpretation of biotic interactions with the intertype K12‐function: population independence vs. random labelling hypotheses

F. Goreaud R. Plissier 《植被学杂志》2003,14(5):681-692

Abstract. The interactions between plants of different species, age or size play an important role in the dynamics of an ecosystem and can induce specific structures. These interactions can be studied by analysing the spatial structure of the corresponding bivariate patterns. The intertype L₁₂‐function has recently been successfully used in many papers for that purpose. However, when interpreting the results obtained with ecological data, at least two different null hypotheses – independence or random labelling – can be appropriate, depending on the context of the study and the nature of the data. As these two hypotheses correspond to different confidence intervals, an inappropriate choice of the null hypothesis can lead to misinterpretations of biotic interactions when studying ecological data. This problem has rarely been mentioned in the literature. In this paper we clarify the differences between these two null hypotheses, and illustrate the risk of misinterpretation when using an inappropriate null hypothesis. We review the main characteristics of these two hypotheses, and analyse the spatial structure of both real data from forest stands and simulated virtual stands of different structures. We demonstrate that the risk of misinterpretation is quite high, and that extreme misinterpretations, i.e. cases leading to opposite conclusions in terms of spatial interaction, can occur in a significant number of cases. We therefore propose some guidelines to help ecologists avoid such misinterpretations. 相似文献

16.

Rate heterogeneity across Squamata,misleading ancestral state reconstruction and the importance of proper null model specification

下载免费PDF全文

S. Harrington T. W. Reeder 《Journal of evolutionary biology》2017,30(2):313-325

The binary‐state speciation and extinction (BiSSE) model has been used in many instances to identify state‐dependent diversification and reconstruct ancestral states. However, recent studies have shown that the standard procedure of comparing the fit of the BiSSE model to constant‐rate birth–death models often inappropriately favours the BiSSE model when diversification rates vary in a state‐independent fashion. The newly developed HiSSE model enables researchers to identify state‐dependent diversification rates while accounting for state‐independent diversification at the same time. The HiSSE model also allows researchers to test state‐dependent models against appropriate state‐independent null models that have the same number of parameters as the state‐dependent models being tested. We reanalyse two data sets that originally used BiSSE to reconstruct ancestral states within squamate reptiles and reached surprising conclusions regarding the evolution of toepads within Gekkota and viviparity across Squamata. We used this new method to demonstrate that there are many shifts in diversification rates across squamates. We then fit various HiSSE submodels and null models to the state and phylogenetic data and reconstructed states under these models. We found that there is no single, consistent signal for state‐dependent diversification associated with toepads in gekkotans or viviparity across all squamates. Our reconstructions show limited support for the recently proposed hypotheses that toepads evolved multiple times independently in Gekkota and that transitions from viviparity to oviparity are common in Squamata. Our results highlight the importance of considering an adequate pool of models and null models when estimating diversification rate parameters and reconstructing ancestral states. 相似文献

17.

Clades, clans, and reciprocal monophyly under neutral evolutionary models

Zhu S Degnan JH Steel M 《Theoretical population biology》2011,79(4):220-227

The Yule model and the coalescent model are two neutral stochastic models for generating trees in phylogenetics and population genetics, respectively. Although these models are quite different, they lead to identical distributions concerning the probability that pre-specified groups of taxa form monophyletic groups (clades) in the tree. We extend earlier work to derive exact formulae for the probability of finding one or more groups of taxa as clades in a rooted tree, or as ‘clans’ in an unrooted tree. Our findings are relevant for calculating the statistical significance of observed monophyly and reciprocal monophyly in phylogenetics. 相似文献

18.

Studying spatial interactions between sympatric populations of large herbivores: a null model approach

Emmanuelle Richard Clément Calenge Sonia Saïd Jean‐Luc Hamann Jean‐Michel Gaillard 《Ecography》2013,36(2):157-165

相似文献

19.

Toward multifactorial null models of range contraction in terrestrial vertebrates

下载免费PDF全文

Pablo M. Lucas Manuela González‐Suárez Eloy Revilla 《Ecography》2016,39(11):1100-1108

相似文献

20.

Tree models for macroevolution and phylogenetic analysis

Jones GR 《Systematic biology》2011,60(6):735-746

It has long been recognized that phylogenetic trees are more unbalanced than those generated by a Yule process. Recently, the degree of this imbalance has been quantified using the large set of phylogenetic trees available in the TreeBASE data set. In this article, a more precise analysis of imbalance is undertaken. Trees simulated under a range of models are compared with trees from TreeBASE and two smaller data sets. Several simple models can match the amount of imbalance measured in real data. Most of them also match the variance of imbalance among empirical trees to a remarkable degree. Statistics are developed to measure balance and to distinguish between trees with the same overall imbalance. The match between models and data for these statistics is investigated. In particular, age-dependent (Bellman-Harris) branching process are studied in detail. It remains difficult to separate the process of macroevolution from biases introduced by sampling. The lessons for phylogenetic analysis are clearer. In particular, the use of the usual proportional to distinguishable arrangements (uniform) prior on tree topologies in Bayesian phylogenetic analysis is not recommended. 相似文献