期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Bayesian phylogenetic model selection using reversible jump Markov chain Monte Carlo

Huelsenbeck JP Larget B Alfaro ME 《Molecular biology and evolution》2004,21(6):1123-1133

A common problem in molecular phylogenetics is choosing a model of DNA substitution that does a good job of explaining the DNA sequence alignment without introducing superfluous parameters. A number of methods have been used to choose among a small set of candidate substitution models, such as the likelihood ratio test, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and Bayes factors. Current implementations of any of these criteria suffer from the limitation that only a small set of models are examined, or that the test does not allow easy comparison of non-nested models. In this article, we expand the pool of candidate substitution models to include all possible time-reversible models. This set includes seven models that have already been described. We show how Bayes factors can be calculated for these models using reversible jump Markov chain Monte Carlo, and apply the method to 16 DNA sequence alignments. For each data set, we compare the model with the best Bayes factor to the best models chosen using AIC and BIC. We find that the best model under any of these criteria is not necessarily the most complicated one; models with an intermediate number of substitution types typically do best. Moreover, almost all of the models that are chosen as best do not constrain a transition rate to be the same as a transversion rate, suggesting that it is the transition/transversion rate bias that plays the largest role in determining which models are selected. Importantly, the reversible jump Markov chain Monte Carlo algorithm described here allows estimation of phylogeny (and other phylogenetic model parameters) to be performed while accounting for uncertainty in the model of DNA substitution. 相似文献

2.

An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants 总被引：2，自引：0，他引：2

Moller J.; Pettitt A. N.; Reeves R.; Berthelsen K. K. 《Biometrika》2006,93(2):451-458

相似文献

3.

Miscellanea. On quantile estimation and Markov chain Monte Carlo convergence

Brooks SP; Roberts GO 《Biometrika》1999,86(3):710-717

相似文献

4.

Markov chain Monte Carlo methods for switching diffusion models 总被引：1，自引：0，他引：1

Liechty John C.; Roberts Gareth O. 《Biometrika》2001,88(2):299-315

相似文献

5.

Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo

Pagel M Meade A 《Philosophical transactions of the Royal Society of London. Series B, Biological sciences》2008,363(1512):3955-3964

The rate at which a given site in a gene sequence alignment evolves over time may vary. This phenomenon--known as heterotachy--can bias or distort phylogenetic trees inferred from models of sequence evolution that assume rates of evolution are constant. Here, we describe a phylogenetic mixture model designed to accommodate heterotachy. The method sums the likelihood of the data at each site over more than one set of branch lengths on the same tree topology. A branch-length set that is best for one site may differ from the branch-length set that is best for some other site, thereby allowing different sites to have different rates of change throughout the tree. Because rate variation may not be present in all branches, we use a reversible-jump Markov chain Monte Carlo algorithm to identify those branches in which reliable amounts of heterotachy occur. We implement the method in combination with our 'pattern-heterogeneity' mixture model, applying it to simulated data and five published datasets. We find that complex evolutionary signals of heterotachy are routinely present over and above variation in the rate or pattern of evolution across sites, that the reversible-jump method requires far fewer parameters than conventional mixture models to describe it, and serves to identify the regions of the tree in which heterotachy is most pronounced. The reversible-jump procedure also removes the need for a posteriori tests of 'significance' such as the Akaike or Bayesian information criterion tests, or Bayes factors. Heterotachy has important consequences for the correct reconstruction of phylogenies as well as for tests of hypotheses that rely on accurate branch-length information. These include molecular clocks, analyses of tempo and mode of evolution, comparative studies and ancestral state reconstruction. The model is available from the authors' website, and can be used for the analysis of both nucleotide and morphological data. 相似文献

6.

A comparison of strategies for Markov chain Monte Carlo computation in quantitative genetics

Rasmus Waagepetersen Noelia Ibán?z-Escriche Daniel Sorensen 《遗传、选种与进化》2008,40(2):161-176

In quantitative genetics, Markov chain Monte Carlo (MCMC) methods are indispensable for statistical inference in non-standard models like generalized linear models with genetic random effects or models with genetically structured variance heterogeneity. A particular challenge for MCMC applications in quantitative genetics is to obtain efficient updates of the high-dimensional vectors of genetic random effects and the associated covariance parameters. We discuss various strategies to approach this problem including reparameterization, Langevin-Hastings updates, and updates based on normal approximations. The methods are compared in applications to Bayesian inference for three data sets using a model with genetically structured variance heterogeneity. 相似文献

7.

A tutorial introduction to Bayesian inference for stochastic epidemic models using Markov chain Monte Carlo methods 总被引：1，自引：0，他引：1

Philip D. O&#x;Neill 《Mathematical biosciences》2002,180(1-2)

Recent Bayesian methods for the analysis of infectious disease outbreak data using stochastic epidemic models are reviewed. These methods rely on Markov chain Monte Carlo methods. Both temporal and non-temporal data are considered. The methods are illustrated with a number of examples featuring different models and datasets. 相似文献

8.

Markov chain Monte Carlo model determination for hierarchical and graphical log-linear models 总被引：6，自引：0，他引：6

Dellaportas P; Forster JJ 《Biometrika》1999,86(3):615-633

相似文献

9.

Detecting recombination in 4-taxa DNA sequence alignments with Bayesian hidden Markov models and Markov chain Monte Carlo

Husmeier D McGuire G 《Molecular biology and evolution》2003,20(3):315-337

This article presents a statistical method for detecting recombination in DNA sequence alignments, which is based on combining two probabilistic graphical models: (1) a taxon graph (phylogenetic tree) representing the relationship between the taxa, and (2) a site graph (hidden Markov model) representing interactions between different sites in the DNA sequence alignments. We adopt a Bayesian approach and sample the parameters of the model from the posterior distribution with Markov chain Monte Carlo, using a Metropolis-Hastings and Gibbs-within-Gibbs scheme. The proposed method is tested on various synthetic and real-world DNA sequence alignments, and we compare its performance with the established detection methods RECPARS, PLATO, and TOPAL, as well as with two alternative parameter estimation schemes. 相似文献

10.

A Markov chain Monte Carlo strategy for sampling from the joint posterior distribution of pedigrees and population parameters under a Fisher-Wright model with partial selfing

Wilson IJ Dawson KJ 《Theoretical population biology》2007,72(3):436-458

A simple population genetic model is presented for a hermaphrodite annual species, allowing both selfing and outcrossing. Those male gametes (pollen) responsible for outcrossing are assumed to disperse much further than seeds. Under this model, the pedigree of a sample from a single locality is loop-free. A novel Markov chain Monte Carlo strategy is presented for sampling from the joint posterior distribution of the pedigree of such a sample and the parameters of the population genetic model (including the selfing rate) given the genotypes of the sampled individuals at unlinked marker loci. The computational costs of this Markov chain Monte Carlo strategy scale well with the number of individuals in the sample, and the number of marker loci, but increase exponentially with the age (time since colonisation from the source population) of the local population. Consequently, this strategy is particularly suited to situations where the sample has been collected from a population which is the result of a recent colonisation process. 相似文献

11.

Reversible jump Markov chain Monte Carlo computation and Bayesian model determination 总被引：102，自引：0，他引：102

GREEN PETER J. 《Biometrika》1995,82(4):711-732

Markov chain Monte Carlo methods for Bayesian computation haveuntil recently been restricted to problems where the joint distributionof all variables has a density with respect to some fixed standardunderlying measure. They have therefore not been available forapplication to Bayesian model determination, where the dimensionalityof the parameter vector is typically not fixed. This paper proposesa new framework for the construction of reversible Markov chainsamplers that jump between parameter subspaces of differingdimensionality, which is flexible and entirely constructive.It should therefore have wide applicability in model determinationproblems. The methodology is illustrated with applications tomultiple change-point analysis in one and two dimensions, andto a Bayesian comparison of binomial experiments. 相似文献

12.

Population-Based Reversible Jump Markov Chain Monte Carlo 总被引：2，自引：0，他引：2

Jasra Ajay; Stephens David A.; Holmes Christopher C. 《Biometrika》2007,94(4):787-807

We present an extension of population-based Markov chain MonteCarlo to the transdimensional case. A major challenge is thatof simulating from high- and transdimensional target measures.In such cases, Markov chain Monte Carlo methods may not adequatelytraverse the support of the target; the simulation results willbe unreliable. We develop population methods to deal with suchproblems, and give a result proving the uniform ergodicity ofthese population algorithms, under mild assumptions. This resultis used to demonstrate the superiority, in terms of convergencerate, of a population transition kernel over a reversible jumpsampler for a Bayesian variable selection problem. We also givean example of a population algorithm for a Bayesian multivariatemixture model with an unknown number of components. This isapplied to gene expression data of 1000 data points in six dimensionsand it is demonstrated that our algorithm outperforms some competingMarkov chain samplers. In this example, we show how to combinethe methods of parallel chains (Geyer, 1991), tempering (Geyer& Thompson, 1995), snooker algorithms (Gilks et al., 1994),constrained sampling and delayed rejection (Green & Mira,2001). 相似文献

13.

Phylogenetic tree construction using sequential stochastic approximation Monte Carlo 总被引：1，自引：0，他引：1

Cheon S Liang F 《Bio Systems》2008,91(1):94-107

Monte Carlo methods have received much attention recently in the literature of phylogenetic tree construction. However, they often suffer from two difficulties, the curse of dimensionality and the local-trap problem. The former one is due to that the number of possible phylogenetic trees increases at a super-exponential rate as the number of taxa increases. The latter one is due to that the phylogenetic tree has often a rugged energy landscape. In this paper, we propose a new phylogenetic tree construction method, which attempts to alleviate these two difficulties simultaneously by making use of the sequential structure of phylogenetic trees in conjunction with stochastic approximation Monte Carlo (SAMC) simulations. The use of the sequential structure of the problem provides substantial help to reduce the curse of dimensionality in simulations, and SAMC effectively prevents the system from getting trapped in local energy minima. The new method is compared with a variety of existing Bayesian and non-Bayesian methods on simulated and real datasets. Numerical results are in favor of the new method in terms of quality of the resulting phylogenetic trees. 相似文献

14.

Alternative implementations of Monte Carlo EM algorithms for likelihood inferences

Louis Alberto García-Cortés Daniel Sorensen 《遗传、选种与进化》2001,33(4):443-452

Two methods of computing Monte Carlo estimators of variance components using restricted maximum likelihood via the expectation-maximisation algorithm are reviewed. A third approach is suggested and the performance of the methods is compared using simulated data. 相似文献

15.

Data cloning: easy maximum likelihood estimation for complex ecological models using Bayesian Markov chain Monte Carlo methods 总被引：1，自引：0，他引：1

Lele SR Dennis B Lutscher F 《Ecology letters》2007,10(7):551-563

We introduce a new statistical computing method, called data cloning, to calculate maximum likelihood estimates and their standard errors for complex ecological models. Although the method uses the Bayesian framework and exploits the computational simplicity of the Markov chain Monte Carlo (MCMC) algorithms, it provides valid frequentist inferences such as the maximum likelihood estimates and their standard errors. The inferences are completely invariant to the choice of the prior distributions and therefore avoid the inherent subjectivity of the Bayesian approach. The data cloning method is easily implemented using standard MCMC software. Data cloning is particularly useful for analysing ecological situations in which hierarchical statistical models, such as state-space models and mixed effects models, are appropriate. We illustrate the method by fitting two nonlinear population dynamics models to data in the presence of process and observation noise. 相似文献

16.

Bayesian analysis of discrete survival data with a hidden Markov chain

Kozumi H 《Biometrics》2000,56(4):1002-1006

This paper considers the discrete survival data from a Bayesian point of view. A sequence of the baseline hazard functions, which plays an important role in the discrete hazard function, is modeled with a hidden Markov chain. It is explained how the resultant model is implemented via Markov chain Monte Carlo methods. The model is illustrated by an application of real data. 相似文献

17.

A quantitative approach for polymerase chain reactions based on a hidden Markov model

Nadia Lalam 《Journal of mathematical biology》2009,59(4):517-533

Polymerase chain reaction (PCR) is a major DNA amplification technology from molecular biology. The quantitative analysis of PCR aims at determining the initial amount of the DNA molecules from the observation of typically several PCR amplifications curves. The mainstream observation scheme of the DNA amplification during PCR involves fluorescence intensity measurements. Under the classical assumption that the measured fluorescence intensity is proportional to the amount of present DNA molecules, and under the assumption that these measurements are corrupted by an additive Gaussian noise, we analyze a single amplification curve using a hidden Markov model(HMM). The unknown parameters of the HMM may be separated into two parts. On the one hand, the parameters from the amplification process are the initial number of the DNA molecules and the replication efficiency, which is the probability of one molecule to be duplicated. On the other hand, the parameters from the observational scheme are the scale parameter allowing to convert the fluorescence intensity into the number of DNA molecules and the mean and variance characterizing the Gaussian noise. We use the maximum likelihood estimation procedure to infer the unknown parameters of the model from the exponential phase of a single amplification curve, the main parameter of interest for quantitative PCR being the initial amount of the DNA molecules. An illustrative example is provided. This research was financed by the Swedish foundation for Strategic Research through the Gothenburg Mathematical Modelling Centre. 相似文献

18.

A Monte Carlo method for Bayesian analysis of linkage between single markers and quantitative trait loci. II. A simulation study

G. Thaller I. Hoeschele 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1996,93(7):1167-1174

A Bayesian approach to the statistical mapping of Quantitative Trait Loci (QTLs) using single markers was implemented via Markov Chain Monte Carlo (MCMC) algorithms for parameter estimation and hypothesis testing. Parameters were estimated by marginal posterior means computed with a Gibbs sampler with data augmentation. Variables sampled included the augmented data (marker-QTL genotypes, polygenic effects), the event of linkage or nonlinkage, and the parameters (allele frequencies, QTL substitution effect, recombination rate, polygenic and residual variances). The analysis was evaluated empirically via application to simulated granddaughter designs consisting of 2000 sons, 20 related sires and their ancestors. Results obtained in this study and preliminary work on multiple linked markers and multiple QTLs support the usefulness of the Bayesian method for the statistical mapping of QTLs. 相似文献

19.

Computation of identity by descent probabilities conditional on DNA markers via a Monte Carlo Markov Chain method

Miguel Pérez-Enciso Luis Varona Max F Rothschild 《遗传、选种与进化》2000,32(5):467-482

The accurate estimation of the probability of identity by descent (IBD) at loci or genome positions of interest is paramount to the genetic study of quantitative and disease resistance traits. We present a Monte Carlo Markov Chain method to compute IBD probabilities between individuals conditional on DNA markers and on pedigree information. The IBDs can be obtained in a completely general pedigree at any genome position of interest, and all marker and pedigree information available is used. The method can be split into two steps at each iteration. First, phases are sampled using current genotypic configurations of relatives and second, crossover events are simulated conditional on phases. Internal track is kept of all founder origins and crossovers such that the IBD probabilities averaged over replicates are rapidly obtained. We illustrate the method with some examples. First, we show that all pedigree information should be used to obtain line origin probabilities in F2 crosses. Second, the distribution of genetic relationships between half and full sibs is analysed in both simulated data and in real data from an F2 cross in pigs. 相似文献

20.

The structure of polymer chains in confinement. A Monte Carlo study

Piotr Romiszowski Andrzej Sikorski 《Journal of molecular modeling》2009,15(6):681-686

A coarse-grained model of polymer star chains confined in two parallel impenetrable surfaces, which were attractive for polymer beads was studied. The flexible homopolymer chains were built of united atoms whose positions in space were restricted to vertices of a simple cubic lattice. The chains were modeled in good solvent conditions and, thus, there were no long-range specific interactions between polymer beads—only the excluded volume was present. The influence of the polymer density and the distances between the confining surfaces on the properties of star-branched polymers was studied. It is shown that the chains adsorbed on one surface could change their position so that they swap between both surfaces with frequency depending on the size of the slit and on the density of the system only. The increase of the polymer density diminished the frequency of jumps and caused that chains became only partially adsorbed. The analysis of structural elements of chains showed that the increase of the density of the system leads to increase of the number of bridges connecting the two adsorbing surfaces, thus, the frequency of jumps between them decreases. 相似文献