期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Bayesian phylogenetic model selection using reversible jump Markov chain Monte Carlo

Huelsenbeck JP Larget B Alfaro ME 《Molecular biology and evolution》2004,21(6):1123-1133

A common problem in molecular phylogenetics is choosing a model of DNA substitution that does a good job of explaining the DNA sequence alignment without introducing superfluous parameters. A number of methods have been used to choose among a small set of candidate substitution models, such as the likelihood ratio test, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and Bayes factors. Current implementations of any of these criteria suffer from the limitation that only a small set of models are examined, or that the test does not allow easy comparison of non-nested models. In this article, we expand the pool of candidate substitution models to include all possible time-reversible models. This set includes seven models that have already been described. We show how Bayes factors can be calculated for these models using reversible jump Markov chain Monte Carlo, and apply the method to 16 DNA sequence alignments. For each data set, we compare the model with the best Bayes factor to the best models chosen using AIC and BIC. We find that the best model under any of these criteria is not necessarily the most complicated one; models with an intermediate number of substitution types typically do best. Moreover, almost all of the models that are chosen as best do not constrain a transition rate to be the same as a transversion rate, suggesting that it is the transition/transversion rate bias that plays the largest role in determining which models are selected. Importantly, the reversible jump Markov chain Monte Carlo algorithm described here allows estimation of phylogeny (and other phylogenetic model parameters) to be performed while accounting for uncertainty in the model of DNA substitution. 相似文献

2.

QTL mapping in outbred half-sib families using Bayesian model selection

Fang M Liu J Sun D Zhang Y Zhang Q Zhang Y Zhang S 《Heredity》2011,107(3):265-276

In this article, we propose a model selection method, the Bayesian composite model space approach, to map quantitative trait loci (QTL) in a half-sib population for continuous and binary traits. In our method, the identity-by-descent-based variance component model is used. To demonstrate the performance of this model, the method was applied to map QTL underlying production traits on BTA6 in a Chinese half-sib dairy cattle population. A total of four QTLs were detected, whereas only one QTL was identified using the traditional least square (LS) method. We also conducted two simulation experiments to validate the efficiency of our method. The results suggest that the proposed method based on a multiple-QTL model is efficient in mapping multiple QTL for an outbred half-sib population and is more powerful than the LS method based on a single-QTL model. 相似文献

3.

Comparing vector–host and SIR models for dengue transmission

Abhishek Pandey Anuj Mubayi Jan Medlock 《Mathematical biosciences》2013

Various simple mathematical models have been used to investigate dengue transmission. Some of these models explicitly model the mosquito population, while others model the mosquitoes implicitly in the transmission term. We study the impact of modeling assumptions on the dynamics of dengue in Thailand by fitting dengue hemorrhagic fever (DHF) data to simple vector–host and SIR models using Bayesian Markov chain Monte Carlo estimation. The parameter estimates obtained for both models were consistent with previous studies. Most importantly, model selection found that the SIR model was substantially better than the vector–host model for the DHF data from Thailand. Therefore, explicitly incorporating the mosquito population may not be necessary in modeling dengue transmission for some populations. 相似文献

4.

Bayesian Estimation of Positively Selected Sites

Huelsenbeck JP Dyer KA 《Journal of molecular evolution》2004,58(6):661-672

In protein-coding DNA sequences, historical patterns of selection can be inferred from amino acid substitution patterns. High relative rates of nonsynonymous to synonymous changes (=d _N/d _S) are a clear indicator of positive, or directional, selection, and several recently developed methods attempt to distinguish these sites from those under neutral or purifying selection. One method uses an empirical Bayesian framework that accounts for varying selective pressures across sites while conditioning on the parameters of the model of DNA evolution and on the phylogenetic history. We describe a method that identifies sites under diversifying selection using a fully Bayesian framework. Similar to earlier work, the method presented here allows the rate of nonsynonymous to synonymous changes to vary among sites. The significant difference in using a fully Bayesian approach lies in our ability to account for uncertainty in parameters including the tree topology, branch lengths, and the codon model of DNA substitution. We demonstrate the utility of the fully Bayesian approach by applying our method to a data set of the vertebrate -globin gene. Compared to a previous analysis of this data set, the hierarchical model found most of the same sites to be in the positive selection class, but with a few striking exceptions. 相似文献

5.

Bayesian point estimation of quantitative trait loci

Sisson SA Hurn MA 《Biometrics》2004,60(1):60-68

In this article, we consider the problem of the estimation of quantitative trait loci (QTL), those chromosomal regions at which genetic information affecting some quantitative trait is encoded. Generally the number of such encoding sites is unknown, and associations between neutral molecular marker genotypes and observed trait phenotypes are sought to locate them. We consider a Bayesian model for simple experimental designs, and discuss the existing approaches to inference for this problem. In particular, we focus on locating positions of the best candidate markers segregating for the trait, a situation which is of primary interest in comparative mapping. We introduce a loss function for estimating both the number of QTL and their location, and we illustrate its application via simulated and real data. 相似文献

6.

Efficient Bayes factor estimation from the reversible jump output

Bartolucci Francesco; Scaccia Luisa; Mira Antonietta 《Biometrika》2006,93(1):41-52

相似文献

7.

Bayesian analysis of case-control studies with categorical covariates

Seaman Shaun R.; Richardson Sylvia 《Biometrika》2001,88(4):1073-1088

相似文献

8.

Bayesian analysis of a time series of counts with covariates: an application to the control of an infectious disease

Hay JL Pettitt AN 《Biostatistics (Oxford, England)》2001,2(4):433-444

This paper presents a Bayesian analysis of a time series of counts to assess its dependence on an explanatory variable. The time series represented is the incidence of the infectious disease ESBL-producing Klebsiella pneumoniae in an Australian hospital and the explanatory variable is the number of grams of antibiotic (third generation) cephalosporin used during that time. We demonstrate that there is a statistically significant relationship between disease occurrence and use of the antibiotic, lagged by three months. The model used is a parameter-driven model in the form of a generalized linear mixed model. Comparison of models is made in terms of mean square error. 相似文献

9.

Equivalence of prospective and retrospective models in the Bayesian analysis of case-control studies

Seaman Shaun R.; Richardson Sylvia 《Biometrika》2004,91(1):15-25

相似文献

10.

A comparison of three models of the inositol trisphosphate receptor

Sneyd J Falcke M Dufour JF Fox C 《Progress in biophysics and molecular biology》2004,85(2-3):121-140

The inositol (1,4,5)-trisphosphate receptor (IPR) plays a crucial role in calcium dynamics in a wide range of cell types, and is often a central feature in quantitative models of calcium oscillations and waves. We compare three mathematical models of the IPR, fitting each of them to the same data set to determine ranges for the parameter values. Each of the fits indicates that fast activation of the receptor, followed by slow inactivation, is an important feature of the model, and also that the speed of inositol trisphosphate (IP₃) binding cannot necessarily be assumed to be faster than Ca²⁺ activation. In addition, the model which assumed saturating binding rates of Ca²⁺ to the IPR demonstrated the best fit. However, lack of convergence in the fitting procedure indicates that responses to step increases of [Ca²⁺] and [IP₃] provide insufficient data to determine the parameters unambiguously in any of the models. 相似文献

11.

Modeling the euglycemic hyperinsulinemic clamp by stochastic differential equations

Picchini U Ditlevsen S De Gaetano A 《Journal of mathematical biology》2006,53(5):771-796

The Euglycemic Hyperinsulinemic Clamp (EHC) is the most widely used experimental procedure for the determination of insulin sensitivity. In the present study, 16 subjects with BMI between 18.5 and 63.6 kg/m² have been studied with a long-duration (5 hours) EHC. In order to explain the oscillations of glycemia occurring in response to the hyperinsulinization and to the continuous glucose infusion at varying speeds, we first hypothesized a system of ordinary differential equations (ODEs), with limited success. We then extended the model and represented the experiment using a system of stochastic differential equations (SDEs). The latter allow for distinction between (i) random variation imputable to observation error and (ii) system noise (intrinsic variability of the metabolic system), due to a variety of influences which change over time. The stochastic model of the EHC was fitted to data and the system noise was estimated by means of a (simulated) maximum likelihood procedure, for a series of different hypothetical measurement error values. We showed that, for the whole range of reasonable measurement error values: (i) the system noise estimates are non-negligible; and (ii) these estimates are robust to changes in the likely value of the measurement error. Explicit expression of system noise is physiologically relevant in this case, since glucose uptake rate is known to be affected by a host of additive influences, usually neglected when modeling metabolism. While in some of the studied subjects system noise appeared to only marginally affect the dynamics, in others the system appeared to be driven more by the erratic oscillations in tissue glucose transport rather than by the overall glucose-insulin control system. It is possible that the quantitative relevance of the unexpressed effects (system noise) should be considered in other physiological situations, represented so far only with deterministic models.The work was supported by grants from the Danish Medical Research Council and the Lundbeck Foundation to S. Ditlevsen. 相似文献

12.

Model selection and parameter estimation for ion channel recordings with an application to the K + outward-rectifier in barley leaf

de Gunst MC Schouten JG 《Journal of mathematical biology》2005,50(3):233-256

We present a statistical method, and its accompanying algorithms, for the selection of a mathematical model of the gating mechanism of an ion channel and for the estimation of the parameters of this model. The method assumes a hidden Markov model that incorporates filtering, colored noise and state-dependent white excess noise for the recorded data. The model selection and parameter estimation are performed via a Bayesian approach using Markov chain Monte Carlo. The method is illustrated by its application to single-channel recordings of the K⁺ outward-rectifier in barley leaf.Acknowledgement The authors thank Sake Vogelzang, Bert van Duijn and Bert de Boer for their helpful advice and useful comments and suggestions. 相似文献

13.

Time-varying Markov regression random-effect model with Bayesian estimation procedures: Application to dynamics of functional recovery in patients with stroke

Shin-Liang Pan 《Mathematical biosciences》2010,227(1):72-79

The rates of functional recovery after stroke tend to decrease with time. Time-varying Markov processes (TVMP) may be more biologically plausible than time-invariant Markov process for modeling such data. However, analysis of such stochastic processes, particularly tackling reversible transitions and the incorporation of random effects into models, can be analytically intractable. We make use of ordinary differential equations to solve continuous-time TVMP with reversible transitions. The proportional hazard form was used to assess the effects of an individual’s covariates on multi-state transitions with the incorporation of random effects that capture the residual variation after being explained by measured covariates under the concept of generalized linear model. We further built up Bayesian directed acyclic graphic model to obtain full joint posterior distribution. Markov chain Monte Carlo (MCMC) with Gibbs sampling was applied to estimate parameters based on posterior marginal distributions with multiple integrands. The proposed method was illustrated with empirical data from a study on the functional recovery after stroke. 相似文献

14.

Estimation of amino acid residue substitution rates at local spatial regions and application in protein function inference: a Bayesian Monte Carlo approach 总被引：2，自引：0，他引：2

Tseng YY Liang J 《Molecular biology and evolution》2006,23(2):421-436

The amino acid sequences of proteins provide rich information for inferring distant phylogenetic relationships and for predicting protein functions. Estimating the rate matrix of residue substitutions from amino acid sequences is also important because the rate matrix can be used to develop scoring matrices for sequence alignment. Here we use a continuous time Markov process to model the substitution rates of residues and develop a Bayesian Markov chain Monte Carlo method for rate estimation. We validate our method using simulated artificial protein sequences. Because different local regions such as binding surfaces and the protein interior core experience different selection pressures due to functional or stability constraints, we use our method to estimate the substitution rates of local regions. Our results show that the substitution rates are very different for residues in the buried core and residues on the solvent-exposed surfaces. In addition, the rest of the proteins on the binding surfaces also have very different substitution rates from residues. Based on these findings, we further develop a method for protein function prediction by surface matching using scoring matrices derived from estimated substitution rates for residues located on the binding surfaces. We show with examples that our method is effective in identifying functionally related proteins that have overall low sequence identity, a task known to be very challenging. 相似文献

15.

A Bayesian change-point analysis of electromyographic data: detecting muscle activation patterns and associated applications

Johnson TD Elashoff RM Harkema SJ 《Biostatistics (Oxford, England)》2003,4(1):143-164

Many facets of neuromuscular activation patterns and control can be assessed via electromyography and are important for understanding the control of locomotion. After spinal cord injury, muscle activation patterns can affect locomotor recovery. We present a novel application of reversible jump Markov chain Monte Carlo simulation to estimate activation patterns from electromyographic data. We assume the data to be a zero-mean, heteroscedastic process. The variance is explicitly modeled using a step function. The number and location of points of discontinuity, or change-points, in the step function, the inter-change-point variances, and the overall mean are jointly modeled along with the mean and variance from baseline data. The number of change-points is considered a nuisance parameter and is integrated out of the posterior distribution. Whereas current methods of detecting activation patterns are deterministic or provide only point estimates, ours provides distributional estimates of muscle activation. These estimates, in turn, are used to estimate physiologically relevant quantities such as muscle coactivity, total integrated energy, and average burst duration and to draw valid statistical inferences about these quantities. 相似文献

16.

Maximum likelihood and Bayesian methods for estimating the distribution of selective effects among classes of mutations using DNA polymorphism data

Bustamante CD Nielsen R Hartl DL 《Theoretical population biology》2003,63(2):91-103

Maximum likelihood and Bayesian approaches are presented for analyzing hierarchical statistical models of natural selection operating on DNA polymorphism within a panmictic population. For analyzing Bayesian models, we present Markov chain Monte-Carlo (MCMC) methods for sampling from the joint posterior distribution of parameters. For frequentist analysis, an Expectation-Maximization (EM) algorithm is presented for finding the maximum likelihood estimate of the genome wide mean and variance in selection intensity among classes of mutations. The framework presented here provides an ideal setting for modeling mutations dispersed through the genome and, in particular, for the analysis of how natural selection operates on different classes of single nucleotide polymorphisms (SNPs). 相似文献

17.

On parametric estimation for mortal branching processes

BECKER NIELS 《Biometrika》1974,61(2):393-399

相似文献

18.

Hierarchical Bayesian modeling in dichotomous processes in the presence of nonresponse

Oleson JJ He CZ 《Biometrics》2004,60(1):50-59

Sampling units that do not answer a survey may dramatically affect the estimation results of interest. The response may even be conditional on the outcome of interest in the survey. If estimates are found using only those who responded, the estimate may be biased, known as nonresponse bias. We are interested in finding estimates of success rates from a survey. We begin by looking at two current Bayesian approaches to treating nonresponse in a hierarchical model. However, these approaches do not consider possible spatial correlations between domains for either success rate or response rate. We build a Bayesian hierarchical spatial model to explicitly estimate the success rate, response rate given success, and response rate given failure. The success rates in the domains of the survey are allowed to be spatially correlated. We also allow spatial dependence between domains in both response rate given success and response rate given failure. Spatial dependence is induced by a common latent spatial structure between the two conditional response rates. We use the 1998 Missouri Turkey Hunting Survey to illustrate this methodology. We find significant spatial correlation in the success rates and incorporating nonrespondents has an impact on the success rate estimates. 相似文献

19.

Bayesian analysis of covariance matrices and dynamic models for longitudinal data 总被引：3，自引：0，他引：3

Daniels Michael J.; Pourahmadi Mohsen 《Biometrika》2002,89(3):553-566

相似文献

20.

On the accelerated failure time model for current status and interval censored data 总被引：1，自引：0，他引：1

Tian Lu; Cai Tianxi 《Biometrika》2006,93(2):329-342

相似文献