共查询到16条相似文献,搜索用时 0 毫秒
1.
A tutorial introduction to Bayesian inference for stochastic epidemic models using Markov chain Monte Carlo methods 总被引:1,自引:0,他引:1
Philip D. ONeill 《Mathematical biosciences》2002,180(1-2)
Recent Bayesian methods for the analysis of infectious disease outbreak data using stochastic epidemic models are reviewed. These methods rely on Markov chain Monte Carlo methods. Both temporal and non-temporal data are considered. The methods are illustrated with a number of examples featuring different models and datasets. 相似文献
2.
A two-component model for counts of infectious diseases 总被引:1,自引:0,他引:1
We propose a stochastic model for the analysis of time series of disease counts as collected in typical surveillance systems on notifiable infectious diseases. The model is based on a Poisson or negative binomial observation model with two components: a parameter-driven component relates the disease incidence to latent parameters describing endemic seasonal patterns, which are typical for infectious disease surveillance data. An observation-driven or epidemic component is modeled with an autoregression on the number of cases at the previous time points. The autoregressive parameter is allowed to change over time according to a Bayesian changepoint model with unknown number of changepoints. Parameter estimates are obtained through the Bayesian model averaging using Markov chain Monte Carlo techniques. We illustrate our approach through analysis of simulated data and real notification data obtained from the German infectious disease surveillance system, administered by the Robert Koch Institute in Berlin. Software to fit the proposed model can be obtained from http://www.statistik.lmu.de/ approximately mhofmann/twins. 相似文献
3.
A common problem in molecular phylogenetics is choosing a model of DNA substitution that does a good job of explaining the DNA sequence alignment without introducing superfluous parameters. A number of methods have been used to choose among a small set of candidate substitution models, such as the likelihood ratio test, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and Bayes factors. Current implementations of any of these criteria suffer from the limitation that only a small set of models are examined, or that the test does not allow easy comparison of non-nested models. In this article, we expand the pool of candidate substitution models to include all possible time-reversible models. This set includes seven models that have already been described. We show how Bayes factors can be calculated for these models using reversible jump Markov chain Monte Carlo, and apply the method to 16 DNA sequence alignments. For each data set, we compare the model with the best Bayes factor to the best models chosen using AIC and BIC. We find that the best model under any of these criteria is not necessarily the most complicated one; models with an intermediate number of substitution types typically do best. Moreover, almost all of the models that are chosen as best do not constrain a transition rate to be the same as a transversion rate, suggesting that it is the transition/transversion rate bias that plays the largest role in determining which models are selected. Importantly, the reversible jump Markov chain Monte Carlo algorithm described here allows estimation of phylogeny (and other phylogenetic model parameters) to be performed while accounting for uncertainty in the model of DNA substitution. 相似文献
4.
In this paper we develop a Bayesian approach to parameter estimation in a stochastic spatio-temporal model of the spread of invasive species across a landscape. To date, statistical techniques, such as logistic and autologistic regression, have outstripped stochastic spatio-temporal models in their ability to handle large numbers of covariates. Here we seek to address this problem by making use of a range of covariates describing the bio-geographical features of the landscape. Relative to regression techniques, stochastic spatio-temporal models are more transparent in their representation of biological processes. They also explicitly model temporal change, and therefore do not require the assumption that the species' distribution (or other spatial pattern) has already reached equilibrium as is often the case with standard statistical approaches. In order to illustrate the use of such techniques we apply them to the analysis of data detailing the spread of an invasive plant, Heracleum mantegazzianum, across Britain in the 20th Century using geo-referenced covariate information describing local temperature, elevation and habitat type. The use of Markov chain Monte Carlo sampling within a Bayesian framework facilitates statistical assessments of differences in the suitability of different habitat classes for H. mantegazzianum, and enables predictions of future spread to account for parametric uncertainty and system variability. Our results show that ignoring such covariate information may lead to biased estimates of key processes and implausible predictions of future distributions. 相似文献
5.
A Bayesian multivariate adaptive regression spline fitting approach is used to model univariate and multivariate survival data with censoring. The possible models contain the proportional hazards model as a subclass and automatically detect departures from this. A reversible jump Markov chain Monte Carlo algorithm is described to obtain the estimate of the hazard function as well as the survival curve. 相似文献
6.
In this article, we consider the problem of the estimation of quantitative trait loci (QTL), those chromosomal regions at which genetic information affecting some quantitative trait is encoded. Generally the number of such encoding sites is unknown, and associations between neutral molecular marker genotypes and observed trait phenotypes are sought to locate them. We consider a Bayesian model for simple experimental designs, and discuss the existing approaches to inference for this problem. In particular, we focus on locating positions of the best candidate markers segregating for the trait, a situation which is of primary interest in comparative mapping. We introduce a loss function for estimating both the number of QTL and their location, and we illustrate its application via simulated and real data. 相似文献
7.
Many facets of neuromuscular activation patterns and control can be assessed via electromyography and are important for understanding the control of locomotion. After spinal cord injury, muscle activation patterns can affect locomotor recovery. We present a novel application of reversible jump Markov chain Monte Carlo simulation to estimate activation patterns from electromyographic data. We assume the data to be a zero-mean, heteroscedastic process. The variance is explicitly modeled using a step function. The number and location of points of discontinuity, or change-points, in the step function, the inter-change-point variances, and the overall mean are jointly modeled along with the mean and variance from baseline data. The number of change-points is considered a nuisance parameter and is integrated out of the posterior distribution. Whereas current methods of detecting activation patterns are deterministic or provide only point estimates, ours provides distributional estimates of muscle activation. These estimates, in turn, are used to estimate physiologically relevant quantities such as muscle coactivity, total integrated energy, and average burst duration and to draw valid statistical inferences about these quantities. 相似文献
8.
A stochastic epidemic model featuring fixed-length latent periods, gamma-distributed infectious periods and randomly varying heterogeneity among susceptibles is considered. A Markov chain Monte Carlo algorithm is developed for performing Bayesian inference for the parameters governing the infectious-period length and the hyper-parameters governing the heterogeneity of susceptibility. This method of analysis applies to a wider class of diseases than methods proposed previously. An application to smallpox data confirms results about heterogeneity suggested by an earlier analysis that relied on less realistic assumptions. 相似文献
9.
10.
Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability 总被引:16,自引:0,他引:16
Douady CJ Delsuc F Boucher Y Doolittle WF Douzery EJ 《Molecular biology and evolution》2003,20(2):248-254
Owing to the exponential growth of genome databases, phylogenetic trees are now widely used to test a variety of evolutionary hypotheses. Nevertheless, computation time burden limits the application of methods such as maximum likelihood nonparametric bootstrap to assess reliability of evolutionary trees. As an alternative, the much faster Bayesian inference of phylogeny, which expresses branch support as posterior probabilities, has been introduced. However, marked discrepancies exist between nonparametric bootstrap proportions and Bayesian posterior probabilities, leading to difficulties in the interpretation of sometimes strongly conflicting results. As an attempt to reconcile these two indices of node reliability, we apply the nonparametric bootstrap resampling procedure to the Bayesian approach. The correlation between posterior probabilities, bootstrap maximum likelihood percentages, and bootstrapped posterior probabilities was studied for eight highly diverse empirical data sets and were also investigated using experimental simulation. Our results show that the relation between posterior probabilities and bootstrapped maximum likelihood percentages is highly variable but that very strong correlations always exist when Bayesian node support is estimated on bootstrapped character matrices. Moreover, simulations corroborate empirical observations in suggesting that, being more conservative, the bootstrap approach might be less prone to strongly supporting a false phylogenetic hypothesis. Thus, apparent conflicts in topology recovered by the Bayesian approach were reduced after bootstrapping. Both posterior probabilities and bootstrap supports are of great interest to phylogeny as potential upper and lower bounds of node reliability, but they are surely not interchangeable and cannot be directly compared. 相似文献
11.
12.
The amino acid sequences of proteins provide rich information for inferring distant phylogenetic relationships and for predicting protein functions. Estimating the rate matrix of residue substitutions from amino acid sequences is also important because the rate matrix can be used to develop scoring matrices for sequence alignment. Here we use a continuous time Markov process to model the substitution rates of residues and develop a Bayesian Markov chain Monte Carlo method for rate estimation. We validate our method using simulated artificial protein sequences. Because different local regions such as binding surfaces and the protein interior core experience different selection pressures due to functional or stability constraints, we use our method to estimate the substitution rates of local regions. Our results show that the substitution rates are very different for residues in the buried core and residues on the solvent-exposed surfaces. In addition, the rest of the proteins on the binding surfaces also have very different substitution rates from residues. Based on these findings, we further develop a method for protein function prediction by surface matching using scoring matrices derived from estimated substitution rates for residues located on the binding surfaces. We show with examples that our method is effective in identifying functionally related proteins that have overall low sequence identity, a task known to be very challenging. 相似文献
13.
This paper presents a Bayesian analysis of a time series of counts to assess its dependence on an explanatory variable. The time series represented is the incidence of the infectious disease ESBL-producing Klebsiella pneumoniae in an Australian hospital and the explanatory variable is the number of grams of antibiotic (third generation) cephalosporin used during that time. We demonstrate that there is a statistically significant relationship between disease occurrence and use of the antibiotic, lagged by three months. The model used is a parameter-driven model in the form of a generalized linear mixed model. Comparison of models is made in terms of mean square error. 相似文献
14.
Bayesian analysis of factorial experiments by mixture modelling 总被引:3,自引:0,他引:3
15.
16.
Morris JS Wang N Lupton JR Chapkin RS Turner ND Hong M Carroll RJ 《Biostatistics (Oxford, England)》2002,3(4):529-546
This paper is concerned with modeling the architecture of colonic crypts and the implications of this modeling for understanding possible coordinated response of carcinogen-induced DNA damage between various regions of the colon. The methods we develop to address these two issues are applied to a particular important example in colon carcinogenesis. We cast the problem as an unusual and not previously studied hierarchical mixed-effects model characterized by completely missing covariates in units at a structurally base level, except for some randomly selected units. Information concerning the missing covariates is available through certain known ordering constraints and surrogate measures. Our methods use Bayesian machinery. We exploit the biological structure of this problem to generate the missing covariates simultaneously and efficiently at the base levels, as opposed to the naive practice of generating units at the base levels one-at-a-time with Metropolis-Hastings steps. We apply our methods to show that different regions of the colon have different architectures, and to estimate an important but non-standard function that measures the interrelationship of DNA damage mechanisms in different regions of the colon. 相似文献