共查询到20条相似文献,搜索用时 0 毫秒
1.
Yin G 《Biometrics》2005,61(2):552-558
Due to natural or artificial clustering, multivariate survival data often arise in biomedical studies, for example, a dental study involving multiple teeth from each subject. A certain proportion of subjects in the population who are not expected to experience the event of interest are considered to be \"cured\" or insusceptible. To model correlated or clustered failure time data incorporating a surviving fraction, we propose two forms of cure rate frailty models. One model naturally introduces frailty based on biological considerations while the other is motivated from the Cox proportional hazards frailty model. We formulate the likelihood functions based on piecewise constant hazards and derive the full conditional distributions for Gibbs sampling in the Bayesian paradigm. As opposed to the Cox frailty model, the proposed methods demonstrate great potential in modeling multivariate survival data with a cure fraction. We illustrate the cure rate frailty models with a root canal therapy data set. 相似文献
2.
Summary . We present a Bayesian approach to modeling dynamic smoking addiction behavior processes when cure is not directly observed due to censoring. Subject-specific probabilities model the stochastic transitions among three behavioral states: smoking, transient quitting, and permanent quitting (absorbent state). A multivariate normal distribution for random effects is used to account for the potential correlation among the subject-specific transition probabilities. Inference is conducted using a Bayesian framework via Markov chain Monte Carlo simulation. This framework provides various measures of subject-specific predictions, which are useful for policy-making, intervention development, and evaluation. Simulations are used to validate our Bayesian methodology and assess its frequentist properties. Our methods are motivated by, and applied to, the Alpha-Tocopherol, Beta-Carotene Lung Cancer Prevention study, a large (29,133 individuals) longitudinal cohort study of smokers from Finland. 相似文献
3.
4.
For multivariate failure time data, we propose a new class of shared gamma frailty models by imposing the Box-Cox transformation on the hazard function, and the product of the baseline hazard and the frailty. This novel class of models allows for a very broad range of shapes and relationships between the hazard and baseline hazard functions. It includes the well-known Cox gamma frailty model and a new additive gamma frailty model as two special cases. Due to the nonnegative hazard constraint, this shared gamma frailty model is computationally challenging in the Bayesian paradigm. The joint priors are constructed through a conditional-marginal specification, in which the conditional distribution is univariate, and it absorbs the nonlinear parameter constraints. The marginal part of the prior specification is free of constraints. The prior distributions allow us to easily compute the full conditionals needed for Gibbs sampling, while incorporating the constraints. This class of shared gamma frailty models is illustrated with a real dataset. 相似文献
5.
荒漠河岸林是塔里木极端干旱区唯一的森林群落类型,群落结构组成简单。我们采用种-面积曲线、群落系数-面积曲线与重要值-面积曲线3种方法对塔里木河上游荒漠河岸林灰胡杨(Populus pruinosa Schrenk)群落最小面积进行研究。结果显示,4种饱和种-面积曲线的拟合效果差异明显,S=aA/(1+bA)和S=a(1-e-bA)模型拟合的相关系数较高(P < 0.01),拟合效果优于其它2种模型。当取样面积为100 m2时,可包括群落60%~80%的物种数,取样面积为200 m2时,则可包括群落90%的物种数。群落系数-面积曲线与重要值-面积曲线2种方法确定的群落最小面积均为400 m2,二者克服了种-面积曲线仅关注物种出现与否和饱和种的估算问题,拟合结果更符合实际情况,更适用于塔里木荒漠河岸林最小取样面积的确定。本研究结果表明荒漠河岸林包括60%、80%、90%群落物种的临界抽样面积分别为100 m2、200 m2和400 m 2,可以满足不同研究精度的要求;塔里木河上游荒漠河岸林灰胡杨群落学调查的最适取样面积是400 m2。 相似文献
6.
We describe a Bayesian quantile regression model that uses a confirmatory factor structure for part of the design matrix. This model is appropriate when the covariates are indicators of scientifically determined latent factors, and it is these latent factors that analysts seek to include as predictors in the quantile regression. We apply the model to a study of birth weights in which the effects of latent variables representing psychosocial health and actual tobacco usage on the lower quantiles of the response distribution are of interest. The models can be fit using an R package called factorQR. 相似文献
7.
Yang Z 《Philosophical transactions of the Royal Society of London. Series B, Biological sciences》2008,363(1512):4031-4039
The Bayesian method of phylogenetic inference often produces high posterior probabilities (PPs) for trees or clades, even when the trees are clearly incorrect. The problem appears to be mainly due to large sizes of molecular datasets and to the large-sample properties of Bayesian model selection and its sensitivity to the prior when several of the models under comparison are nearly equally correct (or nearly equally wrong) and are of the same dimension. A previous suggestion to alleviate the problem is to let the internal branch lengths in the tree become increasingly small in the prior with the increase in the data size so that the bifurcating trees are increasingly star-like. In particular, if the internal branch lengths are assigned the exponential prior, the prior mean mu0 should approach zero faster than 1/square root n but more slowly than 1/n, where n is the sequence length. This paper examines the usefulness of this data size-dependent prior using a dataset of the mitochondrial protein-coding genes from the baleen whales, with the prior mean fixed at mu0=0.1n(-2/3). In this dataset, phylogeny reconstruction is sensitive to the assumed evolutionary model, species sampling and the type of data (DNA or protein sequences), but Bayesian inference using the default prior attaches high PPs for conflicting phylogenetic relationships. The data size-dependent prior alleviates the problem to some extent, giving weaker support for unstable relationships. This prior may be useful in reducing apparent conflicts in the results of Bayesian analysis or in making the method less sensitive to model violations. 相似文献
8.
In many cell types, the inositol trisphosphate receptor (IPR) is one of the important components that control intracellular calcium dynamics, and an understanding of this receptor (which is also a calcium channel) is necessary for an understanding of calcium oscillations and waves. Recent advances in experimental techniques now allow for the measurement of single-channel activity of the IPR in conditions similar to its native environment, and these data can be used to determine the rate constants in Markov models of the IPR. We illustrate a parameter estimation method based on Markov chain Monte Carlo, which can be used to fit directly to single-channel data, and determining, as an intrinsic part of the fit, the times at which the IPR is opening and closing. We show, using simulated data, the most complex Markov model that can be unambiguously determined from steady-state data and show that non-steady-state data is required to determine more complex models. 相似文献
9.
The positive and negative predictive values are standard ways of quantifying predictive accuracy when both the outcome and the prognostic factor are binary. Methods for comparing the predictive values of two or more binary factors have been discussed previously (Leisenring et al., 2000, Biometrics 56, 345-351). We propose extending the standard definitions of the predictive values to accommodate prognostic factors that are measured on a continuous scale and suggest a corresponding graphical method to summarize predictive accuracy. Drawing on the work of Leisenring et al. we make use of a marginal regression framework and discuss methods for estimating these predictive value functions and their differences within this framework. The methods presented in this paper have the potential to be useful in a number of areas including the design of clinical trials and health policy analysis. 相似文献
10.
The extent to which natural selection shapes diversity within populations is a key question for population genetics. Thus, there is considerable interest in quantifying the strength of selection. A full likelihood approach for inference about selection at a single site within an otherwise neutral fully linked sequence of sites is described here. A coalescent model of evolution is used to model the ancestry of a sample of DNA sequences which have the selected site segregating. The mutation model, for the selected and neutral sites, is the infinitely many-sites model where there is no back or parallel mutation at sites. A unique perfect phylogeny, a gene tree, can be constructed from the configuration of mutations on the sample sequences under this model of mutation. The approach is general and can be used for any bi-allelic selection scheme. Selection is incorporated through modelling the frequency of the selected and neutral allelic classes stochastically back in time, then using a subdivided population model considering the population frequencies through time as variable population sizes. An importance sampling algorithm is then used to explore over coalescent tree space consistent with the data. The method is applied to a simulated data set and the gene tree presented in Verrelli et al. (2002). 相似文献
11.
Stephens and Donnelly have introduced a simple yet powerful importance sampling scheme for computing the likelihood in population genetic models. Fundamental to the method is an approximation to the conditional probability of the allelic type of an additional gene, given those currently in the sample. As noted by Li and Stephens, the product of these conditional probabilities for a sequence of draws that gives the frequency of allelic types in a sample is an approximation to the likelihood, and can be used directly in inference. The aim of this note is to demonstrate the high level of accuracy of "product of approximate conditionals" (PAC) likelihood when used with microsatellite data. Results obtained on simulated microsatellite data show that this strategy leads to a negligible bias over a wide range of the scaled mutation parameter theta. Furthermore, the sampling variance of likelihood estimates as well as the computation time are lower than that obtained with importance sampling on the whole range of theta. It follows that this approach represents an efficient substitute to IS algorithms in computer intensive (e.g. MCMC) inference methods in population genetics. 相似文献
12.
High throughput technologies, such as gene expression arrays and protein mass spectrometry, allow one to simultaneously evaluate thousands of potential biomarkers that could distinguish different tissue types. Of particular interest here is distinguishing between cancerous and normal organ tissues. We consider statistical methods to rank genes (or proteins) in regards to differential expression between tissues. Various statistical measures are considered, and we argue that two measures related to the Receiver Operating Characteristic Curve are particularly suitable for this purpose. We also propose that sampling variability in the gene rankings be quantified, and suggest using the \"selection probability function,\" the probability distribution of rankings for each gene. This is estimated via the bootstrap. A real dataset, derived from gene expression arrays of 23 normal and 30 ovarian cancer tissues, is analyzed. Simulation studies are also used to assess the relative performance of different statistical gene ranking measures and our quantification of sampling variability. Our approach leads naturally to a procedure for sample-size calculations, appropriate for exploratory studies that seek to identify differentially expressed genes. 相似文献
13.
Multiple shrinkage and subset selection in wavelets 总被引:6,自引:0,他引:6
14.
Cigarette smoking has long been viewed as a means to control body weight. However, studies on the association between smoking cessation and weight gain have reported mixed findings and, notably, there is limited evidence among the Chinese population – the world’s largest smoker population. The extent to which smoking cessation is positively associated with body weight is of interest as excessive weight gain contributes to heart disease, diabetes, hypertension, musculoskeletal disorders, and some cancers. Additionally, concerns over weight gain may dissuade current smokers from quitting. Using data from the China Health and Nutrition Survey (CHNS), we examine the association between smoking cessation and body weight in China. To account for the nonrandom nature of smoking cessation, our research design relies on within-individual variation in smoking status to remove the influence of time-invariant unobserved differences across individuals that are correlated with both cessation and body weight. We find that smoking cessation is associated with a modest increase in weight (0.329 kg, 0.51 % off the mean) and no significant changes in the prevalence of overweight or obesity. 相似文献
15.
Cancer survivors benefit from evidence-based smoking cessation treatment. A crucial first step in this process is a clinician recommending that the patient quit smoking. However, contemporary delivery of advice to quit among patients with cancer is not well known. In a cross-sectional analysis of all adult smokers included in a prospective population-representative study of US adults, we analyzed the frequency that patients reported receiving advice to quit smoking from a healthcare professional according to reported cancer history (no cancer, tobacco-related cancer, non-tobacco related cancer history). Among an estimated 28.3 million smokers, 9.3% reported a history of cancer, 48.8% of which were tobacco-related cancers. In general, advice to quit was reported by more (67.8%) cancer survivors than those adults without any cancer (56.0%). After adjustment for sociodemographic factors, smokers with a non tobacco-related cancer (0.51, 95% CI 0.32–0.83) and those without any cancer history (0.43, 95% CI 0.30–0.63) were both less likely to report being advised to quit smoking than patients with a tobacco-related cancer history. 相似文献
16.
Getachew A. Dagne 《Biometrical journal. Biometrische Zeitschrift》2004,46(6):653-663
This article presents two‐component hierarchical Bayesian models which incorporate both overdispersion and excess zeros. The components may be resultants of some intervention (treatment) that changes the rare event generating process. The models are also expanded to take into account any heterogeneity that may exist in the data. Details of the model fitting, checking and selecting alternative models from a Bayesian perspective are also presented. The proposed methods are applied to count data on the assessment of an efficacy of pesticides in controlling the reproduction of whitefly. (© 2004 WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim) 相似文献
17.
Summary This article presents some statistical methods for estimating the parameters of a population dynamics model for annual plants. The model takes account of reproduction, immigration, seed survival in a seed bank, and plant growth. The data consist of the number of plants in several developmental stages that were measured in a number of populations for a few consecutive years; they are incomplete since seeds could not be counted. It is assumed that there are no measurement errors or that measurement errors are binomial and not frequent. Some statistical methods are developed within the framework of estimating equations or Bayesian inference. These methods are applied to oilseed rape data. 相似文献
18.
Individual based models (IBMs) and Agent based models (ABMs) have become widely used tools to understand complex biological systems. However, general methods of parameter inference for IBMs are not available. In this paper we show that it is possible to address this problem with a traditional likelihood-based approach, using an example of an IBM developed to describe the spread of chytridiomycosis in a population of frogs as a case study. We show that if the IBM satisfies certain criteria we can find the likelihood (or posterior) analytically, and use standard computational techniques, such as MCMC, for parameter inference. 相似文献
19.
G.J.M. Rosa C.R. Padovani D. Gianola 《Biometrical journal. Biometrische Zeitschrift》2003,45(5):573-590
Linear mixed effects models have been widely used in analysis of data where responses are clustered around some random effects, so it is not reasonable to assume independence between observations in the same cluster. In most biological applications, it is assumed that the distributions of the random effects and of the residuals are Gaussian. This makes inferences vulnerable to the presence of outliers. Here, linear mixed effects models with normal/independent residual distributions for robust inferences are described. Specific distributions examined include univariate and multivariate versions of the Student‐ t, the slash and the contaminated normal. A Bayesian framework is adopted and Markov chain Monte Carlo is used to carry out the posterior analysis. The procedures are illustrated using birth weight data on rats in a toxicological experiment. Results from the Gaussian and robust models are contrasted, and it is shown how the implementation can be used for outlier detection. The thick‐tailed distributions provide an appealing robust alternative to the Gaussian process in linear mixed models, and they are easily implemented using data augmentation and MCMC techniques. 相似文献
20.
Zhang Z Zheng G Pivavarchyk M Deaciuc AG Dwoskin LP Crooks PA 《Bioorganic & medicinal chemistry letters》2011,21(1):88-91
A series of tertiary amine analogs derived from lead azaaromatic quaternary ammonium salts has been designed and synthesized. The preliminary structure-activity relationships of these new analogs suggest that such tertiary amine analogs, which potently inhibit nicotine-evoked dopamine release from rat striatum, represent drug-like inhibitors of α6-containing nicotinic acetylcholine receptors. The bis-tertiary amine analog 7 exhibited an IC50 of 0.95 nM, while the tris-tertiary amine analog 19 had an IC50 of 0.35 nM at nAChRs mediating nicotine-evoked dopamine release. 相似文献