首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Yin G 《Biometrics》2005,61(2):552-558
Due to natural or artificial clustering, multivariate survival data often arise in biomedical studies, for example, a dental study involving multiple teeth from each subject. A certain proportion of subjects in the population who are not expected to experience the event of interest are considered to be "cured" or insusceptible. To model correlated or clustered failure time data incorporating a surviving fraction, we propose two forms of cure rate frailty models. One model naturally introduces frailty based on biological considerations while the other is motivated from the Cox proportional hazards frailty model. We formulate the likelihood functions based on piecewise constant hazards and derive the full conditional distributions for Gibbs sampling in the Bayesian paradigm. As opposed to the Cox frailty model, the proposed methods demonstrate great potential in modeling multivariate survival data with a cure fraction. We illustrate the cure rate frailty models with a root canal therapy data set.  相似文献   

2.
Yin G  Ibrahim JG 《Biometrics》2005,61(1):208-216
For multivariate failure time data, we propose a new class of shared gamma frailty models by imposing the Box-Cox transformation on the hazard function, and the product of the baseline hazard and the frailty. This novel class of models allows for a very broad range of shapes and relationships between the hazard and baseline hazard functions. It includes the well-known Cox gamma frailty model and a new additive gamma frailty model as two special cases. Due to the nonnegative hazard constraint, this shared gamma frailty model is computationally challenging in the Bayesian paradigm. The joint priors are constructed through a conditional-marginal specification, in which the conditional distribution is univariate, and it absorbs the nonlinear parameter constraints. The marginal part of the prior specification is free of constraints. The prior distributions allow us to easily compute the full conditionals needed for Gibbs sampling, while incorporating the constraints. This class of shared gamma frailty models is illustrated with a real dataset.  相似文献   

3.
Ripatti S  Palmgren J 《Biometrics》2000,56(4):1016-1022
There exists a growing literature on the estimation of gamma distributed multiplicative shared frailty models. There is, however, often a need to model more complicated frailty structures, but attempts to extend gamma frailties run into complications. Motivated by hip replacement data with a more complicated dependence structure, we propose a model based on multiplicative frailties with a multivariate log-normal joint distribution. We give a justification and an estimation procedure for this generally structured frailty model, which is a generalization of the one presented by McGilchrist (1993, Biometrics 49, 221-225). The estimation is based on Laplace approximation of the likelihood function. This leads to estimating equations based on a penalized fixed effects partial likelihood, where the marginal distribution of the frailty terms determines the penalty term. The tuning parameters of the penalty function, i.e., the frailty variances, are estimated by maximizing an approximate profile likelihood. The performance of the approximation is evaluated by simulation, and the frailty model is fitted to the hip replacement data.  相似文献   

4.
5.
6.
Individual based models (IBMs) and Agent based models (ABMs) have become widely used tools to understand complex biological systems. However, general methods of parameter inference for IBMs are not available. In this paper we show that it is possible to address this problem with a traditional likelihood-based approach, using an example of an IBM developed to describe the spread of chytridiomycosis in a population of frogs as a case study. We show that if the IBM satisfies certain criteria we can find the likelihood (or posterior) analytically, and use standard computational techniques, such as MCMC, for parameter inference.  相似文献   

7.
Spatial weed count data are modeled and predicted using a generalized linear mixed model combined with a Bayesian approach and Markov chain Monte Carlo. Informative priors for a data set with sparse sampling are elicited using a previously collected data set with extensive sampling. Furthermore, we demonstrate that so-called Langevin-Hastings updates are useful for efficient simulation of the posterior distributions, and we discuss computational issues concerning prediction.  相似文献   

8.
Yin G  Li Y  Ji Y 《Biometrics》2006,62(3):777-787
A Bayesian adaptive design is proposed for dose-finding in phase I/II clinical trials to incorporate the bivariate outcomes, toxicity and efficacy, of a new treatment. Without specifying any parametric functional form for the drug dose-response curve, we jointly model the bivariate binary data to account for the correlation between toxicity and efficacy. After observing all the responses of each cohort of patients, the dosage for the next cohort is escalated, deescalated, or unchanged according to the proposed odds ratio criteria constructed from the posterior toxicity and efficacy probabilities. A novel class of prior distributions is proposed through logit transformations which implicitly imposes a monotonic constraint on dose toxicity probabilities and correlates the probabilities of the bivariate outcomes. We conduct simulation studies to evaluate the operating characteristics of the proposed method. Under various scenarios, the new Bayesian design based on the toxicity-efficacy odds ratio trade-offs exhibits good properties and treats most patients at the desirable dose levels. The method is illustrated with a real trial design for a breast medical oncology study.  相似文献   

9.
In randomized clinical trials where the times to event of two treatment groups are compared under a proportional hazards assumption, it has been established that omitting prognostic factors from the model entails an underestimation of the hazards ratio. Heterogeneity due to unobserved covariates in cancer patient populations is a concern since genomic investigations have revealed molecular and clinical heterogeneity in these populations. In HIV prevention trials, heterogeneity is unavoidable and has been shown to decrease the treatment effect over time. This article assesses the influence of trial duration on the bias of the estimated hazards ratio resulting from omitting covariates from the Cox analysis. The true model is defined by including an unobserved random frailty term in the individual hazard that reflects the omitted covariate. Three frailty distributions are investigated: gamma, log‐normal, and binary, and the asymptotic bias of the hazards ratio estimator is calculated. We show that the attenuation of the treatment effect resulting from unobserved heterogeneity strongly increases with trial duration, especially for continuous frailties that are likely to reflect omitted covariates, as they are often encountered in practice. The possibility of interpreting the long‐term decrease in treatment effects as a bias induced by heterogeneity and trial duration is illustrated by a trial in oncology where adjuvant chemotherapy in stage 1B NSCLC was investigated.  相似文献   

10.
At the breeding grounds of most baleen whales the patchiness and gaps in spatial distribution results from interactions between behavior patterns and environmental conditions. We evaluated the influence of environmental factors (bathymetry and distance from shore with quadratic terms, and wind speed), effort, and spatial autocorrelation effects to predict humpback whale group density in the Southwest Atlantic Ocean. Count data of groups by grid cells were fitted with conditional autoregressive models (CAR). Bayesian inference was performed via integrated nested Laplace approximation. The best‐fit model contained distance from shore and its quadratic term, bathymetry, and the autoregressive component. Occupancy probability was high for the Abrolhos Bank, some cells from the northeast continental shelf and southeast margin, but gaps in occurrence were identified. High densities were estimated in the east continental margin, with the highest density in the Abrolhos Bank, in some cells of the northeast continental margin and in the southernmost area. We report that intermediate distances from the coast, and shallow waters were preferred for breeding and calving activities. We suggest that CAR models may incorporate aggregation mechanisms into habitat modeling and may provide advances in marine mammal analyses by accounting for residual autocorrelation.  相似文献   

11.
k independent samples 0, 1, 2,…,k – 1 (or stages 0, 1,…. k – 1) are available from the exponential population. Each of k – 1 samples, except the sample 0 are censored both above and below. Restricted range is defined as the difference between largest and the smallest among the available observations. Prediction of this restricted range at stage k (in the future sample k) on the basis of the restricted ranges at earlier 1, 2,…k – 1 stages is attempted by obtaining the predictive distribution of this range at stage k. Probability that the variance at stage k is smaller than that at stage k – 1, is evaluated along with the probability integral. An illustrative example is given with simulated samples.  相似文献   

12.
  • 1 The present study aimed to propose a method that can improve our understanding of pest outbreaks and spatio‐temporal development in greenhouse crops.
  • 2 The experiment was carried out in a greenhouse rose crop grown under integrated pest management (IPM) for 21 months. The main pests observed were powdery mildew, two‐spotted spider mites and western flower thrips. A quick visual sampling method was established to provide continuous monitoring of overall crop health.
  • 3 A Bayesian inferential approach was then used to analyse temporal and spatial heterogeneity in the occurrence of pests. Interactions between pest dynamics and properties of spatial evolutions were exhibited revealing the influence of biotic and abiotic factors on crop health.
  • 4 In the context of IPM, this information could be used to improve monitoring strategies by identifying periods or locations at risk. It could also facilitate the implementation of the whole IPM procedure through the identification of key factors that have a negative impact on overall crop health.
  相似文献   

13.
The objectives of this study were to 1) compare four models for breeding value prediction using genomic or pedigree information and 2) evaluate the impact of fixed effects that account for family structure. Comparisons were made in a Nellore-Angus population comprising F2, F3 and half-siblings to embryo transfer F2 calves with records for overall temperament at weaning (TEMP; n = 769) and Warner-Bratzler shear force (WBSF; n = 387). After quality control, there were 34,913 whole genome SNP markers remaining. Bayesian methods employed were BayesB (π̃ = 0.995 or 0.997 for WBSF or TEMP, respectively) and BayesC (π = 0 and π̃), where π̃ is the ideal proportion of markers not included. Direct genomic values (DGV) from single trait Bayesian analyses were compared to conventional pedigree-based animal model breeding values. Numerically, BayesC procedures (using π̃) had the highest accuracy of all models for WBSF and TEMP (ρ̂ = 0.843 and 0.923, respectively), but BayesB had the least bias (regression of performance on prediction closest to 1, β̂y,x = 2.886 and 1.755, respectively). Accounting for family structure decreased accuracy and increased bias in prediction of DGV indicating a detrimental impact when used in these prediction methods that simultaneously fit many markers.  相似文献   

14.
A Gaussian-threshold model is described under the general framework of structural equation models for inferring simultaneous and recursive relationships between binary and Gaussian characters, and estimating genetic parameters. Relationships between clinical mastitis (CM) and test-day milk yield (MY) in first-lactation Norwegian Red cows were examined using a recursive Gaussian-threshold model. For comparison, the data were also analyzed using a standard Gaussian-threshold, a multivariate linear model, and a recursive multivariate linear model. The first 180 days of lactation were arbitrarily divided into three periods of equal length, in order to investigate how these relationships evolve in the course of lactation. The recursive model showed negative within-period effects from (liability to) CM to test-day MY in all three lactation periods, and positive between-period effects from test-day MY to (liability to) CM in the following period. Estimates of recursive effects and of genetic parameters were time-dependent. The results suggested unfavorable effects of production on liability to mastitis, and dynamic relationships between mastitis and test-dayMYin the course of lactation. Fitting recursive effects had little influence on the estimation of genetic parameters. However, some differences were found in the estimates of heritability, genetic, and residual correlations, using different types of models (Gaussian-threshold vs. multivariate linear).  相似文献   

15.
Here we focus on discrimination problems where the number of predictors substantially exceeds the sample size and we propose a Bayesian variable selection approach to multinomial probit models. Our method makes use of mixture priors and Markov chain Monte Carlo techniques to select sets of variables that differ among the classes. We apply our methodology to a problem in functional genomics using gene expression profiling data. The aim of the analysis is to identify molecular signatures that characterize two different stages of rheumatoid arthritis.  相似文献   

16.
ObjectiveTo identify clinicopathologic factors predictive of early relapse (platinum-free interval (PFI) of ≤6 months) in advanced epithelial ovarian cancer (EOC) in first-line treatment, and to develop and internally validate risk prediction models for early relapse.MethodsAll consecutive patients diagnosed with advanced stage EOC between 01-01-2008 and 31-12-2015 were identified from the Netherlands Cancer Registry. Patients who underwent cytoreductive surgery and platinum-based chemotherapy as initial EOC treatment were selected. Two prediction models, i.e. pretreatment and postoperative, were developed. Candidate predictors of early relapse were fitted into multivariable logistic regression models. Model performance was assessed on calibration and discrimination. Internal validation was performed through bootstrapping to correct for model optimism.ResultsA total of 4,557 advanced EOC patients were identified, including 1,302 early relapsers and 3,171 late or non-relapsers. Early relapsers were more likely to have FIGO stage IV, mucinous or clear cell type EOC, ascites, >1 cm residual disease, and to have undergone NACT-ICS. The final pretreatment model demonstrated subpar model performance (AUC = 0.64 [95 %-CI 0.62−0.66]). The final postoperative model based on age, FIGO stage, pretreatment CA-125 level, histologic subtype, presence of ascites, treatment approach, and residual disease after debulking, demonstrated adequate model performance (AUC = 0.72 [95 %-CI 0.71−0.74]). Bootstrap validation revealed minimal optimism of the final postoperative model.ConclusionA (postoperative) discriminative model has been developed and presented online that predicts the risk of early relapse in advanced EOC patients. Although external validation is still required, this prediction model can support patient counselling in daily clinical practice.  相似文献   

17.
18.
The investigation of individual heterogeneity in vital rates has recently received growing attention among population ecologists. Individual heterogeneity in wild animal populations has been accounted for and quantified by including individually varying effects in models for mark–recapture data, but the real need for underlying individual effects to account for observed levels of individual variation has recently been questioned by the work of Tuljapurkar et al. (Ecology Letters, 12, 93, 2009) on dynamic heterogeneity. Model‐selection approaches based on information criteria or Bayes factors have been used to address this question. Here, we suggest that, in addition to model‐selection, model‐checking methods can provide additional important insights to tackle this issue, as they allow one to evaluate a model's misfit in terms of ecologically meaningful measures. Specifically, we propose the use of posterior predictive checks to explicitly assess discrepancies between a model and the data, and we explain how to incorporate model checking into the inferential process used to assess the practical implications of ignoring individual heterogeneity. Posterior predictive checking is a straightforward and flexible approach for performing model checks in a Bayesian framework that is based on comparisons of observed data to model‐generated replications of the data, where parameter uncertainty is incorporated through use of the posterior distribution. If discrepancy measures are chosen carefully and are relevant to the scientific context, posterior predictive checks can provide important information allowing for more efficient model refinement. We illustrate this approach using analyses of vital rates with long‐term mark–recapture data for Weddell seals and emphasize its utility for identifying shortfalls or successes of a model at representing a biological process or pattern of interest.  相似文献   

19.
Simulated data were used to investigate the influence of the choice of priors on estimation of genetic parameters in multivariate threshold models using Gibbs sampling. We simulated additive values, residuals and fixed effects for one continuous trait and liabilities of four binary traits, and QTL effects for one of the liabilities. Within each of four replicates six different datasets were generated which resembled different practical scenarios in horses with respect to number and distribution of animals with trait records and availability of QTL information. (Co)Variance components were estimated using a Bayesian threshold animal model via Gibbs sampling. The Gibbs sampler was implemented with both a flat and a proper prior for the genetic covariance matrix. Convergence problems were encountered in > 50% of flat prior analyses, with indications of potential or near posterior impropriety between about round 10 000 and 100 000. Terminations due to non-positive definite genetic covariance matrix occurred in flat prior analyses of the smallest datasets. Use of a proper prior resulted in improved mixing and convergence of the Gibbs chain. In order to avoid (near) impropriety of posteriors and extremely poorly mixing Gibbs chains, a proper prior should be used for the genetic covariance matrix when implementing the Gibbs sampler.  相似文献   

20.
Simulated data were used to determine the properties of multivariate prediction of breeding values for categorical and continuous traits using phenotypic, molecular genetic and pedigree information by mixed linear-threshold animal models via Gibbs sampling. Simulation parameters were chosen such that the data resembled situations encountered in Warmblood horse populations. Genetic evaluation was performed in the context of the radiographic findings in the equine limbs. The simulated pedigree comprised seven generations and 40 000 animals per generation. The simulated data included additive genetic values, residuals and fixed effects for one continuous trait and liabilities of four binary traits. For one of the binary traits, quantitative trait locus (QTL) effects and genetic markers were simulated, with three different scenarios with respect to recombination rate (r) between genetic markers and QTL and polymorphism information content (PIC) of genetic markers being studied: r = 0.00 and PIC = 0.90 (r0p9), r = 0.01 and PIC = 0.90 (r1p9), and r = 0.00 and PIC = 0.70 (r0p7). For each scenario, 10 replicates were sampled from the simulated horse population, and six different data sets were generated per replicate. Data sets differed in number and distribution of animals with trait records and the availability of genetic marker information. Breeding values were predicted via Gibbs sampling using a Bayesian mixed linear-threshold animal model with residual covariances fixed to zero and a proper prior for the genetic covariance matrix. Relative breeding values were used to investigate expected response to multi- and single-trait selection. In the sires with 10 or more offspring with trait information, correlations between true and predicted breeding values ranged between 0.89 and 0.94 for the continuous traits and between 0.39 and 0.77 for the binary traits. Proportions of successful identification of sires of average, favourable and unfavourable genetic value were 81% to 86% for the continuous trait and 57% to 74% for the binary traits in these sires. Expected decrease of prevalence of the QTL trait was 3% to 12% after multi-trait selection for all binary traits and 9% to 17% after single-trait selection for the QTL trait. The combined use of phenotype and genotype data was superior to the use of phenotype data alone. It was concluded that information on phenotypes and highly informative genetic markers should be used for prediction of breeding values in mixed linear-threshold animal models via Gibbs sampling to achieve maximum reduction in prevalences of binary traits.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号