首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Multistate Markov models are frequently used to characterize disease processes, but their estimation from longitudinal data is often hampered by complex patterns of incompleteness. Two algorithms for estimating Markov chain models in the case of intermittent missing data in longitudinal studies, a stochastic EM algorithm and the Gibbs sampler, are described. The first can be viewed as a random perturbation of the EM algorithm and is appropriate when the M step is straightforward but the E step is computationally burdensome. It leads to a good approximation of the maximum likelihood estimates. The Gibbs sampler is used for a full Bayesian inference. The performances of the two algorithms are illustrated on two simulated data sets. A motivating example concerned with the modelling of the evolution of parasitemia by Plasmodium falciparum (malaria) in a cohort of 105 young children in Cameroon is described and briefly analyzed.  相似文献   

2.
FAREWELL  V. T. 《Biometrika》1979,66(1):27-32
  相似文献   

3.
4.
Huggins R 《Biometrics》2000,56(2):537-545
In the study of longitudinal twin and family data, interest is often in the covariance structure of the data and the decomposition of this covariance structure into genetic and environmental components rather than in estimating the mean function. Various parametric models for covariance structures have been proposed but, e.g., in studies of children where growth spurts occur at various ages, it is difficult to a priori determine an appropriate parametric model for the covariance structure. In particular, there is a general lack of the visualization procedures, such as lowess, that are invaluable in the initial stages of constructing a parametric model for a mean function. Here we use kernel smoothing to modify a cross-sectional approach based on the sample covariance matrices to obtain smoothed estimates of the genetic and environmental variances and correlations for longitudinal twin data. The methods are proposed to be exploratory as an aid to parametric modeling rather than inferential, although approximate asymptotic standard errors are derived in the Appendix.  相似文献   

5.
On fitting Cox's proportional hazards models to survey data   总被引:8,自引:0,他引:8  
Lin  DY 《Biometrika》2000,87(1):37-47
  相似文献   

6.
7.
8.
9.
10.
S. Mandal  J. Qin  R.M. Pfeiffer 《Biometrics》2023,79(3):1701-1712
We propose and study a simple and innovative non-parametric approach to estimate the age-of-onset distribution for a disease from a cross-sectional sample of the population that includes individuals with prevalent disease. First, we estimate the joint distribution of two event times, the age of disease onset and the survival time after disease onset. We accommodate that individuals had to be alive at the time of the study by conditioning on their survival until the age at sampling. We propose a computationally efficient expectation–maximization (EM) algorithm and derive the asymptotic properties of the resulting estimates. From these joint probabilities we then obtain non-parametric estimates of the age-at-onset distribution by marginalizing over the survival time after disease onset to death. The method accommodates categorical covariates and can be used to obtain unbiased estimates of the covariate distribution in the source population. We show in simulations that our method performs well in finite samples even under large amounts of truncation for prevalent cases. We apply the proposed method to data from female participants in the Washington Ashkenazi Study to estimate the age-at-onset distribution of breast cancer associated with carrying BRCA1 or BRCA2 mutations.  相似文献   

11.
An easily implemented approach to fitting the proportional odds regression model to interval-censored data is presented. The approach is based on using conditional logistic regression routines in standard statistical packages. Using conditional logistic regression allows the practitioner to sidestep complications that attend estimation of the baseline odds ratio function. The approach is applicable both for interval-censored data in settings in which examinations continue regardless of whether the event of interest has occurred and for current status data. The methodology is illustrated through an application to data from an AIDS study of the effect of treatment with ZDV+ddC versus ZDV alone on 50% drop in CD4 cell count from baseline level. Simulations are presented to assess the accuracy of the procedure.  相似文献   

12.
In this work, we fit pattern-mixture models to data sets with responses that are potentially missing not at random (MNAR, Little and Rubin, 1987). In estimating the regression parameters that are identifiable, we use the pseudo maximum likelihood method based on exponential families. This procedure provides consistent estimators when the mean structure is correctly specified for each pattern, with further information on the variance structure giving an efficient estimator. The proposed method can be used to handle a variety of continuous and discrete outcomes. A test built on this approach is also developed for model simplification in order to improve efficiency. Simulations are carried out to compare the proposed estimation procedure with other methods. In combination with sensitivity analysis, our approach can be used to fit parsimonious semi-parametric pattern-mixture models to outcomes that are potentially MNAR. We apply the proposed method to an epidemiologic cohort study to examine cognition decline among elderly.  相似文献   

13.
Several different methods of analysis are applied to data consisting of weight measurements, taken at specified post-treatment times, of harvested thyroids from rats given one of four treatments. Previous studies of this type of data indicated that the growth is initially rapid, and that a second phase of less rapid growth is followed by a final phase in which little additional growth occurs. The data are further characterized by increasing variance through time. The primary purpose of the analysis is to study the effect of the treatments at the end of the study period. One-way analysis of variance tests among groups are performed on each day, but the results are not particularly helpful. However, results from two-way analyses of variance (over subsets of days and groups) are consistent with the three phase model and accordingly indicate significant group differences during each. Finally, maximum likelihood methods are used to fit a three part segmented linear regression model.  相似文献   

14.
Shin Y  Raudenbush SW 《Biometrics》2007,63(4):1262-1268
The development of model-based methods for incomplete data has been a seminal contribution to statistical practice. Under the assumption of ignorable missingness, one estimates the joint distribution of the complete data for thetainTheta from the incomplete or observed data y(obs). Many interesting models involve one-to-one transformations of theta. For example, with y(i) approximately N(mu, Sigma) for i= 1, ... , n and theta= (mu, Sigma), an ordinary least squares (OLS) regression model is a one-to-one transformation of theta. Inferences based on such a transformation are equivalent to inferences based on OLS using data multiply imputed from f(y(mis) | y(obs), theta) for missing y(mis). Thus, identification of theta from y(obs) is equivalent to identification of the regression model. In this article, we consider a model for two-level data with continuous outcomes where the observations within each cluster are dependent. The parameters of the hierarchical linear model (HLM) of interest, however, lie in a subspace of Theta in general. This identification of the joint distribution overidentifies the HLM. We show how to characterize the joint distribution so that its parameters are a one-to-one transformation of the parameters of the HLM. This leads to efficient estimation of the HLM from incomplete data using either the transformation method or the method of multiple imputation. The approach allows outcomes and covariates to be missing at either of the two levels, and the HLM of interest can involve the regression of any subset of variables on a disjoint subset of variables conceived as covariates.  相似文献   

15.
Fitting regression models to case-control data by maximum likelihood   总被引:3,自引:0,他引:3  
SCOTT  A. J.; WILD  C. J. 《Biometrika》1997,84(1):57-71
  相似文献   

16.
In this article least-squares- and maximum-likelihood-estimators in quantal response models are presented in a standardized terminology. The large sample properties are derived and the convergence properties of the Newton-Raphson- and the Scoring-method for the iterative solution of the likelihood equations are investigated. Finally, it is shown that the iterative least-squares estimation methods are equivalent to the method of scoring.  相似文献   

17.
18.
Logistic disease incidence models and case-control studies   总被引:8,自引:0,他引:8  
PRENTICE  R. L.; PYKE  R. 《Biometrika》1979,66(3):403-411
  相似文献   

19.
Kernel density estimation for length biased data   总被引:3,自引:0,他引:3  
JONES  M. C. 《Biometrika》1991,78(3):511-519
  相似文献   

20.
Most models for incomplete data are formulated within the selection model framework. This paper studies similarities and differences of modeling incomplete data within both selection and pattern-mixture settings. The focus is on missing at random mechanisms and on categorical data. Point and interval estimation is discussed. A comparison of both approaches is done on side effects in a psychiatric study.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号