共查询到20条相似文献,搜索用时 15 毫秒
1.
In many observational studies, individuals are measured repeatedly over time, although not necessarily at a set of prespecified occasions. Instead, individuals may be measured at irregular intervals, with those having a history of poorer health outcomes being measured with somewhat greater frequency and regularity; i.e., those individuals with poorer health outcomes may have more frequent follow-up measurements and the intervals between their repeated measurements may be shorter. In this article, we consider estimation of regression parameters in models for longitudinal data where the follow-up times are not fixed by design but can depend on previous outcomes. In particular, we focus on general linear models for longitudinal data where the repeated measures are assumed to have a multivariate Gaussian distribution. We consider assumptions regarding the follow-up time process that result in the likelihood function separating into two components: one for the follow-up time process, the other for the outcome process. The practical implication of this separation is that the former process can be ignored when making likelihood-based inferences about the latter; i.e., maximum likelihood (ML) estimation of the regression parameters relating the mean of the longitudinal outcomes to covariates does not require that a model for the distribution of follow-up times be specified. As a result, standard statistical software, e.g., SAS PROC MIXED (Littell et al., 1996, SAS System for Mixed Models), can be used to analyze the data. However, we also demonstrate that misspecification of the model for the covariance among the repeated measures will, in general, result in regression parameter estimates that are biased. Furthermore, results of a simulation study indicate that the potential bias due to misspecification of the covariance can be quite considerable in this setting. Finally, we illustrate these results using data from a longitudinal observational study (Lipshultz et al., 1995, New England Journal of Medicine 332, 1738-1743) that explored the cardiotoxic effects of doxorubicin chemotherapy for the treatment of acute lymphoblastic leukemia in children. 相似文献
2.
Fitzmaurice GM Lipsitz SR Ibrahim JG Gelber R Lipshultz S 《Biostatistics (Oxford, England)》2006,7(3):469-485
In many observational studies, individuals are measured repeatedly over time, although not necessarily at a set of pre-specified occasions. Instead, individuals may be measured at irregular intervals, with those having a history of poorer health outcomes being measured with somewhat greater frequency and regularity. In this paper, we consider likelihood-based estimation of the regression parameters in marginal models for longitudinal binary data when the follow-up times are not fixed by design, but can depend on previous outcomes. In particular, we consider assumptions regarding the follow-up time process that result in the likelihood function separating into two components: one for the follow-up time process, the other for the outcome measurement process. The practical implication of this separation is that the follow-up time process can be ignored when making likelihood-based inferences about the marginal regression model parameters. That is, maximum likelihood (ML) estimation of the regression parameters relating the probability of success at a given time to covariates does not require that a model for the distribution of follow-up times be specified. However, to obtain consistent parameter estimates, the multinomial distribution for the vector of repeated binary outcomes must be correctly specified. In general, ML estimation requires specification of all higher-order moments and the likelihood for a marginal model can be intractable except in cases where the number of repeated measurements is relatively small. To circumvent these difficulties, we propose a pseudolikelihood for estimation of the marginal model parameters. The pseudolikelihood uses a linear approximation for the conditional distribution of the response at any occasion, given the history of previous responses. The appeal of this approximation is that the conditional distributions are functions of the first two moments of the binary responses only. When the follow-up times depend only on the previous outcome, the pseudolikelihood requires correct specification of the conditional distribution of the current outcome given the outcome at the previous occasion only. Results from a simulation study and a study of asymptotic bias are presented. Finally, we illustrate the main results using data from a longitudinal observational study that explored the cardiotoxic effects of doxorubicin chemotherapy for the treatment of acute lymphoblastic leukemia in children. 相似文献
3.
Márcia A. C. Macera Francisco Louzada Vicente G. Cancho Cor J. F. Fontes 《Biometrical journal. Biometrische Zeitschrift》2015,57(2):201-214
In this paper, we introduce a new model for recurrent event data characterized by a baseline rate function fully parametric, which is based on the exponential‐Poisson distribution. The model arises from a latent competing risk scenario, in the sense that there is no information about which cause was responsible for the event occurrence. Then, the time of each recurrence is given by the minimum lifetime value among all latent causes. The new model has a particular case, which is the classical homogeneous Poisson process. The properties of the proposed model are discussed, including its hazard rate function, survival function, and ordinary moments. The inferential procedure is based on the maximum likelihood approach. We consider an important issue of model selection between the proposed model and its particular case by the likelihood ratio test and score test. Goodness of fit of the recurrent event models is assessed using Cox‐Snell residuals. A simulation study evaluates the performance of the estimation procedure in the presence of a small and moderate sample sizes. Applications on two real data sets are provided to illustrate the proposed methodology. One of them, first analyzed by our team of researchers, considers the data concerning the recurrence of malaria, which is an infectious disease caused by a protozoan parasite that infects red blood cells. 相似文献
4.
We examine issues in estimating population size N with capture-recapture models when there is variable catchability among subjects. We focus on a logistic-normal mixed model, for which the logit of the probability of capture is an additive function of a random subject and a fixed sampling occasion parameter. When the probability of capture is small or the degree of heterogeneity is large, the log-likelihood surface is relatively flat and it is difficult to obtain much information about N. We also discuss a latent class model and a log-linear model that account for heterogeneity and show that the log-linear model has greater scope. Models assuming homogeneity provide much narrower intervals for N but are usually highly overly optimistic, the actual coverage probability being much lower than the nominal level. 相似文献
5.
A generalized mover-stayer model for panel data 总被引:1,自引:0,他引:1
A generalized mover-stayer model is described for conditionally Markov processes under panel observation. Marginally the model represents a mixture of nested continuous-time Markov processes in which sub-models are defined by constraining some transition intensities to zero between two or more states of a full model. A Fisher scoring algorithm is described which facilitates maximum likelihood estimation based only on the first derivatives of the transition probability matrices. The model is fit to data from a smoking prevention study and is shown to provide a significant improvement in fit over a time-homogeneous Markov model. Extensions are developed which facilitate examination of covariate effects on both the transition intensities and the mover-stayer probabilities. 相似文献
6.
Gray RJ 《Biometrics》2000,56(2):571-576
An estimator of the regression parameters in a semiparametric transformed linear survival model is examined. This estimator consists of a single Newton-like update of the solution to a rank-based estimating equation from an initial consistent estimator. An automated penalized likelihood algorithm is proposed for estimating the optimal weight function for the estimating equations and the error hazard function that is needed in the variance estimator. In simulations, the estimated optimal weights are found to give reasonably efficient estimators of the regression parameters, and the variance estimators are found to perform well. The methodology is applied to an analysis of prognostic factors in non-Hodgkin's lymphoma. 相似文献
7.
This paper discusses the analysis of an extended finite mixture model where the latent classes corresponding to the mixture components for one set of observed variables influence a second set of observed variables. The research is motivated by a repeated measurement study using a random coefficient model to assess the influence of latent growth trajectory class membership on the probability of a binary disease outcome. More generally, this model can be seen as a combination of latent class modeling and conventional mixture modeling. The EM algorithm is used for estimation. As an illustration, a random-coefficient growth model for the prediction of alcohol dependence from three latent classes of heavy alcohol use trajectories among young adults is analyzed. 相似文献
8.
9.
Anton K. Formann 《Biometrical journal. Biometrische Zeitschrift》1982,24(2):171-190
In the present paper the linear logistic extension of latent class analysis is described. Thereby it is assumed that the item latent probabilities as well as the class sizes can be attributed to some explanatory variables. The basic equations of the model state the decomposition of the log-odds of the item latent probabilities and of the class sizes into weighted sums of basic parameters representing the effects of the predictor variables. Further, the maximum likelihood equations for these effect parameters and statistical tests for goodness-of-fit are given. Finally, an example illustrates the practical application of the model and the interpretation of the model parameters. 相似文献
10.
The maximum likelihood approach to jointly model the survival time and its longitudinal covariates has been successful to model both processes in longitudinal studies. Random effects in the longitudinal process are often used to model the survival times through a proportional hazards model, and this invokes an EM algorithm to search for the maximum likelihood estimates (MLEs). Several intriguing issues are examined here, including the robustness of the MLEs against departure from the normal random effects assumption, and difficulties with the profile likelihood approach to provide reliable estimates for the standard error of the MLEs. We provide insights into the robustness property and suggest to overcome the difficulty of reliable estimates for the standard errors by using bootstrap procedures. Numerical studies and data analysis illustrate our points. 相似文献
11.
We propose methods for Bayesian inference for a new class of semiparametric survival models with a cure fraction. Specifically, we propose a semiparametric cure rate model with a smoothing parameter that controls the degree of parametricity in the right tail of the survival distribution. We show that such a parameter is crucial for these kinds of models and can have an impact on the posterior estimates. Several novel properties of the proposed model are derived. In addition, we propose a class of improper noninformative priors based on this model and examine the properties of the implied posterior. Also, a class of informative priors based on historical data is proposed and its theoretical properties are investigated. A case study involving a melanoma clinical trial is discussed in detail to demonstrate the proposed methodology. 相似文献
12.
Summary : Dynamic latent class models provide a flexible framework for studying biologic processes that evolve over time. Motivated by studies of markers of the fertile days of the menstrual cycle, we propose a discrete‐time dynamic latent class framework, allowing change points to depend on time, fixed predictors, and random effects. Observed data consist of multivariate categorical indicators, which change dynamically in a flexible manner according to latent class status. Given the flexibility of the framework, which incorporates semi‐parametric components using mixtures of betas, identifiability constraints are needed to define the latent classes. Such constraints are most appropriately based on the known biology of the process. The Bayesian method is developed particularly for analyzing mucus symptom data from a study of women using natural family planning. 相似文献
13.
We propose an extension of the latent class model for the analysis of capture-recapture data which allows us to take into account the effect of a capture on the behavior of a subject with respect to future captures. The approach is based on the assumption that the variable indexing the latent class of a subject follows a Markov chain with transition probabilities depending on the previous capture history. Several constraints are allowed on these transition probabilities and on the parameters of the conditional distribution of the capture configuration given the latent process. We also allow for the presence of discrete explanatory variables, which may affect the parameters of the latent process. To estimate the resulting models, we rely on the conditional maximum likelihood approach and for this aim we outline an EM algorithm. We also give some simple rules for point and interval estimation of the population size. The approach is illustrated by applying it to two data sets concerning small mammal populations. 相似文献
14.
15.
This work was motivated by the need to combine outcome information from a reference population with risk factor information from a screened subpopulation in a setting where the analytic goal was to study the association between risk factors and multiple binary outcomes. To achieve such an analytic goal, this article proposes a two-stage latent class procedure that first summarizes the commonalities among outcomes using a reference population sample, then analyzes the association between outcomes and risk factors. It develops a pseudo-maximum likelihood approach to estimating model parameters. The performance of the proposed method is evaluated in a simulation study and in an illustrative analysis of data from the Women's Health and Aging Study, a recent investigation of the causes and course of disability in older women. Combining information in the proposed way is found to improve both accuracy and precision in summarizing multiple categorical outcomes, which effectively diminishes ambiguity and bias in making risk factor inferences. 相似文献
16.
We describe a method for making inferences about the joint operating characteristics of multiple diagnostic tests applied longitudinally and in the absence of a definitive reference test. Log-linear models are adopted for the classification distributions conditional on the latent state, where inclusion of appropriate interaction terms accommodates conditional dependencies among the tests. A marginal likelihood is constructed by marginalizing over a latent two-state Markov process. Specific latent processes we consider include a first-order Markov model, a second-order Markov model, and a time-nonhomogeneous Markov model, although the method is described in full generality. Adaptations to handle missing data are described. Model diagnostics are considered based on the bootstrap distribution of conditional residuals. The methods are illustrated by application to a study of diffuse bilateral infiltrates among patients in intensive care wards in which the objective was to assess aspects of validity and clinical agreement. 相似文献
17.
A critical issue in modelling binary response data is the choiceof the links. We introduce a new link based on the generalizedt-distribution. There are two parameters in the generalizedt-link: one parameter purely controls the heaviness of the tailsof the link and the second parameter controls the scale of thelink. Two major advantages are offered by the generalized t-links.First, a symmetric generalized t-link with an unknown shapeparameter is much more identifiable than a Student t-link withunknown degrees of freedom and a known scale parameter. Secondly,skewed generalized t-links with both unknown shape and scaleparameters provide much more flexible and improved skewed linkregression models than the existing skewed links. Various theoreticalproperties and attractive features of the proposed links areexamined and explored in detail. An efficient Markov chain MonteCarlo algorithm is developed for sampling from the posteriordistribution. The deviance information criterion measure isused for guiding the choice of links. The proposed methodologyis motivated and illustrated by prostate cancer data. 相似文献
18.
基于贝叶斯网潜类模型的高维SNPs分析 总被引:1,自引:0,他引:1
采用贝叶斯(Bayesian)网的潜类模型对GAW17高维SNPs数据进行分析,为复杂性状疾病遗传以及基因定位等方面的研究提供新的方法支持。本研究从GAW17提供的包含697个个体22条常染色体的上万个SNP中,随机挑选出1号染色体上12个基因的29个SNPs作为研究对象。按照累计信息贡献率达到95%的原则,应用贝叶斯网潜变量模型选出C1S11408,C1S3201,C1S1786等15个与X0互信息量大的SNPs位点来对研究人群进行分类与解释。结果表明697个个体总的被分为2个潜在类别,各类别的概率分别为0.68和0.32。对两类人群的疾病分布状况进行分析,结果表明二者不一致,第二个类别人群患病率(38.64%)明显高于第一个类别人群(25.99%)(χ2=11.46,P=0.001)。由此可见,两类人群疾病患病率的差别正是由选出的15个SNPs造成的,从而有理由认为这些SNPs为可疑致病位点,为进一步的研究提供明确的思路。 相似文献
19.
Misspecified proportional hazard models 总被引:5,自引:0,他引:5
20.
Louzada-Neto F 《Biometrics》1999,55(4):1281-1285
We propose a polyhazard model to deal with lifetime data associated with latent competing risks. The causes of failure are assumed unobserved and affecting individuals independently. The general framework allows a broad class of hazard models that includes the most common hazard-based models. The model accommodates bathtub and multimodal hazards, keeping enough flexibility for common lifetime data that cannot be accommodated by usual hazard-based models. Maximum likelihood estimation is discussed, and parametric simulation is used for hypothesis testing. 相似文献