首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
This paper considers inference methods for case-control logistic regression in longitudinal setups. The motivation is provided by an analysis of plains bison spatial location as a function of habitat heterogeneity. The sampling is done according to a longitudinal matched case-control design in which, at certain time points, exactly one case, the actual location of an animal, is matched to a number of controls, the alternative locations that could have been reached. We develop inference methods for the conditional logistic regression model in this setup, which can be formulated within a generalized estimating equation (GEE) framework. This permits the use of statistical techniques developed for GEE-based inference, such as robust variance estimators and model selection criteria adapted for non-independent data. The performance of the methods is investigated in a simulation study and illustrated with the bison data analysis.  相似文献   

2.
Longitudinal data are common in clinical trials and observational studies, where missing outcomes due to dropouts are always encountered. Under such context with the assumption of missing at random, the weighted generalized estimating equation (WGEE) approach is widely adopted for marginal analysis. Model selection on marginal mean regression is a crucial aspect of data analysis, and identifying an appropriate correlation structure for model fitting may also be of interest and importance. However, the existing information criteria for model selection in WGEE have limitations, such as separate criteria for the selection of marginal mean and correlation structures, unsatisfactory selection performance in small‐sample setups, and so forth. In particular, there are few studies to develop joint information criteria for selection of both marginal mean and correlation structures. In this work, by embedding empirical likelihood into the WGEE framework, we propose two innovative information criteria named a joint empirical Akaike information criterion and a joint empirical Bayesian information criterion, which can simultaneously select the variables for marginal mean regression and also correlation structure. Through extensive simulation studies, these empirical‐likelihood‐based criteria exhibit robustness, flexibility, and outperformance compared to the other criteria including the weighted quasi‐likelihood under the independence model criterion, the missing longitudinal information criterion, and the joint longitudinal information criterion. In addition, we provide a theoretical justification of our proposed criteria, and present two real data examples in practice for further illustration.  相似文献   

3.
This paper considers the impact of bias in the estimation of the association parameters for longitudinal binary responses when there are drop-outs. A number of different estimating equation approaches are considered for the case where drop-out cannot be assumed to be a completely random process. In particular, standard generalized estimating equations (GEE), GEE based on conditional residuals, GEE based on multivariate normal estimating equations for the covariance matrix, and second-order estimating equations (GEE2) are examined. These different GEE estimators are compared in terms of finite sample and asymptotic bias under a variety of drop-out processes. Finally, the relationship between bias in the estimation of the association parameters and bias in the estimation of the mean parameters is explored.  相似文献   

4.
This paper presents a method for analysing longitudinal data when there are dropouts. In particular, we develop a simple method based on generalized linear mixture models for handling nonignorable dropouts for a variety of discrete and continuous outcomes. Statistical inference for the model parameters is based on a generalized estimating equations (GEE) approach (Liang and Zeger, 1986). The proposed method yields estimates of the model parameters that are valid when nonresponse is nonignorable under a variety of assumptions concerning the dropout process. Furthermore, the proposed method can be implemented using widely available statistical software. Finally, an example using data from a clinical trial of contracepting women is used to illustrate the methodology.  相似文献   

5.
Deletion diagnostics are introduced for the regression analysis of clustered binary outcomes estimated with alternating logistic regressions, an implementation of generalized estimating equations (GEE) that estimates regression coefficients in a marginal mean model and in a model for the intracluster association given by the log odds ratio. The diagnostics are developed within an estimating equations framework that recasts the estimating functions for association parameters based upon conditional residuals into equivalent functions based upon marginal residuals. Extensions of earlier work on GEE diagnostics follow directly, including computational formulae for one‐step deletion diagnostics that measure the influence of a cluster of observations on the estimated regression parameters and on the overall marginal mean or association model fit. The diagnostic formulae are evaluated with simulations studies and with an application concerning an assessment of factors associated with health maintenance visits in primary care medical practices. The application and the simulations demonstrate that the proposed cluster‐deletion diagnostics for alternating logistic regressions are good approximations of their exact fully iterated counterparts.  相似文献   

6.
This work develops a joint model selection criterion for simultaneously selecting the marginal mean regression and the correlation/covariance structure in longitudinal data analysis where both the outcome and the covariate variables may be subject to general intermittent patterns of missingness under the missing at random mechanism. The new proposal, termed “joint longitudinal information criterion” (JLIC), is based on the expected quadratic error for assessing model adequacy, and the second‐order weighted generalized estimating equation (WGEE) estimation for mean and covariance models. Simulation results reveal that JLIC outperforms existing methods performing model selection for the mean regression and the correlation structure in a two stage and hence separate manner. We apply the proposal to a longitudinal study to identify factors associated with life satisfaction in the elderly of Taiwan.  相似文献   

7.
Small-Sample Adjustments for Wald-Type Tests Using Sandwich Estimators   总被引:1,自引:0,他引:1  
The sandwich estimator of variance may be used to create robust Wald-type tests from estimating equations that are sums of K independent or approximately independent terms. For example, for repeated measures data on K individuals, each term relates to a different individual. These tests applied to a parameter may have greater than nominal size if K is small, or more generally if the parameter to be tested is essentially estimated from a small number of terms in the estimating equation. We offer some practical modifications to these robust Wald-type tests, which asymptotically approach the usual robust Wald-type tests. We show that one of these modifications provides exact coverage for a simple case and examine by simulation the modifications applied to the generalized estimating equations of Liang and Zeger (1986), conditional logistic regression, and the Cox proportional hazard model.  相似文献   

8.
In this paper, we develop a Gaussian estimation (GE) procedure to estimate the parameters of a regression model for correlated (longitudinal) binary response data using a working correlation matrix. A two‐step iterative procedure is proposed for estimating the regression parameters by the GE method and the correlation parameters by the method of moments. Consistency properties of the estimators are discussed. A simulation study was conducted to compare 11 estimators of the regression parameters, namely, four versions of the GE, five versions of the generalized estimating equations (GEEs), and two versions of the weighted GEE. Simulations show that (i) the Gaussian estimates have the smallest mean square error and best coverage probability if the working correlation structure is correctly specified and (ii) when the working correlation structure is correctly specified, the GE and the GEE with exchangeable correlation structure perform best as opposed to when the correlation structure is misspecified.  相似文献   

9.
The modeling of generalized estimating equations used in the analysis of longitudinal data whether in continuous or discrete variables, necessarily requires the prior specification of a correlation matrix in its iterative process in order to obtain the estimates of the regression parameters. Such a matrix is called working correlation matrix and its incorrect specification produces less efficient estimates for the model parameters. Due to this fact, this study aims to propose a selection criterion of working correlation matrix based on the covariance matrix estimates of correlated responses resulting from the limiting values of the association parameter estimates. For validation of the criterion, we used simulation studies considering normal and binary correlated responses. Compared to some criteria in the literature, it was concluded that the proposed criterion resulted in a better performance when the correlation structure for exchangeable working correlation matrix was considered as true structure in the simulated samples and for large samples, the proposed criterion showed similar behavior to the other criteria, resulting in higher success rates.  相似文献   

10.
Akaike's information criterion in generalized estimating equations   总被引:15,自引:0,他引:15  
Pan W 《Biometrics》2001,57(1):120-125
Correlated response data are common in biomedical studies. Regression analysis based on the generalized estimating equations (GEE) is an increasingly important method for such data. However, there seem to be few model-selection criteria available in GEE. The well-known Akaike Information Criterion (AIC) cannot be directly applied since AIC is based on maximum likelihood estimation while GEE is nonlikelihood based. We propose a modification to AIC, where the likelihood is replaced by the quasi-likelihood and a proper adjustment is made for the penalty term. Its performance is investigated through simulation studies. For illustration, the method is applied to a real data set.  相似文献   

11.
Information theory was applied to select the best model fitting total length ( L T)-at-age data and calculate the averaged model for Japanese eel Anguilla japonica compiled from published literature and the differences in growth between sexes were examined. Five candidate growth models were the von Bertalanffy, generalized von Bertalanffy, Gompertz, logistic and power models. The von Bertalanffy growth model with sex-specific coefficients was best supported by the data and nearly overlapped the averaged growth model based on Akaike weights, indicating a similar fit to the data. The Gompertz, generalized von Bertalanffy and power growth models were also substantially supported by the data. The L T at age of A. japonica were larger in females than in males according to the averaged growth mode, suggesting a sexual dimorphism in growth. Model inferences based on information theory, which deal with uncertainty in model selection and robust parameter estimates, are recommended for modelling the growth of A. japonica .  相似文献   

12.
Currently available methods for model selection used in phylogenetic analysis are based on an initial fixed-tree topology. Once a model is picked based on this topology, a rigorous search of the tree space is run under that model to find the maximum-likelihood estimate of the tree (topology and branch lengths) and the maximum-likelihood estimates of the model parameters. In this paper, we propose two extensions to the decision-theoretic (DT) approach that relax the fixed-topology restriction. We also relax the fixed-topology restriction for the Bayesian information criterion (BIC) and the Akaike information criterion (AIC) methods. We compare the performance of the different methods (the relaxed, restricted, and the likelihood-ratio test [LRT]) using simulated data. This comparison is done by evaluating the relative complexity of the models resulting from each method and by comparing the performance of the chosen models in estimating the true tree. We also compare the methods relative to one another by measuring the closeness of the estimated trees corresponding to the different chosen models under these methods. We show that varying the topology does not have a major impact on model choice. We also show that the outcome of the two proposed extensions is identical and is comparable to that of the BIC, Extended-BIC, and DT. Hence, using the simpler methods in choosing a model for analyzing the data is more computationally feasible, with results comparable to the more computationally intensive methods. Another outcome of this study is that earlier conclusions about the DT approach are reinforced. That is, LRT, Extended-AIC, and AIC result in more complicated models that do not contribute to the performance of the phylogenetic inference, yet cause a significant increase in the time required for data analysis.  相似文献   

13.
In the generalized method of moments approach to longitudinaldata analysis, unbiased estimating functions can be constructedto incorporate both the marginal mean and the correlation structureof the data. Increasing the number of parameters in the correlationstructure corresponds to increasing the number of estimatingfunctions. Thus, building a correlation model is equivalentto selecting estimating functions. This paper proposes a chi-squaredtest to choose informative unbiased estimating functions. Weshow that this methodology is useful for identifying which sourceof correlation it is important to incorporate when there aremultiple possible sources of correlation. This method can alsobe applied to determine the optimal working correlation forthe generalized estimating equation approach.  相似文献   

14.
Missing data are a common problem in longitudinal studies in the health sciences. Motivated by data from the Muscatine Coronary Risk Factor (MCRF) study, a longitudinal study of obesity, we propose a simple imputation method for handling non-ignorable non-responses (i.e., when non-response is related to the specific values that should have been obtained) in longitudinal studies with either discrete or continuous outcomes. In the proposed approach, two regression models are specified; one for the marginal mean of the response, the other for the conditional mean of the response given non-response patterns. Statistical inference for the model parameters is based on the generalized estimating equations (GEE) approach. An appealing feature of the proposed method is that it can be readily implemented using existing, widely-available statistical software. The method is illustrated using longitudinal data on obesity from the MCRF study.  相似文献   

15.
Surveys often contain qualitative variables for which respondents may select any number of the outcome categories. For instance, for the question “What type of contraception have you used?” with possible responses (oral, condom, lubricated condom, spermicide, and diaphragm), respondents would be instructed to select as many of the outcomes that apply. This situation is known as multiple responses. When the data includes stratification variables, we discuss two approaches: (1) the “GEE” approach which uses logit models directly applying the generalized estimating equations (GEE) method (Liang and Zeger, 1986); and (2) the “GMH” approach which extends the generalized Mantel–Haenszel type estimators (Greenland, 1989) to make inferences across multiple responses. These approaches can also be used for data with dependent observations across strata. (© 2008 WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim)  相似文献   

16.
The generalized estimating equations (GEE) derived by Liang and Zeger to analyze longitudinal data have been used in a wide range of medical and biological applications. To make regression a useful and meaningful statistical tool, emphasis should be placed not only on inference or fitting, but also on diagnosing potential data problems. Most of the usual diagnostics for linear regression models have been generalized for GEE. However, global influence measures based on the volume of confidence ellipsoids are not available for GEE analysis. This article presents an extension of these measures that is valid for correlated‐measures regression analysis using GEEs. The proposed measures are illustrated by an analysis of epileptic seizure count data arising from a study of prograbide as an adjuvant therapy for partial seizures and some simulated data sets.  相似文献   

17.
This paper develops a general approach for dealing with parametric transformations of covariates for longitudinal data, where the responses are modeled marginally and generalized estimating equations (GEEs) are used for estimation of regression parameters. We propose an iterative algorithm for obtaining regression and transformation parameters from estimating equations, utilizing existing software for GEE problems. The algorithmic technique is closely related to that used in the Box-Tidwell transformation in classical linear regression, but we develop it under the GEE setting and for more general transformation functions. We provide supporting theorems for consistency and asymptotic Normality of the estimates. Inference between two nested models is also considered. This methodology is applied to two data sets. One consists of pill dissolution data, the other is taken from the Pittsburgh Youth Study (PYS). The PYS is a prospective longitudinal study of the development of delinquency, substance use, and mental health in male youth. We use the model-based parametric approach to examine the association between alcohol use at an early stage of adolescent development and delinquency over the course of adolescence.  相似文献   

18.
Marginal models for longitudinal continuous proportional data   总被引:5,自引:0,他引:5  
Song PX  Tan M 《Biometrics》2000,56(2):496-502
Summary. Continuous proportional data arise when the response of interest is a percentage between zero and one, e.g., the percentage of decrease in renal function at different follow‐up times from the baseline. In this paper, we propose methods to directly model the marginal means of the longitudinal proportional responses using the simplex distribution of Barndorff‐Nielsen and Jørgensen that takes into account the fact that such responses are percentages restricted between zero and one and may as well have large dispersion. Parameters in such a marginal model are estimated using an extended version of the generalized estimating equations where the score vector is a nonlinear function of the observed response. The method is illustrated with an ophthalmology study on the use of intraocular gas in retinal repair surgeries.  相似文献   

19.
Longitudinal data often encounter missingness with monotone and/or intermittent missing patterns. Multiple imputation (MI) has been popularly employed for analysis of missing longitudinal data. In particular, the MI‐GEE method has been proposed for inference of generalized estimating equations (GEE) when missing data are imputed via MI. However, little is known about how to perform model selection with multiply imputed longitudinal data. In this work, we extend the existing GEE model selection criteria, including the “quasi‐likelihood under the independence model criterion” (QIC) and the “missing longitudinal information criterion” (MLIC), to accommodate multiple imputed datasets for selection of the MI‐GEE mean model. According to real data analyses from a schizophrenia study and an AIDS study, as well as simulations under nonmonotone missingness with moderate proportion of missing observations, we conclude that: (i) more than a few imputed datasets are required for stable and reliable model selection in MI‐GEE analysis; (ii) the MI‐based GEE model selection methods with a suitable number of imputations generally perform well, while the naive application of existing model selection methods by simply ignoring missing observations may lead to very poor performance; (iii) the model selection criteria based on improper (frequentist) multiple imputation generally performs better than their analogies based on proper (Bayesian) multiple imputation.  相似文献   

20.
Penalized estimating equations   总被引:1,自引:0,他引:1  
Fu WJ 《Biometrics》2003,59(1):126-132
Penalty models--such as the ridge estimator, the Stein estimator, the bridge estimator, and the Lasso-have been proposed to deal with collinearity in regressions. The Lasso, for instance, has been applied to linear models, logistic regressions, Cox proportional hazard models, and neural networks. This article considers the bridge penalty model with penalty sigma(j)/beta(j)/gamma for estimating equations in general and applies this penalty model to the generalized estimating equations (GEE) in longitudinal studies. The lack of joint likelihood in the GEE is overcome by the penalized estimating equations, in which no joint likelihood is required. The asymptotic results for the penalty estimator are provided. It is demonstrated, with a simulation and an application, that the penalized GEE potentially improves the performance of the GEE estimator, and enjoys the same properties as linear penalty models.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号