Aberrant crypt foci and semiparametric modeling of correlated binary data |
| |
Authors: | Apanasovich Tatiyana V Ruppert David Lupton Joanne R Popovic Natasa Turner Nancy D Chapkin Robert S Carroll Raymond J |
| |
Affiliation: | School of Operations Research and Industrial Engineering, Cornell University, Ithaca, New York 14853, U.S.A.;Department of Nutrition and Food Science, Texas A&M University, College Station, Texas 77843-2253, U.S.A.;Department of Statistics, Texas A&M University, College Station, Texas 77843-3143, U.S.A. |
| |
Abstract: | Summary . Motivated by the spatial modeling of aberrant crypt foci (ACF) in colon carcinogenesis, we consider binary data with probabilities modeled as the sum of a nonparametric mean plus a latent Gaussian spatial process that accounts for short-range dependencies. The mean is modeled in a general way using regression splines. The mean function can be viewed as a fixed effect and is estimated with a penalty for regularization. With the latent process viewed as another random effect, the model becomes a generalized linear mixed model. In our motivating data set and other applications, the sample size is too large to easily accommodate maximum likelihood or restricted maximum likelihood estimation (REML), so pairwise likelihood, a special case of composite likelihood, is used instead. We develop an asymptotic theory for models that are sufficiently general to be used in a wide variety of applications, including, but not limited to, the problem that motivated this work. The splines have penalty parameters that must converge to zero asymptotically: we derive theory for this along with a data-driven method for selecting the penalty parameter, a method that is shown in simulations to improve greatly upon standard devices, such as likelihood crossvalidation. Finally, we apply the methods to the data from our experiment ACF. We discover an unexpected location for peak formation of ACF. |
| |
Keywords: | Aberrant crypt foci Colon carcinogenesis Composite likelihood Generalized linear mixed models Longitudinal data Pairwise likelihood Partially linear model Semiparametric regression Single index models Spatial statistics |
本文献已被 PubMed 等数据库收录! |
|