Variable selection for logistic regression using a prediction-focused information criterion |
| |
Authors: | Gerda Claeskens; Christophe Croux; Johan Van Kerckhoven |
| |
Affiliation: | ORSTAT and University Center for Statistics, K.U. Leuven, Naamsestraat 69, B-3000 Leuven, Belgium. gerda.claeskens@econ.kuleuven.be |
| |
Abstract: | In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables. The standard version of the FIC measures the mean squared error of the estimator of the quantity of interest in the selected model. In this article, we propose more general versions of the FIC, allowing other risk measures such as the one based on L_p error. When prediction of an event is important, as is often the case in medical applications, we construct an FIC using the error rate as a natural risk measure. The advantages of using an information criterion that depends on both the quantity of interest and the selected risk measure are illustrated by means of a simulation study and an application to a study on diabetic retinopathy. |
| |
Keywords: | Error rate; Focused information criterion; Forward selection; Logistic regression; Model selection; Risk measures |
This article is indexed in PubMed and other databases. |
|