首页 | 本学科首页   官方微博 | 高级检索  
     


Variable selection for logistic regression using a prediction-focused information criterion
Authors:Claeskens Gerda  Croux Christophe  Van Kerckhoven Johan
Affiliation:ORSTAT and University Center for Statistics, K.U. Leuven, Naamsestraat 69, B-3000 Leuven, Belgium. gerda.claeskens@econ.kuleuven.be
Abstract:In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables. The standard version of the FIC measures the mean squared error of the estimator of the quantity of interest in the selected model. In this article, we propose more general versions of the FIC, allowing other risk measures such as the one based on L(p) error. When prediction of an event is important, as is often the case in medical applications, we construct an FIC using the error rate as a natural risk measure. The advantages of using an information criterion which depends on both the quantity of interest and the selected risk measure are illustrated by means of a simulation study and application to a study on diabetic retinopathy.
Keywords:Error rate    Focused information criterion    Forward selection    Logistic regression    Model selection    Risk measures
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号