Simulation‐based evaluation of the linear‐mixed model in the presence of an increasing proportion of singletons

Authors:	Robin Bruyndonckx, Niel Hens, Marc Aerts

Institution:	1. Interuniversity Institute for Biostatistics and statistical Bioinformatics (I‐BIOSTAT), Hasselt University, Diepenbeek, Belgium; 2. Laboratory of Medical Microbiology, Vaccine & Infectious Disease Institute (VAXINFECTIO), University of Antwerp, Antwerp, Belgium; 3. Centre for Health Economic Research and Modelling of Infectious Diseases (CHERMID), Vaccine & Infectious Disease Institute (VAXINFECTIO), University of Antwerp, Antwerp, Belgium

Abstract:	Data in medical sciences often have a hierarchical structure with lower‐level units (e.g. children) nested in higher‐level units (e.g. departments). Several specific but frequently studied settings, mainly in longitudinal and family research, involve a large number of units that tend to be quite small, with units containing only one element referred to as singletons. Regardless of sparseness, hierarchical data should be analyzed with appropriate methodology, such as linear‐mixed models. Using a simulation study, based on the structure of a data example on Ceftriaxone consumption in hospitalized children, we assess the impact of an increasing proportion of singletons (0–95%), in data with a low, medium, or high intracluster correlation, on the stability of linear‐mixed model parameter estimates, confidence interval coverage, and F test performance. Techniques frequently used in the presence of singletons include ignoring clustering, dropping the singletons from the analysis, and grouping the singletons into an artificial unit. We show that both the fixed and random effects estimates and their standard errors are stable in the presence of an increasing proportion of singletons. We demonstrate that ignoring clustering and dropping singletons should be avoided, as they come with biased standard error estimates. Grouping the singletons into an artificial unit might be considered, although the linear‐mixed model performs better even when the proportion of singletons is high. We conclude that the linear‐mixed model is stable in the presence of singletons when both lower‐ and higher‐level sample sizes are fixed. In this setting, the use of remedial measures, such as ignoring clustering and grouping or removing singletons, should be discouraged.

Keywords:	F test; hierarchical data; intracluster correlation; performance characteristics; sparseness
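
The kind of data set the abstract describes can be illustrated with a minimal sketch. It is not the authors' simulation design. It assumes a random‐intercept model with total variance fixed at 1, so that the between‐cluster variance equals the intracluster correlation, and it holds both the number of clusters and the number of observations fixed while the proportion of singletons varies. The function names `cluster_sizes` and `simulate` and all parameter values are hypothetical.

```python
import math
import random


def cluster_sizes(n_clusters, n_obs, prop_singletons):
    """Return a list of cluster sizes with the requested share of singletons.

    Assumption: the observations not used by singletons are spread as
    evenly as possible over the remaining clusters (each of size >= 2).
    """
    n_single = int(round(prop_singletons * n_clusters))
    n_multi = n_clusters - n_single
    remaining = n_obs - n_single
    if n_multi <= 0 or remaining < 2 * n_multi:
        raise ValueError("not enough observations for the non-singleton clusters")
    base, extra = divmod(remaining, n_multi)
    multi = [base + 1] * extra + [base] * (n_multi - extra)
    return [1] * n_single + multi


def simulate(n_clusters, n_obs, prop_singletons, icc, beta0=0.0, seed=0):
    """Simulate (cluster_id, y) pairs from y_ij = beta0 + b_i + e_ij.

    Assumption: Var(b_i) = icc and Var(e_ij) = 1 - icc, so the total
    variance is 1 and the intracluster correlation equals icc.
    """
    rng = random.Random(seed)
    sd_between = math.sqrt(icc)
    sd_within = math.sqrt(1.0 - icc)
    data = []
    for cluster_id, size in enumerate(cluster_sizes(n_clusters, n_obs, prop_singletons)):
        b = rng.gauss(0.0, sd_between)  # random intercept shared within the cluster
        for _ in range(size):
            data.append((cluster_id, beta0 + b + rng.gauss(0.0, sd_within)))
    return data


# Hypothetical scenario: 100 clusters, 500 observations, 95% singletons, ICC 0.3.
data = simulate(n_clusters=100, n_obs=500, prop_singletons=0.95, icc=0.3)
```

A data set generated this way could then be passed to any linear‐mixed model routine, and the fit repeated over many seeds to examine the stability of the estimates; that fitting step is left out here.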