Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables
White, Ian R;
Daniel, Rhian;
Royston, Patrick;
(2010)
Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables.
Computational statistics & data analysis, 54 (10).
pp. 2267-2275.
ISSN 0167-9473
DOI: https://doi.org/10.1016/j.csda.2010.04.005
Permanent Identifier
Use this Digital Object Identifier when citing or linking to this resource.
Multiple imputation is a popular way to handle missing data. Automated procedures are widely available in standard software. However, such automated procedures may hide many assumptions and possible difficulties from the view of the data analyst. Imputation procedures such as monotone imputation and imputation by chained equations often involve the fitting of a regression model for a categorical outcome. If perfect prediction occurs in such a model, then automated procedures may give severely biased results. This is a problem in some standard software, but it may be avoided by bootstrap methods, penalised regression methods, or a new augmentation procedure. Crown Copyright (C) 2010 Published by Elsevier B.V. All rights reserved.