Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values

Ian R White¹, John B Carlin

Affiliations

PMID: 20842622
DOI: 10.1002/sim.3944

Comparative Study

Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values

Ian R White et al. Stat Med. 2010.

. 2010 Dec 10;29(28):2920-31.

doi: 10.1002/sim.3944.

Authors

Ian R White¹, John B Carlin

Affiliation

¹ MRC Biostatistics Unit, Institute of Public Health, Robinson Way, Cambridge CB2 0SR, U.K. ian.white@mrc-bsu.cam.ac.uk

PMID: 20842622
DOI: 10.1002/sim.3944

Abstract

When missing data occur in one or more covariates in a regression model, multiple imputation (MI) is widely advocated as an improvement over complete-case analysis (CC). We use theoretical arguments and simulation studies to compare these methods with MI implemented under a missing at random assumption. When data are missing completely at random, both methods have negligible bias, and MI is more efficient than CC across a wide range of scenarios. For other missing data mechanisms, bias arises in one or both methods. In our simulation setting, CC is biased towards the null when data are missing at random. However, when missingness is independent of the outcome given the covariates, CC has negligible bias and MI is biased away from the null. With more general missing data mechanisms, bias tends to be smaller for MI than for CC. Since MI is not always better than CC for missing covariate problems, the choice of method should take into account what is known about the missing data mechanism in a particular substantive application. Importantly, the choice of method should not be based on comparison of standard errors. We propose new ways to understand empirical differences between MI and CC, which may provide insights into the appropriateness of the assumptions underlying each method, and we propose a new index for assessing the likely gain in precision from MI: the fraction of incomplete cases among the observed values of a covariate (FICO).

PubMed Disclaimer

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values

Affiliation

Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values

Authors

Affiliation

Abstract

Publication types

MeSH terms

Grants and funding