Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun;27(6):1634-1649.
doi: 10.1177/0962280216666564. Epub 2016 Sep 19.

Multiple imputation by chained equations for systematically and sporadically missing multilevel data

Affiliations

Multiple imputation by chained equations for systematically and sporadically missing multilevel data

Matthieu Resche-Rigon et al. Stat Methods Med Res. 2018 Jun.

Abstract

In multilevel settings such as individual participant data meta-analysis, a variable is 'systematically missing' if it is wholly missing in some clusters and 'sporadically missing' if it is partly missing in some clusters. Previously proposed methods to impute incomplete multilevel data handle either systematically or sporadically missing data, but frequently both patterns are observed. We describe a new multiple imputation by chained equations (MICE) algorithm for multilevel data with arbitrary patterns of systematically and sporadically missing variables. The algorithm is described for multilevel normal data but can easily be extended for other variable types. We first propose two methods for imputing a single incomplete variable: an extension of an existing method and a new two-stage method which conveniently allows for heteroscedastic data. We then discuss the difficulties of imputing missing values in several variables in multilevel data using MICE, and show that even the simplest joint multilevel model implies conditional models which involve cluster means and heteroscedasticity. However, a simulation study finds that the proposed methods can be successfully combined in a multilevel MICE procedure, even when cluster means are not included in the imputation models.

Keywords: Missing data; chained equations; fully conditional specification; individual patient data meta-analysis; multilevel model; multiple imputation.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

    1. Rubin DB. Multiple Imputation for Nonresponse in Surveys Wiley Series in Probability and Statistics. New York: Wiley; 1987.
    1. Little R, Rubin D. Statistical analysis with missing data. 2nd ed. Hoboken, NJ: Wiley; 2002.
    1. White I, Royston P, Wood A. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30:377–399. - PubMed
    1. Carpenter J, Kenward M. Multiple imputation and its application. New York: John Wiley & Sons; 2012.
    1. Novo A, Schafer J. Package NORM, R package version 1.0-9.5. 2015 http://CRAN.R-project.org/package=norm (accessed)

Publication types