. 2018 Jun 30;37(14):2252-2266.

doi: 10.1002/sim.7654. Epub 2018 Apr 16.

Bootstrap inference when using multiple imputation

Michael Schomaker¹, Christian Heumann²

Affiliations

¹ Centre for Infectious Disease Epidemiology & Research, University of Cape Town, Falmouth Building, Observatory, Cape Town, 7925, South Africa.
² Christian Heumann, Institut für Statistik, Ludwig-Maximilians Universität München, München, Germany.

PMID: 29682776
PMCID: PMC5986623
DOI: 10.1002/sim.7654

Bootstrap inference when using multiple imputation

Michael Schomaker et al. Stat Med. 2018.

. 2018 Jun 30;37(14):2252-2266.

doi: 10.1002/sim.7654. Epub 2018 Apr 16.

Authors

Michael Schomaker¹, Christian Heumann²

Affiliations

¹ Centre for Infectious Disease Epidemiology & Research, University of Cape Town, Falmouth Building, Observatory, Cape Town, 7925, South Africa.
² Christian Heumann, Institut für Statistik, Ludwig-Maximilians Universität München, München, Germany.

PMID: 29682776
PMCID: PMC5986623
DOI: 10.1002/sim.7654

Abstract

Many modern estimators require bootstrapping to calculate confidence intervals because either no analytic standard error is available or the distribution of the parameter of interest is nonsymmetric. It remains however unclear how to obtain valid bootstrap inference when dealing with multiple imputation to address missing data. We present 4 methods that are intuitively appealing, easy to implement, and combine bootstrap estimation with multiple imputation. We show that 3 of the 4 approaches yield valid inference, but that the performance of the methods varies with respect to the number of imputed data sets and the extent of missingness. Simulation studies reveal the behavior of our approaches in finite samples. A topical analysis from HIV treatment research, which determines the optimal timing of antiretroviral treatment initiation in young children, demonstrates the practical implications of the 4 methods in a sophisticated and realistic setting. This analysis suffers from missing data and uses the g-formula for inference, a method for which no standard errors are available.

Keywords: HIV; causal inference; g-methods; missing data; resampling.

PubMed Disclaimer

Figures

**Figure 1**
Coverage probability of the interval estimates for β₁ in the first simulation setting dependent on the number of imputations. Results related to the complete simulated data, i.e. before missing data are generated, are labelled “original data”.

**Figure 2**
Estimate of β₁ in the first simulation setting, for a random simulation run: distribution of ‘MI Boot (pooled)’ for each imputed dataset (top) and distribution of ‘Boot MI (PS)’ for 50 random bootstrap samples (PS). Point estimates are marked by the black tick marks on the x-axis.

**Figure 3**
Estimated cumulative mortality difference between the interventions ‘immediate ART’ and ‘350/15’ at 3 years: distributions and confidence intervals of different estimators

**Figure 4**
Estimated cumulative mortality difference: distribution of ‘MI Boot (PS)’ for each imputed dataset (top) and distribution of ‘Boot MI (PS)’ for 25 random bootstrap samples (bottom). Point estimates are marked by the black tick marks on the x-axis.

See this image and copyright information in PMC

References

1. Rubin DB. Multiple imputation after 18+ years. Journal of the American Statistical Association. 1996;91(434):473–489.
1. Horton NJ, Kleinman KP. Much ado about nothing: a comparison of missing data methods and software to fit incomplete regression models. The American Statistician. 2007;61:79–90. - PMC - PubMed
1. Honaker J, King G, Blackwell M. Amelia II: A program for missing data. Journal of Statistical Software. 2011;45(7):1–47.
1. van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R. Journal of Statistical Software. 2011;45(3):1–67.
1. Royston P, White IR. Multiple imputation by chained equations (mice): Implementation in Stata. Journal of Statistical Software. 2011;45(4):1–20.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Bootstrap inference when using multiple imputation

Affiliations

Bootstrap inference when using multiple imputation

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources