Comparative Study

Eur J Epidemiol. 2019 Jan;34(1):23-36.

doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.

A comparison of different methods to handle missing data in the context of propensity score analysis

Jungyeon Choi¹, Olaf M Dekkers²,³, Saskia le Cessie²,⁴

Affiliations

¹ Department of Clinical Epidemiology, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands. J.Choi@lumc.nl.
² Department of Clinical Epidemiology, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.
³ Department of Endocrinology and Metabolism, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.
⁴ Department of Biomedical Data Sciences, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.

PMID: 30341708
PMCID: PMC6325992
DOI: 10.1007/s10654-018-0447-z

Jungyeon Choi et al. Eur J Epidemiol. 2019 Jan.

Abstract

Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity score methods is missing values in confounders. Several strategies for handling missing values exist, but guidance on choosing the best method is needed. In this simulation study, we compared four strategies for handling missing covariate values in propensity matching and propensity weighting: complete case analysis, the missing indicator method, multiple imputation, and the combination of multiple imputation with the missing indicator method. We also aimed to provide guidance on choosing the optimal strategy. Simulated scenarios varied in the missingness mechanism and in the presence of effect modification or unmeasured confounding. Additionally, we demonstrated how missingness graphs help to clarify the missing data structure. When no effect modification existed, complete case analysis yielded valid causal treatment effects even when data were missing not at random. In some situations, complete case analysis was also able to partially correct for unmeasured confounding. Multiple imputation worked well if the data were missing (completely) at random and the imputation model was correctly specified. In the presence of effect modification, imputation models more complex than the default options of commonly used statistical software were required. Multiple imputation may fail when data are missing not at random. In that case, combining multiple imputation with the missing indicator method reduced the bias, because the missing indicator variable can act as a proxy for unobserved confounding. The optimal way to handle missing values in covariates of propensity score models depends on the missing data structure and the presence of effect modification. When effect modification is present, default settings of imputation methods may yield biased results even if data are missing at random.

Keywords: Effect modification; Missing data; Missing indicator; Missingness graph; Multiple imputation; Propensity score analysis.
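
To make two of the compared strategies concrete, the following is a minimal sketch, not the authors' simulation code. It assumes a hypothetical toy setting: one binary confounder `x` with values missing completely at random, a true treatment effect of 1.0, and no effect modification. Because `x` is binary, the propensity score is estimated simply as the treated fraction within each covariate level, which stands in for a saturated logistic model. All names and parameter values (`simulate`, `ipw_effect`, `p_missing`, the coefficients) are illustrative assumptions.

```python
import random
from collections import defaultdict


def simulate(n, seed=0, p_missing=0.3):
    """Hypothetical data: binary confounder x, treatment t, outcome y.

    x is set to None completely at random (MCAR); true effect of t is 1.0.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = 1 if rng.random() < 0.5 else 0
        t = 1 if rng.random() < (0.7 if x else 0.3) else 0
        y = 1.0 * t + 2.0 * x + rng.gauss(0, 1)
        x_obs = None if rng.random() < p_missing else x
        rows.append((x_obs, t, y))
    return rows


def ipw_effect(rows):
    """Inverse probability weighted difference in mean outcomes.

    The propensity score is the treated fraction within each level of the
    observed covariate. Assumes every level contains both treated and
    untreated subjects, otherwise a weight would be undefined.
    """
    n_level = defaultdict(int)
    n_treated = defaultdict(int)
    for x, t, _ in rows:
        n_level[x] += 1
        n_treated[x] += t
    # For each treatment arm: [weighted outcome sum, weight sum]
    sums = {0: [0.0, 0.0], 1: [0.0, 0.0]}
    for x, t, y in rows:
        ps = n_treated[x] / n_level[x]
        w = 1.0 / ps if t else 1.0 / (1.0 - ps)
        sums[t][0] += w * y
        sums[t][1] += w
    return sums[1][0] / sums[1][1] - sums[0][0] / sums[0][1]


rows = simulate(20000)

# Complete case analysis: drop subjects whose confounder is missing.
cc_estimate = ipw_effect([r for r in rows if r[0] is not None])

# Missing indicator method: keep everyone and treat "missing" (None)
# as its own covariate level in the propensity model.
mi_estimate = ipw_effect(rows)

print("complete case:", round(cc_estimate, 2))
print("missing indicator:", round(mi_estimate, 2))
```

In this toy setup the complete case estimate should land close to the assumed true effect of 1.0, whereas the missing indicator estimate stays partly confounded, because `x` is not adjusted for among subjects whose value is missing. Multiple imputation and the combined method are not shown here; they would need an imputation model, which in practice usually comes from a dedicated statistical package.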

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

**Fig. 1**
M-graphs for Simulation setting 1: MCAR scenario (a), MAR scenario (b), and MNAR scenario (c)

**Fig. 2**
M-graphs for Simulation setting 2: MCAR scenario (a), MAR scenario (b), and MNAR scenario (c)

**Fig. 3**
M-graphs for Simulation setting 3: MCAR scenario (a), MNAR scenario (b)

**Fig. 4**
Mean treatment effects and their 5th and 95th percentile ranges estimated by propensity weighting in Simulation setting 1 (left), 2 (middle) and 3 (right). For each missing scenario, missing data are handled with complete case analysis, missing indicator method, multiple imputation, and the combination of multiple imputation and missing indicator method (Combined method). The vertical lines represent the true treatment effect

Cited by

Contribution of socio-demographic and clinical characteristics to predict initial referrals to psychosocial interventions in patients with serious mental illness.
Barbalat G, Plasse J, Chéreau-Boudet I, Gouache B, Legros-Lafarge E, Massoubre C, Guillard-Bouhet N, Haesebaert F, Franck N. Epidemiol Psychiatr Sci. 2024 Jan 29;33:e2. doi: 10.1017/S2045796024000015. PMID: 38282331 Free PMC article.
Effectiveness of Angiotensin II for Catecholamine Refractory Septic or Distributive Shock on Mortality: A Propensity Score Weighted Analysis of Real-World Experience in the Medical ICU.
Quan M, Cho N, Bushell T, Mak J, Nguyen N, Litwak J, Rockwood N, Nguyen HB. Crit Care Explor. 2022 Jan 18;4(1):e0623. doi: 10.1097/CCE.0000000000000623. eCollection 2022 Jan. PMID: 35072084 Free PMC article.
Anatomic distribution of lower extremity deep venous thrombosis is associated with an increased risk of pulmonary embolism: A 10-year retrospective analysis.
Zhang J, Chen Y, Wang Z, Chen X, Liu Y, Liu M. Front Cardiovasc Med. 2023 Mar 22;10:1154875. doi: 10.3389/fcvm.2023.1154875. eCollection 2023. PMID: 37034353 Free PMC article.
Robust estimation of dementia prevalence from two-phase surveys with non-responders via propensity score stratification.
Shen C, Pei M, Wang X, Zhao Y, Wang L, Tan J, Deng K, Li N. BMC Med Res Methodol. 2023 May 27;23(1):130. doi: 10.1186/s12874-023-01954-0. PMID: 37237383 Free PMC article.
Target Trial Emulation and Bias Through Missing Eligibility Data: An Application to a Study of Palivizumab for the Prevention of Hospitalization Due to Infant Respiratory Illness.
Tompsett D, Zylbersztejn A, Hardelid P, De Stavola B. Am J Epidemiol. 2023 Apr 6;192(4):600-611. doi: 10.1093/aje/kwac202. PMID: 36509514 Free PMC article. Clinical Trial.
