Comparing the performance of propensity score methods in healthcare database studies with rare outcomes
- PMID: 28208229
- DOI: 10.1002/sim.7250
Comparing the performance of propensity score methods in healthcare database studies with rare outcomes
Abstract
Nonrandomized studies of treatments from electronic healthcare databases are critical for producing the evidence necessary to making informed treatment decisions, but often rely on comparing rates of events observed in a small number of patients. In addition, studies constructed from electronic healthcare databases, for example, administrative claims data, often adjust for many, possibly hundreds, of potential confounders. Despite the importance of maximizing efficiency when there are many confounders and few observed outcome events, there has been relatively little research on the relative performance of different propensity score methods in this context. In this paper, we compare a wide variety of propensity-based estimators of the marginal relative risk. In contrast to prior research that has focused on specific statistical methods in isolation of other analytic choices, we instead consider a method to be defined by the complete multistep process from propensity score modeling to final treatment effect estimation. Propensity score model estimation methods considered include ordinary logistic regression, Bayesian logistic regression, lasso, and boosted regression trees. Methods for utilizing the propensity score include pair matching, full matching, decile strata, fine strata, regression adjustment using one or two nonlinear splines, inverse propensity weighting, and matching weights. We evaluate methods via a 'plasmode' simulation study, which creates simulated datasets on the basis of a real cohort study of two treatments constructed from administrative claims data. Our results suggest that regression adjustment and matching weights, regardless of the propensity score model estimation method, provide lower bias and mean squared error in the context of rare binary outcomes. Copyright © 2017 John Wiley & Sons, Ltd.
Keywords: epidemiology; healthcare databases; propensity score; risk ratio; simulation.
Copyright © 2017 John Wiley & Sons, Ltd.
Similar articles
-
Methods for Variable Selection and Treatment Effect Estimation in Nonrandomized Studies with Few Outcome Events and Many Confounders [Internet].Washington (DC): Patient-Centered Outcomes Research Institute (PCORI); 2018 Mar. Washington (DC): Patient-Centered Outcomes Research Institute (PCORI); 2018 Mar. PMID: 37184181 Free Books & Documents. Review.
-
Variable Selection for Confounding Adjustment in High-dimensional Covariate Spaces When Analyzing Healthcare Databases.Epidemiology. 2017 Mar;28(2):237-248. doi: 10.1097/EDE.0000000000000581. Epidemiology. 2017. PMID: 27779497
-
Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation.Epidemiology. 2018 Jan;29(1):96-106. doi: 10.1097/EDE.0000000000000762. Epidemiology. 2018. PMID: 28991001
-
Should a propensity score model be super? The utility of ensemble procedures for causal adjustment.Stat Med. 2019 Apr 30;38(9):1690-1702. doi: 10.1002/sim.8075. Epub 2018 Dec 26. Stat Med. 2019. PMID: 30586681
-
Propensity score analysis with missing data.Psychol Methods. 2016 Sep;21(3):427-45. doi: 10.1037/met0000076. Epub 2016 Mar 10. Psychol Methods. 2016. PMID: 26962757 Review.
Cited by
-
The effect of a constraint on the maximum number of controls matched to each treated subject on the performance of full matching on the propensity score when estimating risk differences.Stat Med. 2021 Jan 15;40(1):101-118. doi: 10.1002/sim.8764. Epub 2020 Oct 7. Stat Med. 2021. PMID: 33027845 Free PMC article.
-
Evaluating the Utility of Coarsened Exact Matching for Pharmacoepidemiology Using Real and Simulated Claims Data.Am J Epidemiol. 2020 Jun 1;189(6):613-622. doi: 10.1093/aje/kwz268. Am J Epidemiol. 2020. PMID: 31845719 Free PMC article.
-
Evaluating large-scale propensity score performance through real-world and synthetic data experiments.Int J Epidemiol. 2018 Dec 1;47(6):2005-2014. doi: 10.1093/ije/dyy120. Int J Epidemiol. 2018. PMID: 29939268 Free PMC article.
-
Propensity-score matching with competing risks in survival analysis.Stat Med. 2019 Feb 28;38(5):751-777. doi: 10.1002/sim.8008. Epub 2018 Oct 22. Stat Med. 2019. PMID: 30347461 Free PMC article.
-
Gestational Diabetes and Maternal Weight Management During and After Pregnancy.J Womens Health (Larchmt). 2019 May;28(5):646-653. doi: 10.1089/jwh.2018.7020. Epub 2018 Nov 29. J Womens Health (Larchmt). 2019. PMID: 30457439 Free PMC article.