Confounding and regression adjustment in difference-in-differences studies
- PMID: 33978956
- PMCID: PMC8522571
- DOI: 10.1111/1475-6773.13666
Confounding and regression adjustment in difference-in-differences studies
Abstract
Objective: To define confounding bias in difference-in-difference studies and compare regression- and matching-based estimators designed to correct bias due to observed confounders.
Data sources: We simulated data from linear models that incorporated different confounding relationships: time-invariant covariates with a time-varying effect on the outcome, time-varying covariates with a constant effect on the outcome, and time-varying covariates with a time-varying effect on the outcome. We considered a simple setting that is common in the applied literature: treatment is introduced at a single time point and there is no unobserved treatment effect heterogeneity.
Study design: We compared the bias and root mean squared error of treatment effect estimates from six model specifications, including simple linear regression models and matching techniques.
Data collection: Simulation code is provided for replication.
Principal findings: Confounders in difference-in-differences are covariates that change differently over time in the treated and comparison group or have a time-varying effect on the outcome. When such a confounding variable is measured, appropriately adjusting for this confounder (ie, including the confounder in a regression model that is consistent with the causal model) can provide unbiased estimates with optimal SE. However, when a time-varying confounder is affected by treatment, recovering an unbiased causal effect using difference-in-differences is difficult.
Conclusions: Confounding in difference-in-differences is more complicated than in cross-sectional settings, from which techniques and intuition to address observed confounding cannot be imported wholesale. Instead, analysts should begin by postulating a causal model that relates covariates, both time-varying and those with time-varying effects on the outcome, to treatment. This causal model will then guide the specification of an appropriate analytical model (eg, using regression or matching) that can produce unbiased treatment effect estimates. We emphasize the importance of thoughtful incorporation of covariates to address confounding bias in difference-in-difference studies.
Keywords: difference-in-differences; matching; parallel trends; regression adjustment; time-varying confounding.
© 2021 The Authors. Health Services Research published by Wiley Periodicals LLC on behalf of Health Research and Educational Trust.
Figures




References
-
- National Federation of Independent Business v. Sebelius. (2011). www.oyez.org/cases/2011/11-393
-
- Antonisse L, Garfield R, Rudowitz R, Artiga S. The effects of Medicaid expansion under the ACA: updated findings from a literature review. Published 2018. https://www.kff.org/medicaid/issue-brief/the-effects-of-medicaid-expansi...
-
- Abadie A. Semiparametric difference‐in‐differences estimators. Rev Econ Stud. 2005;72:1‐19. 10.1111/0034-6527.00321. - DOI
-
- Bilinski A, Hatfield LA. Seeking evidence of absence: Reconsidering tests of model assumptions. ArXiv180503273 Stat. Published online May 8, 2018. Accessed July 23, 2018. http://arxiv.org/abs/1805.03273
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources