Model misspecification and bias for inverse probability weighting estimators of average causal effects

Ingeborg Waernbaum¹, Laura Pazzagli²

Affiliations

¹ Department of Statistics, Uppsala University, Sweden and Institute for Evaluation of Labour Market and Education Policy, IFAU, Uppsala, Sweden.
² Centre for Pharmacoepidemiology, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden.

PMID: 36045099
PMCID: PMC10087564
DOI: 10.1002/bimj.202100118

Biom J. 2023 Feb;65(2):e2100118. doi: 10.1002/bimj.202100118. Epub 2022 Aug 31.

Abstract

Commonly used semiparametric estimators of causal effects specify parametric models for the propensity score (PS) and the conditional outcome. An example is an augmented inverse probability weighting (IPW) estimator, frequently referred to as a doubly robust estimator because it is consistent if at least one of the two models is correctly specified. However, in many observational studies, the role of the parametric models is not to represent the data-generating process but rather to facilitate the adjustment for confounding, making the assumption of at least one true model unlikely to hold. In this paper, we propose a crude analytical approach to study the large-sample bias of estimators when the models are assumed to be approximations of the data-generating process, namely, when all models are misspecified. We apply our approach to three prototypical estimators of the average causal effect: two IPW estimators, using a misspecified PS model, and an augmented IPW (AIPW) estimator, using misspecified models for the outcome regression (OR) and the PS. For the two IPW estimators, we show that normalization, in addition to having a smaller variance, also offers some protection against bias due to model misspecification. To analyze the question of when the use of two misspecified models is better than one, we derive necessary and sufficient conditions for when the AIPW estimator has a smaller bias than a simple IPW estimator and when it has a smaller bias than an IPW estimator with normalized weights. If the misspecification of the outcome model is moderate, the comparisons of the biases of the IPW and AIPW estimators show that the AIPW estimator has a smaller bias than the IPW estimators. However, all biases include a scaling with the PS-model error, and we suggest caution in modeling the PS whenever such a model is involved.
For numerical and finite sample illustrations, we include three simulation studies and corresponding approximations of the large-sample biases. In a dataset from the National Health and Nutrition Examination Survey, we estimate the effect of smoking on blood lead levels.
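
To make the three estimators concrete, the following is a minimal Python sketch, not the authors' implementation. It uses the textbook forms of the simple IPW, normalized IPW, and AIPW estimators. The function names, the simulated data-generating process in `demo`, and the hand-fixed coefficients of the misspecified PS model are all assumptions made for illustration; in the paper's setting the misspecified PS would instead come from a fitted working model.

```python
import numpy as np


def ipw_simple(y, t, e):
    """Simple IPW estimate of the average causal effect (weights not normalized)."""
    y, t, e = (np.asarray(a, dtype=float) for a in (y, t, e))
    return np.mean(t * y / e) - np.mean((1 - t) * y / (1 - e))


def ipw_normalized(y, t, e):
    """IPW estimate with weights normalized to sum to one within each arm."""
    y, t, e = (np.asarray(a, dtype=float) for a in (y, t, e))
    w1 = t / e
    w0 = (1 - t) / (1 - e)
    return np.sum(w1 * y) / np.sum(w1) - np.sum(w0 * y) / np.sum(w0)


def aipw(y, t, e, mu1, mu0):
    """Augmented IPW estimate combining a PS model with outcome regressions."""
    y, t, e, mu1, mu0 = (np.asarray(a, dtype=float) for a in (y, t, e, mu1, mu0))
    aug1 = mu1 + t * (y - mu1) / e
    aug0 = mu0 + (1 - t) * (y - mu0) / (1 - e)
    return np.mean(aug1 - aug0)


def demo(n=100_000, seed=0):
    """Hypothetical simulation in which both working models are misspecified."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)
    # Assumed true PS: logistic with a quadratic term in x.
    e_true = 1 / (1 + np.exp(-(0.5 * x + 0.3 * x**2 - 0.3)))
    t = rng.binomial(1, e_true)
    # Assumed true outcome model: quadratic in x, true average causal effect = 2.
    y = 1 + 2 * t + x + 0.5 * x**2 + rng.normal(size=n)

    # Misspecified PS: omits the quadratic term (coefficients fixed by hand
    # for simplicity, not estimated from the data).
    e_star = 1 / (1 + np.exp(-0.5 * x))

    # Misspecified outcome regressions: linear in x, least squares per arm.
    design = np.column_stack([np.ones(n), x])
    beta1 = np.linalg.lstsq(design[t == 1], y[t == 1], rcond=None)[0]
    beta0 = np.linalg.lstsq(design[t == 0], y[t == 0], rcond=None)[0]
    mu1_star, mu0_star = design @ beta1, design @ beta0

    return {
        "ipw_simple": ipw_simple(y, t, e_star),
        "ipw_normalized": ipw_normalized(y, t, e_star),
        "aipw": aipw(y, t, e_star, mu1_star, mu0_star),
    }


if __name__ == "__main__":
    for name, estimate in demo().items():
        print(f"{name}: {estimate:.3f} (true effect in this simulation: 2)")
```

One property visible directly in the code is that `ipw_normalized` is unchanged when a constant is added to every outcome, whereas `ipw_simple` is not unless the weights happen to average to one. This is in line with the abstract's remark that normalization offers some protection when the PS model is wrong.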

Keywords: average causal effects; comparing biases; outcome model; propensity score.

Figures

**FIGURE 1**
Illustration of the components of the biases using the data‐generating process from Example 2. Top left: $e (X)$ and $e^{*} (X)$ by X; top right: $e (X) / e^{*} (X)$ by X; bottom left: $μ_{1} (X)$ and $μ_{1}^{*} (X)$ by X; and bottom right: $μ_{1} (X)$ and $μ_{1}^{*} (X)$ by $e (X) / e^{*} (X)$

**FIGURE 2**
Illustration of the bias reduction in ${\hat{Δ}}_{{IPW}_{2}}^{*}$ of the means $E [e (X) / e^{*} (X)]$ and $E [(1 - e (X)) / (1 - e^{*} (X))]$ of the PS errors in Designs A, Simulation 1 (top), 2 (middle), and 3 (bottom)

**FIGURE 3**
Overlap plots for the propensity score distributions, $\hat{e} (X)$ and ${\hat{e}}^{*} (X)$ for treated and controls for Design A (top), B (middle), and C (bottom) in Simulation 1

**FIGURE 4**
Overlap plots for the propensity score distributions, $\hat{e} (X)$ and ${\hat{e}}^{*} (X)$ for treated and controls for Design A (top), B (middle), and C (bottom) in Simulation 2

**FIGURE 5**
Overlap plots for the propensity score distributions, $\hat{e} (X)$ and ${\hat{e}}^{*} (X)$ for treated and controls for Design A (good overlap), B (moderate overlap), and C (poor overlap) in Simulation 3
