Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep;14(3):805-828.
doi: 10.1214/18-ba1131. Epub 2019 Jun 11.

High-Dimensional Confounding Adjustment Using Continuous Spike and Slab Priors

Affiliations

High-Dimensional Confounding Adjustment Using Continuous Spike and Slab Priors

Joseph Antonelli et al. Bayesian Anal. 2019 Sep.

Abstract

In observational studies, estimation of a causal effect of a treatment on an outcome relies on proper adjustment for confounding. If the number of the potential confounders (p) is larger than the number of observations (n), then direct control for all potential confounders is infeasible. Existing approaches for dimension reduction and penalization are generally aimed at predicting the outcome, and are less suited for estimation of causal effects. Under standard penalization approaches (e.g. Lasso), if a variable Xj is strongly associated with the treatment T but weakly with the outcome Y, the coefficient βj will be shrunk towards zero thus leading to confounding bias. Under the assumption of a linear model for the outcome and sparsity, we propose continuous spike and slab priors on the regression coefficients βj corresponding to the potential confounders Xj . Specifically, we introduce a prior distribution that does not heavily shrink to zero the coefficients (βj s) of the Xj s that are strongly associated with T but weakly associated with Y. We compare our proposed approach to several state of the art methods proposed in the literature. Our proposed approach has the following features: 1) it reduces confounding bias in high dimensional settings; 2) it shrinks towards zero coefficients of instrumental variables; and 3) it achieves good coverages even in small sample sizes. We apply our approach to the National Health and Nutrition Examination Survey (NHANES) data to estimate the causal effects of persistent pesticide exposure on triglyceride levels.

Keywords: bayesian variable selection; causal inference; high-dimensional data; shrinkage priors.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
The left panel shows pθ(βj) for a variety of values of βj as a function of wj. The right panel shows Δj/n as a function of wj. Here we fixed λ1 = 0.1, λ0 = 30, and θ = 0.05.
Figure 2:
Figure 2:
Posterior inclusion probabilities from the homogeneous model for simulations in Sections 4.1 and 4.2.
Figure 3:
Figure 3:
Results from analysis of volatile compounds on triglycerides. The upper panel shows the results using the full, n = 177, sample. The lower panel shows the results for just the n = 77 subjects who are over 40 years old.
Figure 4:
Figure 4:
The left panel shows a histogram of the ratios of standard errors for the EM-SSL approach and the double post selection approach for the analysis of volatile compounds in the full data. The right panel shows the corresponding histogram for the analysis of subjects over the age of 40. The dashed vertical line is the mean of the ratios that make up the histogram.

References

    1. Albert JH and Chib S (1993). “Bayesian analysis of binary and polychotomous response data.” Journal of the American statistical Association, 88(422): 669–679. MR1224394. 4
    1. Antonelli J, Cefalu M, Palmer N, and Agniel D (2018). “Doubly robust matching estimators for high dimensional confounding adjustment.” Biometrics. 3, 19 - PMC - PubMed
    1. Antonelli J, Parmigiani G, and Dominici F (2018). “Supplementary materials for “High-dimensional confounding adjustment using continuous spike and slab priors”.” Bayesian Analysis. doi: 10.1214/18-BA1131SUPP 5 - DOI - PMC - PubMed
    1. Antonelli J, Zigler C, and Dominici F (2017). “Guided Bayesian imputation to adjust for confounding when combining heterogeneous data sources in comparative effectiveness research.” Biostatistics, 18(3): 553–568. MR3799594. doi: 10.1093/biostatistics/kxx003. 2 - DOI - PMC - PubMed
    1. Athey S, Imbens GW, and Wager S (2016). “Approximate residual balancing: De-biased inference of average treatment effects in high dimensions.” arXiv preprint arXiv:1604.07125. 3, 10, 14, 19

LinkOut - more resources