Combining observational and experimental datasets using shrinkage estimators
- PMID: 36629736
- DOI: 10.1111/biom.13827
Combining observational and experimental datasets using shrinkage estimators
Abstract
We consider the problem of combining data from observational and experimental sources to draw causal conclusions. To derive combined estimators with desirable properties, we extend results from the Stein shrinkage literature. Our contributions are threefold. First, we propose a generic procedure for deriving shrinkage estimators in this setting, making use of a generalized unbiased risk estimate. Second, we develop two new estimators, prove finite sample conditions under which they have lower risk than an estimator using only experimental data, and show that each achieves a notion of asymptotic optimality. Third, we draw connections between our approach and results in sensitivity analysis, including proposing a method for evaluating the feasibility of our estimators.
Keywords: causal inference; data fusion; sensitivity analysis; shrinkage.
© 2023 The International Biometric Society.
References
REFERENCES
-
- Armstrong, T.B. & Kolesár, M. (2018) Optimal inference in a class of regression models. Econometrica, 86(2), 655-683.
-
- Armstrong, T.B., Kolesár, M. & Plagborg-Møller, M. (2022) Robust empirical bayes confidence intervals. Econometrica, 90(6), 2567-2602.
-
- Athey, S., Chetty, R., Imbens, G.W. & Kang, H. (2019) The surrogate index: combining short-term proxies to estimate long-term treatment effects more rapidly and precisely. Technical Report. National Bureau of Economic Research.
-
- Baranchik, A. (1964) Multiple regression and estimation of the mean of a multivariate normal distribution. Technical Report. Stanford University, Stanford, CA.
-
- Bareinboim, E. & Pearl, J. (2016) Causal inference and the data-fusion problem. Proceedings of the National Academy of Sciences, 113(27), 7345-7352.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
