Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec;79(4):2961-2973.
doi: 10.1111/biom.13827. Epub 2023 Jan 27.

Combining observational and experimental datasets using shrinkage estimators

Affiliations

Combining observational and experimental datasets using shrinkage estimators

Evan T R Rosenman et al. Biometrics. 2023 Dec.

Abstract

We consider the problem of combining data from observational and experimental sources to draw causal conclusions. To derive combined estimators with desirable properties, we extend results from the Stein shrinkage literature. Our contributions are threefold. First, we propose a generic procedure for deriving shrinkage estimators in this setting, making use of a generalized unbiased risk estimate. Second, we develop two new estimators, prove finite sample conditions under which they have lower risk than an estimator using only experimental data, and show that each achieves a notion of asymptotic optimality. Third, we draw connections between our approach and results in sensitivity analysis, including proposing a method for evaluating the feasibility of our estimators.

Keywords: causal inference; data fusion; sensitivity analysis; shrinkage.

PubMed Disclaimer

References

REFERENCES

    1. Armstrong, T.B. & Kolesár, M. (2018) Optimal inference in a class of regression models. Econometrica, 86(2), 655-683.
    1. Armstrong, T.B., Kolesár, M. & Plagborg-Møller, M. (2022) Robust empirical bayes confidence intervals. Econometrica, 90(6), 2567-2602.
    1. Athey, S., Chetty, R., Imbens, G.W. & Kang, H. (2019) The surrogate index: combining short-term proxies to estimate long-term treatment effects more rapidly and precisely. Technical Report. National Bureau of Economic Research.
    1. Baranchik, A. (1964) Multiple regression and estimation of the mean of a multivariate normal distribution. Technical Report. Stanford University, Stanford, CA.
    1. Bareinboim, E. & Pearl, J. (2016) Causal inference and the data-fusion problem. Proceedings of the National Academy of Sciences, 113(27), 7345-7352.

Publication types

LinkOut - more resources