Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available
- PMID: 26994063
- PMCID: PMC4832051
- DOI: 10.1093/aje/kwv254
Abstract
Ideally, questions about comparative effectiveness or safety would be answered using an appropriately designed and conducted randomized experiment. When we cannot conduct a randomized experiment, we analyze observational data. Causal inference from large observational databases (big data) can be viewed as an attempt to emulate a randomized experiment (the target experiment or target trial) that would answer the question of interest. When the goal is to guide decisions among several strategies, causal analyses of observational data need to be evaluated with respect to how well they emulate a particular target trial. We outline a framework for comparative effectiveness research using big data that makes the target trial explicit. This framework channels counterfactual theory for comparing the effects of sustained treatment strategies, organizes analytic approaches, provides a structured process for the criticism of observational studies, and helps avoid common methodologic pitfalls.
Keywords: big data; causal inference; comparative effectiveness research; target trial.
© The Author 2016. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.