Thirteen Questions About Using Machine Learning in Causal Research (You Won't Believe the Answer to Number 10!)
- PMID: 33751024
- PMCID: PMC8555423
- DOI: 10.1093/aje/kwab047
Thirteen Questions About Using Machine Learning in Causal Research (You Won't Believe the Answer to Number 10!)
Abstract
Machine learning is gaining prominence in the health sciences, where much of its use has focused on data-driven prediction. However, machine learning can also be embedded within causal analyses, potentially reducing biases arising from model misspecification. Using a question-and-answer format, we provide an introduction and orientation for epidemiologists interested in using machine learning but concerned about potential bias or loss of rigor due to use of "black box" models. We conclude with sample software code that may lower the barrier to entry to using these techniques.
Keywords: causal inference; double-robustness; epidemiologic methods; inverse probability weighting; machine learning; propensity score; targeted maximum likelihood estimation.
© The Author(s) 2021. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Comment in
-
Invited Commentary: Machine Learning in Causal Inference-How Do I Love Thee? Let Me Count the Ways.Am J Epidemiol. 2021 Aug 1;190(8):1483-1487. doi: 10.1093/aje/kwab048. Am J Epidemiol. 2021. PMID: 33751059
References
-
- Friedman J, Hastie T, Tibshirani R. The Elements of Statistical Learning. New York, NY: Springer Publishing Company; 2001.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
