Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences
- PMID: 36260666
- PMCID: PMC9581488
- DOI: 10.1126/sciadv.abk1942
Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences
Abstract
Machine learning (ML) methodology used in the social and health sciences needs to fit the intended research purposes of description, prediction, or causal inference. This paper provides a comprehensive, systematic meta-mapping of research questions in the social and health sciences to appropriate ML approaches by incorporating the necessary requirements to statistical analysis in these disciplines. We map the established classification into description, prediction, counterfactual prediction, and causal structural learning to common research goals, such as estimating prevalence of adverse social or health outcomes, predicting the risk of an event, and identifying risk factors or causes of adverse outcomes, and explain common ML performance metrics. Such mapping may help to fully exploit the benefits of ML while considering domain-specific aspects relevant to the social and health sciences and hopefully contribute to the acceleration of the uptake of ML applications to advance both basic and applied social and health sciences research.
Figures


References
-
- Wiemken T. L., Kelley R. R., Machine learning in epidemiology and health outcomes research. Ann. Rev. Public Health 41, 21–36 (2020). - PubMed
-
- Varian H. R., Big data: New tricks for econometrics. J. Econ. Perspect. 28, 3–28 (2014).
-
- J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning (Springer Series in Statistics, Springer, 2001), vol. 1.
-
- G. James, D. Witten, T. Hastie, R. Tibshirani, An Introduction to Statistical Learning (Springer, 2013), vol. 112.
Publication types
LinkOut - more resources
Full Text Sources