Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct 21;8(42):eabk1942.
doi: 10.1126/sciadv.abk1942. Epub 2022 Oct 19.

Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences

Affiliations
Review

Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences

Anja K Leist et al. Sci Adv. .

Abstract

Machine learning (ML) methodology used in the social and health sciences needs to fit the intended research purposes of description, prediction, or causal inference. This paper provides a comprehensive, systematic meta-mapping of research questions in the social and health sciences to appropriate ML approaches by incorporating the necessary requirements to statistical analysis in these disciplines. We map the established classification into description, prediction, counterfactual prediction, and causal structural learning to common research goals, such as estimating prevalence of adverse social or health outcomes, predicting the risk of an event, and identifying risk factors or causes of adverse outcomes, and explain common ML performance metrics. Such mapping may help to fully exploit the benefits of ML while considering domain-specific aspects relevant to the social and health sciences and hopefully contribute to the acceleration of the uptake of ML applications to advance both basic and applied social and health sciences research.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.. Typical relationship between model error and complexity.
Copyright by Sara Wade.
Fig. 2.
Fig. 2.. ML methods for prediction most relevant in the social and health sciences with nontechnical description ranked by interpretability/explainability versus complexity.
Note that classes of methods are represented as larger circles; specific ML methods are represented as small circles within. Ordering and selection of ML methods based on theoretical considerations and experience. ANN, artificial neural network. Copyright by Matthias Klee.

References

    1. Wiemken T. L., Kelley R. R., Machine learning in epidemiology and health outcomes research. Ann. Rev. Public Health 41, 21–36 (2020). - PubMed
    1. Yarkoni T., Westfall J., Choosing prediction over explanation in psychology: Lessons from machine learning. Perspect. Psychol. Sci. 12, 1100–1122 (2017). - PMC - PubMed
    1. Varian H. R., Big data: New tricks for econometrics. J. Econ. Perspect. 28, 3–28 (2014).
    1. J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning (Springer Series in Statistics, Springer, 2001), vol. 1.
    1. G. James, D. Witten, T. Hastie, R. Tibshirani, An Introduction to Statistical Learning (Springer, 2013), vol. 112.