Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences

Anja K Leist¹, Matthias Klee¹, Jung Hyun Kim¹, David H Rehkopf², Stéphane P A Bordas³, Graciela Muniz-Terrera⁴,⁵, Sara Wade⁶

Affiliations

¹ Department of Social Sciences, Institute for Research on Socio-Economic Inequality (IRSEI), University of Luxembourg, Esch-sur-Alzette, Luxembourg.
² Department of Epidemiology and Population Health, Stanford University, Palo Alto, CA, USA.
³ Department of Engineering, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
⁴ Centre for Dementia Prevention, University of Edinburgh, Edinburgh, UK.
⁵ Ohio University, Athens, OH, USA.
⁶ School of Mathematics, University of Edinburgh, Edinburgh, UK.

PMID: 36260666
PMCID: PMC9581488
DOI: 10.1126/sciadv.abk1942

Review

Anja K Leist et al. Sci Adv. 2022 Oct 21;8(42):eabk1942. doi: 10.1126/sciadv.abk1942. Epub 2022 Oct 19.

Abstract

Machine learning (ML) methodology used in the social and health sciences needs to fit the intended research purpose: description, prediction, or causal inference. This paper provides a comprehensive, systematic meta-mapping of research questions in the social and health sciences to appropriate ML approaches, incorporating the requirements for statistical analysis in these disciplines. We map the established classification into description, prediction, counterfactual prediction, and causal structural learning to common research goals, such as estimating the prevalence of adverse social or health outcomes, predicting the risk of an event, and identifying risk factors or causes of adverse outcomes, and we explain common ML performance metrics. Such a mapping may help to fully exploit the benefits of ML while considering domain-specific aspects relevant to the social and health sciences, and may accelerate the uptake of ML applications in both basic and applied social and health sciences research.

Figures

**Fig. 1. Typical relationship between model error and complexity.**
Copyright by Sara Wade.
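
The figure image itself is not reproduced on this page. As a rough illustration of the kind of relationship the caption describes, the following sketch fits polynomials of increasing degree to simulated noisy data and reports training and held-out error. Everything in it is an assumption made for illustration (the sine-shaped signal, the noise level, the sample sizes, and the hypothetical function name `error_vs_complexity`); it is not the authors' figure, data, or method.

```python
import numpy as np
from numpy.polynomial import Polynomial


def error_vs_complexity(max_degree=9, n_train=30, n_test=200, noise=0.3, seed=0):
    """Return (degree, train_mse, test_mse) for polynomial fits of growing degree.

    Simulated data only: y = sin(2*pi*x) + Gaussian noise (an assumed toy setup).
    """
    rng = np.random.default_rng(seed)

    def signal(x):
        return np.sin(2 * np.pi * x)

    x_train = rng.uniform(0.0, 1.0, n_train)
    x_test = rng.uniform(0.0, 1.0, n_test)
    y_train = signal(x_train) + rng.normal(0.0, noise, n_train)
    y_test = signal(x_test) + rng.normal(0.0, noise, n_test)

    rows = []
    for degree in range(1, max_degree + 1):
        # Polynomial degree stands in for "model complexity" here.
        model = Polynomial.fit(x_train, y_train, deg=degree)
        train_mse = float(np.mean((model(x_train) - y_train) ** 2))
        test_mse = float(np.mean((model(x_test) - y_test) ** 2))
        rows.append((degree, train_mse, test_mse))
    return rows


results = error_vs_complexity()
for degree, train_mse, test_mse in results:
    print(f"degree={degree:2d}  train MSE={train_mse:.3f}  test MSE={test_mse:.3f}")
```

In a setup like this, training error tends to fall as the degree grows, while held-out error typically stops improving and may rise once the model starts fitting noise. The exact numbers depend on the simulated data and the random seed.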

**Fig. 2. ML methods for prediction most relevant in the social and health sciences with nontechnical description ranked by interpretability/explainability versus complexity.**
Note that classes of methods are represented as larger circles; specific ML methods are represented as small circles within. Ordering and selection of ML methods based on theoretical considerations and experience. ANN, artificial neural network. Copyright by Matthias Klee.
