Recommendations for Reporting Machine Learning Analyses in Clinical Research
- PMID: 33079589
- PMCID: PMC8320533
- DOI: 10.1161/CIRCOUTCOMES.120.006556
Recommendations for Reporting Machine Learning Analyses in Clinical Research
Abstract
Use of machine learning (ML) in clinical research is growing steadily given the increasing availability of complex clinical data sets. ML presents important advantages in terms of predictive performance and identifying undiscovered subpopulations of patients with specific physiology and prognoses. Despite this popularity, many clinicians and researchers are not yet familiar with evaluating and interpreting ML analyses. Consequently, readers and peer-reviewers alike may either overestimate or underestimate the validity and credibility of an ML-based model. Conversely, ML experts without clinical experience may present details of the analysis that are too granular for a clinical readership to assess. Overwhelming evidence has shown poor reproducibility and reporting of ML models in clinical research suggesting the need for ML analyses to be presented in a clear, concise, and comprehensible manner to facilitate understanding and critical evaluation. We present a recommendation for transparent and structured reporting of ML analysis results specifically directed at clinical researchers. Furthermore, we provide a list of key reporting elements with examples that can be used as a template when preparing and submitting ML-based manuscripts for the same audience.
Keywords: bioinformatics; machine learning; prognosis; report; reproducibility; research.
Conflict of interest statement
Disclosure: The authors have no conflicts to declare.
Figures





Comment in
-
Machine Learning in Clinical Journals: Moving From Inscrutable to Informative.Circ Cardiovasc Qual Outcomes. 2020 Oct;13(10):e007491. doi: 10.1161/CIRCOUTCOMES.120.007491. Epub 2020 Oct 14. Circ Cardiovasc Qual Outcomes. 2020. PMID: 33079583 Free PMC article. No abstract available.
References
-
- González-Beltrán A, Li P, Zhao J, Avila-Garcia MS, Roos M, Thompson M, van der Horst E, Kaliyaperumal R, Luo R, Lee T-L, Lam T, Edmunds SC, Sansone S-A, Rocca-Serra P. From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics. PLoS One. 2015;10:e0127612. - PMC - PubMed
-
- Jordan MI, Mitchell TM. Machine learning: Trends, perspectives, and prospects. Science. 2015;349:255–260. - PubMed
-
- Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition. New York, NY: Springer; 2009.
-
- Vandenbroucke JP, von Elm E, Altman DG, Gøtzsche PC, Mulrow CD, Pocock SJ, Poole C, Schlesselman JJ, Egger M, Blettner M, Boffetta P, Brenner H, Chêne G, Cooper C, Davey-Smith G, Gagnon F, Greenland P, Greenland S, Infante-Rivard C, Ioannidis J, James A, Jones G, Ledergerber B, Little J, May M, Moher D, Momen H, Morabia A, Morgenstern H, Paccaud F, Röösli M, Rothenbacher D, Rothman K, Sabin C, Sauerbrei W, Say L, Sterne J, Syddall H, White I, Wieland S, Williams H, Zou GY. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and elaboration. Int J Surg. 2014;12:1500–1524. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources