A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
- PMID: 30763612
- DOI: 10.1016/j.jclinepi.2019.02.004
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
Abstract
Objectives: The objective of this study was to compare performance of logistic regression (LR) with machine learning (ML) for clinical prediction modeling in the literature.
Study design and setting: We conducted a Medline literature search (1/2016 to 8/2017) and extracted comparisons between LR and ML models for binary outcomes.
Results: We included 71 of 927 studies. The median sample size was 1,250 (range 72-3,994,872), with 19 predictors considered (range 5-563) and eight events per predictor (range 0.3-6,697). The most common ML methods were classification trees, random forests, artificial neural networks, and support vector machines. In 48 (68%) studies, we observed potential bias in the validation procedures. Sixty-four (90%) studies used the area under the receiver operating characteristic curve (AUC) to assess discrimination. Calibration was not addressed in 56 (79%) studies. We identified 282 comparisons between an LR and ML model (AUC range, 0.52-0.99). For 145 comparisons at low risk of bias, the difference in logit(AUC) between LR and ML was 0.00 (95% confidence interval, -0.18 to 0.18). For 137 comparisons at high risk of bias, logit(AUC) was 0.34 (0.20-0.47) higher for ML.
Conclusion: We found no evidence of superior performance of ML over LR. Improvements in methodology and reporting are needed for studies that compare modeling algorithms.
Keywords: AUC; Calibration; Clinical prediction models; Logistic regression; Machine learning; Reporting.
Copyright © 2019 Elsevier Inc. All rights reserved.
Comment in
-
Statistics versus machine learning: definitions are interesting (but understanding, methodology, and reporting are more important).J Clin Epidemiol. 2019 Dec;116:137-138. doi: 10.1016/j.jclinepi.2019.08.002. Epub 2019 Aug 16. J Clin Epidemiol. 2019. PMID: 31425736 No abstract available.
-
Statistical thinking, machine learning.J Clin Epidemiol. 2019 Dec;116:136-137. doi: 10.1016/j.jclinepi.2019.08.003. Epub 2019 Aug 16. J Clin Epidemiol. 2019. PMID: 31425737 No abstract available.
Similar articles
-
Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis.JMIR Med Inform. 2020 Nov 17;8(11):e16503. doi: 10.2196/16503. JMIR Med Inform. 2020. PMID: 33200995 Free PMC article. Review.
-
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024. PMID: 38813626 Chinese.
-
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024. Front Cardiovasc Med. 2025. PMID: 39850379 Free PMC article.
-
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13. Med Phys. 2018. PMID: 29763967 Free PMC article.
-
Machine learning applications to clinical decision support in neurosurgery: an artificial intelligence augmented systematic review.Neurosurg Rev. 2020 Oct;43(5):1235-1253. doi: 10.1007/s10143-019-01163-8. Epub 2019 Aug 17. Neurosurg Rev. 2020. PMID: 31422572
Cited by
-
Pan-Cancer Transcriptional Models Predicting Chemosensitivity in Human Tumors.Cancer Inform. 2021 Mar 19;20:11769351211002494. doi: 10.1177/11769351211002494. eCollection 2021. Cancer Inform. 2021. PMID: 33795931 Free PMC article.
-
Translating Data Analytics Into Improved Spine Surgery Outcomes: A Roadmap for Biomedical Informatics Research in 2021.Global Spine J. 2022 Jun;12(5):952-963. doi: 10.1177/21925682211008424. Epub 2021 May 11. Global Spine J. 2022. PMID: 33973491 Free PMC article.
-
Predicting Overweight and Obesity Status Among Malaysian Working Adults With Machine Learning or Logistic Regression: Retrospective Comparison Study.JMIR Form Res. 2022 Dec 7;6(12):e40404. doi: 10.2196/40404. JMIR Form Res. 2022. PMID: 36476813 Free PMC article.
-
Good times bad times: Automated forecasting of seasonal cryptosporidiosis in Ontario using machine learning.Can Commun Dis Rep. 2020 Jun 4;46(6):192-197. doi: 10.14745/ccdr.v46i06a07. eCollection 2020 Jun 4. Can Commun Dis Rep. 2020. PMID: 32673377 Free PMC article.
-
Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets.Health Econ Rev. 2023 Feb 4;13(1):9. doi: 10.1186/s13561-023-00422-1. Health Econ Rev. 2023. PMID: 36738348 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical