A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models

Evangelia Christodoulou¹, Jie Ma², Gary S Collins³, Ewout W Steyerberg⁴, Jan Y Verbakel⁵, Ben Van Calster⁶

Affiliations

¹ Department of Development & Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000 Belgium.
² Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Windmill Road, Oxford, OX3 7LD UK.
³ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Windmill Road, Oxford, OX3 7LD UK; Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
⁴ Department of Biomedical Data Sciences, Leiden University Medical Centre, Albinusdreef 2, Leiden, 2333 ZA The Netherlands.
⁵ Department of Development & Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000 Belgium; Department of Public Health & Primary Care, KU Leuven, Kapucijnenvoer 33J box 7001, Leuven, 3000 Belgium; Nuffield Department of Primary Care Health Sciences, University of Oxford, Woodstock Road, Oxford, OX2 6GG UK.
⁶ Department of Development & Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000 Belgium; Department of Biomedical Data Sciences, Leiden University Medical Centre, Albinusdreef 2, Leiden, 2333 ZA The Netherlands. Electronic address: ben.vancalster@kuleuven.be.

PMID: 30763612
DOI: 10.1016/j.jclinepi.2019.02.004

Meta-Analysis

Evangelia Christodoulou et al. J Clin Epidemiol. 2019 Jun;110:12-22. doi: 10.1016/j.jclinepi.2019.02.004. Epub 2019 Feb 11.

Abstract

Objectives: The objective of this study was to compare performance of logistic regression (LR) with machine learning (ML) for clinical prediction modeling in the literature.

Study design and setting: We conducted a Medline literature search (1/2016 to 8/2017) and extracted comparisons between LR and ML models for binary outcomes.

Results: We included 71 of 927 studies. The median sample size was 1,250 (range 72-3,994,872), with 19 predictors considered (range 5-563) and eight events per predictor (range 0.3-6,697). The most common ML methods were classification trees, random forests, artificial neural networks, and support vector machines. In 48 (68%) studies, we observed potential bias in the validation procedures. Sixty-four (90%) studies used the area under the receiver operating characteristic curve (AUC) to assess discrimination. Calibration was not addressed in 56 (79%) studies. We identified 282 comparisons between an LR and ML model (AUC range, 0.52-0.99). For 145 comparisons at low risk of bias, the difference in logit(AUC) between LR and ML was 0.00 (95% confidence interval, -0.18 to 0.18). For 137 comparisons at high risk of bias, logit(AUC) was 0.34 (0.20-0.47) higher for ML.
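
To give a feel for the logit(AUC) scale used in the results above, the following minimal Python sketch converts a difference in logit(AUC) back to the AUC scale. The baseline AUC values and the helper names (`logit`, `expit`, `shift_auc`) are illustrative assumptions, not taken from the review, and the sketch does not reproduce the review's pooling procedure; it only shows the arithmetic of the transformation.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a proportion p, with 0 < p < 1."""
    return math.log(p / (1.0 - p))


def expit(x: float) -> float:
    """Inverse of logit: maps a log-odds value back to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def shift_auc(auc: float, logit_diff: float) -> float:
    """AUC obtained after adding logit_diff to logit(auc)."""
    return expit(logit(auc) + logit_diff)


if __name__ == "__main__":
    # Hypothetical baseline AUCs for an LR model (illustrative only).
    for baseline in (0.60, 0.75, 0.90):
        # 0.34 is the logit(AUC) difference reported for comparisons
        # at high risk of bias; 0.00 is the low-risk-of-bias estimate.
        high_bias = shift_auc(baseline, 0.34)
        low_bias = shift_auc(baseline, 0.00)
        print(f"LR AUC {baseline:.2f}: +0.34 -> {high_bias:.3f}, +0.00 -> {low_bias:.3f}")
```

Under these assumptions, a logit(AUC) difference of 0.34 applied to a baseline AUC of 0.75 corresponds to an AUC of roughly 0.81, whereas a difference of 0.00 leaves the AUC unchanged. The same logit difference translates into a smaller absolute AUC gain as the baseline approaches 1.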

Conclusion: We found no evidence of superior performance of ML over LR. Improvements in methodology and reporting are needed for studies that compare modeling algorithms.

Keywords: AUC; Calibration; Clinical prediction models; Logistic regression; Machine learning; Reporting.

Comment in

Statistics versus machine learning: definitions are interesting (but understanding, methodology, and reporting are more important).
Van Calster B, Verbakel JY, Christodoulou E, Steyerberg EW, Collins GS. J Clin Epidemiol. 2019 Dec;116:137-138. doi: 10.1016/j.jclinepi.2019.08.002. Epub 2019 Aug 16. PMID: 31425736. No abstract available.

Statistical thinking, machine learning.
Bian J, Buchan I, Guo Y, Prosperi M. J Clin Epidemiol. 2019 Dec;116:136-137. doi: 10.1016/j.jclinepi.2019.08.003. Epub 2019 Aug 16. PMID: 31425737. No abstract available.
