. 2022 Apr 8;22(1):101.

doi: 10.1186/s12874-022-01577-x.

Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review

Paula Dhiman^{1

2}, Jie Ma³, Constanza L Andaur Navarro^{4

5}, Benjamin Speich^{3

6}, Garrett Bullock⁷, Johanna A A Damen^{4

5}, Lotty Hooft^{4

5}, Shona Kirtley³, Richard D Riley⁸, Ben Van Calster^{9

10

11}, Karel G M Moons^{4

5}, Gary S Collins^{3

12}

Affiliations

¹ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK. paula.dhiman@csm.ox.ac.uk.
² NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK. paula.dhiman@csm.ox.ac.uk.
³ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
⁴ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁵ Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁶ Basel Institute for Clinical Epidemiology and Biostatistics, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland.
⁷ Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford, UK.
⁸ Centre for Prognosis Research, School of Medicine, Keele University, Staffordshire, ST5 5BG, UK.
⁹ Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
¹⁰ Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
¹¹ EPI-centre, KU Leuven, Leuven, Belgium.
¹² NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.

PMID: 35395724
PMCID: PMC8991704
DOI: 10.1186/s12874-022-01577-x

Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review

Paula Dhiman et al. BMC Med Res Methodol. 2022.

. 2022 Apr 8;22(1):101.

doi: 10.1186/s12874-022-01577-x.

Authors

Affiliations

¹ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK. paula.dhiman@csm.ox.ac.uk.
² NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK. paula.dhiman@csm.ox.ac.uk.
³ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
⁴ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁵ Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
⁶ Basel Institute for Clinical Epidemiology and Biostatistics, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland.
⁷ Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford, UK.
⁸ Centre for Prognosis Research, School of Medicine, Keele University, Staffordshire, ST5 5BG, UK.
⁹ Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
¹⁰ Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
¹¹ EPI-centre, KU Leuven, Leuven, Belgium.
¹² NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.

PMID: 35395724
PMCID: PMC8991704
DOI: 10.1186/s12874-022-01577-x

Abstract

Background: Describe and evaluate the methodological conduct of prognostic prediction models developed using machine learning methods in oncology.

Methods: We conducted a systematic review in MEDLINE and Embase between 01/01/2019 and 05/09/2019, for studies developing a prognostic prediction model using machine learning methods in oncology. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement, Prediction model Risk Of Bias ASsessment Tool (PROBAST) and CHecklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies (CHARMS) to assess the methodological conduct of included publications. Results were summarised by modelling type: regression-, non-regression-based and ensemble machine learning models.

Results: Sixty-two publications met inclusion criteria developing 152 models across all publications. Forty-two models were regression-based, 71 were non-regression-based and 39 were ensemble models. A median of 647 individuals (IQR: 203 to 4059) and 195 events (IQR: 38 to 1269) were used for model development, and 553 individuals (IQR: 69 to 3069) and 50 events (IQR: 17.5 to 326.5) for model validation. A higher number of events per predictor was used for developing regression-based models (median: 8, IQR: 7.1 to 23.5), compared to alternative machine learning (median: 3.4, IQR: 1.1 to 19.1) and ensemble models (median: 1.7, IQR: 1.1 to 6). Sample size was rarely justified (n = 5/62; 8%). Some or all continuous predictors were categorised before modelling in 24 studies (39%). 46% (n = 24/62) of models reporting predictor selection before modelling used univariable analyses, and common method across all modelling types. Ten out of 24 models for time-to-event outcomes accounted for censoring (42%). A split sample approach was the most popular method for internal validation (n = 25/62, 40%). Calibration was reported in 11 studies. Less than half of models were reported or made available.

Conclusions: The methodological conduct of machine learning based clinical prediction models is poor. Guidance is urgently needed, with increased awareness and education of minimum prediction modelling standards. Particular focus is needed on sample size estimation, development and validation analysis methods, and ensuring the model is available for independent validation, to improve quality of machine learning based clinical prediction models.

Keywords: Machine learning; Methodology; Prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Fig. 1**
PRISMA flow diagram of studies included in the systematic review

See this image and copyright information in PMC

References

1. Hippisley-Cox J, Coupland C, Brindle P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ. 2017;357:j2099. - PMC - PubMed
1. Pulitanò C, Arru M, Bellio L, Rossini S, Ferla G, Aldrighetti L. A risk score for predicting perioperative blood transfusion in liver surgery. Br J Surg. 2007;94(7):860–865. - PubMed
1. Conroy RM, Pyörälä K, Fitzgerald AP, Sans S, Menotti A, De Backer G, et al. Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project. Eur Heart J. 2003;24(11):987–1003. - PubMed
1. Nashef SAM, Roques F, Sharples LD, Nilsson J, Smith C, Goldstone AR, et al. EuroSCORE II. Eur J Cardiothorac Surg. 2012;41(4):734–745. - PubMed
1. Thamer M, Kaufman JS, Zhang Y, Zhang Q, Cotter DJ, Bang H. Predicting early death among elderly dialysis patients: development and validation of a risk score to assist shared decision making for dialysis initiation. Am J Kidney Dis. 2015;66(6):1024–1032. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review

Affiliations

Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous