J Clin Epidemiol. 2018 Jun:98:133-143.

doi: 10.1016/j.jclinepi.2017.11.013. Epub 2017 Nov 24.

Poor performance of clinical prediction models: the harm of commonly applied methods

Ewout W Steyerberg¹, Hajime Uno², John P A Ioannidis³, Ben van Calster⁴; Collaborators

Collaborators:
Chinedu Ukaegbu², Tara Dhingra², Sapna Syngal², Fay Kastrinos⁵

Affiliations

¹ Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands; Department of Public Health, Erasmus MC, Rotterdam, The Netherlands. Electronic address: e.w.steyerberg@lumc.nl.
² Division of Population Sciences, Dana-Farber Cancer Institute, Boston, MA 02215, USA.
³ Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA; Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA, USA; Department of Statistics, Stanford University School of Humanities and Sciences, Stanford, CA, USA; Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, USA.
⁴ Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands; Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
⁵ Herbert Irving Comprehensive Cancer Center and Division of Digestive and Liver Diseases, Columbia University Medical Center, New York, NY, USA.

PMID: 29174118
DOI: 10.1016/j.jclinepi.2017.11.013

Ewout W Steyerberg et al. J Clin Epidemiol. 2018 Jun.

Abstract

Objective: To evaluate limitations of common statistical modeling approaches in deriving clinical prediction models and explore alternative strategies.

Study design and setting: A previously published model predicted the likelihood of having a mutation in germline DNA mismatch repair genes at the time of diagnosis of colorectal cancer. This model was based on a cohort where 38 mutations were found among 870 participants, with validation in an independent cohort with 35 mutations. The modeling strategy included stepwise selection of predictors from a pool of over 37 candidate predictors and dichotomization of continuous predictors. We simulated this strategy in small subsets of a large contemporary cohort (2,051 mutations among 19,866 participants) and made comparisons to other modeling approaches. All models were evaluated according to bias and discriminative ability (concordance index, c) in independent data.
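The concordance index (c) used to evaluate the models can be illustrated for a binary outcome such as mutation carrier status. The following is a minimal sketch, not the authors' code: the function name `concordance_index` and the toy data are hypothetical, and ties in predicted risk are assumed to count as one half.

```python
from itertools import product


def concordance_index(y_true, y_score):
    """Concordance index for a binary outcome.

    Fraction of (event, non-event) pairs in which the event received the
    higher predicted risk. Tied predictions count as 1/2 (an assumption
    of this sketch).
    """
    events = [s for y, s in zip(y_true, y_score) if y == 1]
    nonevents = [s for y, s in zip(y_true, y_score) if y == 0]
    if not events or not nonevents:
        raise ValueError("need at least one event and one non-event")

    concordant = 0.0
    for e, n in product(events, nonevents):
        if e > n:
            concordant += 1.0
        elif e == n:
            concordant += 0.5
    return concordant / (len(events) * len(nonevents))


# Toy data (hypothetical): 1 = mutation carrier, 0 = non-carrier.
outcomes = [1, 0, 1, 0]
predicted_risk = [0.9, 0.1, 0.4, 0.6]
print(concordance_index(outcomes, predicted_risk))
```

With only a handful of events, the number of (event, non-event) pairs is small, which is one way to see why a validation sample with few mutations gives an unstable estimate of c.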

Results: We found over 50% bias for five of six originally selected predictors, unstable model specification, and poor performance at validation (median c = 0.74). A small validation sample hampered stable assessment of performance. Model prespecification based on external knowledge and using continuous predictors led to better performance (c = 0.836 and c = 0.852 with 38 and 2,051 events, respectively).

Conclusion: Prediction models perform poorly if based on small numbers of events and developed with common but suboptimal statistical approaches. Alternative modeling strategies to best exploit available predictive information need wider implementation, with collaborative research to increase sample sizes.
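As a rough illustration of how few events were available, the events-per-variable ratio named in the keywords can be computed from the counts in the abstract. This is a sketch under the assumption that the ratio is the number of events divided by the number of candidate predictors. Because the abstract reports "over 37" candidates, 37 is used as a lower bound, so the resulting ratios are upper bounds. The helper name `events_per_variable` is hypothetical.

```python
def events_per_variable(n_events, n_candidate_predictors):
    """Events per candidate predictor (assumed definition for this sketch)."""
    if n_candidate_predictors <= 0:
        raise ValueError("number of candidate predictors must be positive")
    return n_events / n_candidate_predictors


# Counts taken from the abstract; 37 is a lower bound on the candidate pool.
CANDIDATES = 37
small_cohort = events_per_variable(38, CANDIDATES)    # original development cohort
large_cohort = events_per_variable(2051, CANDIDATES)  # large contemporary cohort

print(f"small cohort: at most {small_cohort:.2f} events per candidate predictor")
print(f"large cohort: at most {large_cohort:.1f} events per candidate predictor")
```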

Keywords: Events per variable; Prediction model; Regression analysis; Sample size; Simulation; Validation.

Cited by

Canadian Anaphylaxis Network-Predicting Recurrence after Emergency Presentation for Allergic REaction (CAN-PREPARE): a prospective, cohort study protocol.
Alqurashi W, Shaker M, Wells GA, Collins GS, Greenhawt M, Curran JA, Zemek R, Schuh S, Ellis A, Gerdts J, Kreviazuk C, Dixon A, Eltorki M, Freedman SB, Gravel J, Poonai N, Worm M, Plint AC. BMJ Open. 2022 Oct 31;12(10):e061976. doi: 10.1136/bmjopen-2022-061976. PMID: 36316072.
Development and validation of a model to predict ceiling of care in COVID-19 hospitalized patients.
Pallarès N, Inouzhe H, Straw S, Safdar N, Fernández D, Cortés J, Rodríguez L, Videla S, Barrio I, Witte KK, Carratalà J, Tebé C; MetroSud; DIVINE study group. BMC Palliat Care. 2024 Jul 16;23(1):173. doi: 10.1186/s12904-024-01490-8. PMID: 39010044.
Evaluating Modeling and Validation Strategies for Tooth Loss.
Krois J, Graetz C, Holtfreter B, Brinkmann P, Kocher T, Schwendicke F. J Dent Res. 2019 Sep;98(10):1088-1095. doi: 10.1177/0022034519864889. Epub 2019 Jul 30. PMID: 31361174.
Assess the Performance and Cost-Effectiveness of LACE and HOSPITAL Re-Admission Prediction Models as a Risk Management Tool for Home Care Patients: An Evaluation Study of a Medical Center Affiliated Home Care Unit in Taiwan.
Su MC, Wang YJ, Chen TJ, Chiu SH, Chang HT, Huang MS, Hu LH, Li CC, Yang SJ, Wu JC, Chen YC. Int J Environ Res Public Health. 2020 Feb 2;17(3):927. doi: 10.3390/ijerph17030927. PMID: 32024309.
Impact of predictor measurement heterogeneity across settings on the performance of prediction models: A measurement error perspective.
Luijken K, Groenwold RHH, Van Calster B, Steyerberg EW, van Smeden M. Stat Med. 2019 Aug 15;38(18):3444-3459. doi: 10.1002/sim.8183. Epub 2019 May 31. PMID: 31148207.
