Traditional statistical methods for evaluating prediction models are uninformative as to clinical value: towards a decision analytic framework
- PMID: 20172362
- PMCID: PMC2857322
- DOI: 10.1053/j.seminoncol.2009.12.004
Traditional statistical methods for evaluating prediction models are uninformative as to clinical value: towards a decision analytic framework
Abstract
Cancer prediction models are becoming ubiquitous, yet we generally have no idea whether they do more good than harm. This is because current statistical methods for evaluating prediction models are uninformative as to their clinical value. Prediction models are typically evaluated in terms of discrimination or calibration. However, it is generally unclear how high discrimination needs to be before it is considered "high enough"; similarly, there are no rational guidelines as to the degree of miscalibration that would discount clinical use of a model. Classification tables do present the results of models in more clinically relevant terms, but it is not always clear which of two models is preferable on the basis of a particular classification table, or even whether either model should be used at all. Recent years have seen the development of straightforward decision analytic techniques that evaluate prediction models in terms of their consequences. This depends on the simple approach of weighting true and false positives differently, to reflect that, for example, delaying the diagnosis of a cancer is more harmful than an unnecessary biopsy. Such decision analytic techniques hold the promise of determining whether clinical implementation of prediction models would do more good than harm.
Copyright 2010 Elsevier Inc. All rights reserved.
Conflict of interest statement
Dr. Vickers and Ms Cronin have no primary financial relationships with any companies directly interested in the subject matter of this manuscript.
Figures



Similar articles
-
Calibration of risk prediction models: impact on decision-analytic performance.Med Decis Making. 2015 Feb;35(2):162-9. doi: 10.1177/0272989X14547233. Epub 2014 Aug 25. Med Decis Making. 2015. PMID: 25155798
-
Decision curve analysis to evaluate the clinical benefit of prediction models.Spine J. 2021 Oct;21(10):1643-1648. doi: 10.1016/j.spinee.2021.02.024. Epub 2021 Mar 3. Spine J. 2021. PMID: 33676020 Free PMC article.
-
Evaluating a new marker for risk prediction: decision analysis to the rescue.Discov Med. 2012 Sep;14(76):181-8. Discov Med. 2012. PMID: 23021372 Review.
-
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247. Zhonghua Jie He He Hu Xi Za Zhi. 2024. PMID: 38309959 Chinese.
-
Statistical evaluation of prognostic versus diagnostic models: beyond the ROC curve.Clin Chem. 2008 Jan;54(1):17-23. doi: 10.1373/clinchem.2007.096529. Epub 2007 Nov 16. Clin Chem. 2008. PMID: 18024533 Review.
Cited by
-
Assessing the Clinical Impact of Risk Prediction Models With Decision Curves: Guidance for Correct Interpretation and Appropriate Use.J Clin Oncol. 2016 Jul 20;34(21):2534-40. doi: 10.1200/JCO.2015.65.5654. Epub 2016 May 31. J Clin Oncol. 2016. PMID: 27247223 Free PMC article.
-
Psychological burden prediction based on demographic variables among infertile men with sexual dysfunction.Asian J Androl. 2019 Mar-Apr;21(2):156-162. doi: 10.4103/aja.aja_86_18. Asian J Androl. 2019. PMID: 30460932 Free PMC article.
-
The Net Reclassification Index (NRI): a Misleading Measure of Prediction Improvement Even with Independent Test Data Sets.Stat Biosci. 2015 Oct 1;7(2):282-295. doi: 10.1007/s12561-014-9118-0. Epub 2014 Aug 23. Stat Biosci. 2015. PMID: 26504496 Free PMC article.
-
Development and External Validation of Prediction Models for 10-Year Survival of Invasive Breast Cancer. Comparison with PREDICT and CancerMath.Clin Cancer Res. 2018 May 1;24(9):2110-2115. doi: 10.1158/1078-0432.CCR-17-3542. Epub 2018 Feb 14. Clin Cancer Res. 2018. PMID: 29444929 Free PMC article.
-
Decision Curve Analysis for Personalized Treatment Choice between Multiple Options.Med Decis Making. 2023 Apr;43(3):337-349. doi: 10.1177/0272989X221143058. Epub 2022 Dec 13. Med Decis Making. 2023. PMID: 36511470 Free PMC article.
References
-
- Murphy NC, Biankin AV, Millar EK, McNeil CM, O’Toole SA, Segara D, et al. Loss of STARD10 expression identifies a group of poor prognosis breast cancers independent of HER2/Neu and triple negative status. Int J Cancer. 2009 - PubMed
-
- Korse CM, Taal BG, de Groot CA, Bakker RH, Bonfrer JM. Chromogranin-A and N-Terminal Pro-Brain Natriuretic Peptide: An Excellent Pair of Biomarkers for Diagnostics in Patients With Neuroendocrine Tumor. J Clin Oncol. 2009 - PubMed
-
- Garcia-Albeniz X, Gallego R. Prognostic role of plasma insulin-like growth factor (IGF) and IGF-binding protein 3 in metastatic colorectal cancer. Clin Cancer Res. 2009;15(16):5288. author reply. - PubMed
-
- Stinchcombe TE, Hodgson L, Herndon JE, 2nd, Kelley MJ, Cicchetti MG, Ramnath N, et al. Treatment outcomes of different prognostic groups of patients on cancer and leukemia group B trial 39801: induction chemotherapy followed by chemoradiotherapy compared with chemoradiotherapy alone for unresectable stage III non-small cell lung cancer. J Thorac Oncol. 2009;4(9):1117–25. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources