Validation of biomarker-based risk prediction models
- PMID: 18829476
- PMCID: PMC3896456
- DOI: 10.1158/1078-0432.CCR-07-4534
Validation of biomarker-based risk prediction models
Abstract
The increasing availability and use of predictive models to facilitate informed decision making highlights the need for careful assessment of the validity of these models. In particular, models involving biomarkers require careful validation for two reasons: issues with overfitting when complex models involve a large number of biomarkers, and interlaboratory variation in assays used to measure biomarkers. In this article, we distinguish between internal and external statistical validation. Internal validation, involving training-testing splits of the available data or cross-validation, is a necessary component of the model building process and can provide valid assessments of model performance. External validation consists of assessing model performance on one or more data sets collected by different investigators from different institutions. External validation is a more rigorous procedure necessary for evaluating whether the predictive model will generalize to populations other than the one on which it was developed. We stress the need for an external data set to be truly external, that is, to play no role in model development and ideally be completely unavailable to the researchers building the model. In addition to reviewing different types of validation, we describe different types and features of predictive models and strategies for model building, as well as measures appropriate for assessing their performance in the context of validation. No single measure can characterize the different components of the prediction, and the use of multiple summary measures is recommended.
Figures

Similar articles
-
Validation of analytic methods for biomarkers used in drug development.Clin Cancer Res. 2008 Oct 1;14(19):5967-76. doi: 10.1158/1078-0432.CCR-07-4535. Clin Cancer Res. 2008. PMID: 18829475 Free PMC article. Review.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
The future of Cochrane Neonatal.Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12. Early Hum Dev. 2020. PMID: 33036834
-
Prognostic models for newly-diagnosed chronic lymphocytic leukaemia in adults: a systematic review and meta-analysis.Cochrane Database Syst Rev. 2020 Jul 31;7(7):CD012022. doi: 10.1002/14651858.CD012022.pub2. Cochrane Database Syst Rev. 2020. PMID: 32735048 Free PMC article.
-
Prediction of response to pemetrexed in non-small-cell lung cancer with immunohistochemical phenotyping based on gene expression profiles.BMC Cancer. 2019 May 14;19(1):440. doi: 10.1186/s12885-019-5645-x. BMC Cancer. 2019. PMID: 31088547 Free PMC article.
Cited by
-
Multivariate morphological brain signatures predict patients with chronic abdominal pain from healthy control subjects.Pain. 2015 Aug;156(8):1545-1554. doi: 10.1097/j.pain.0000000000000196. Pain. 2015. PMID: 25906347 Free PMC article.
-
ENDORSE: a prognostic model for endocrine therapy in estrogen-receptor-positive breast cancers.Mol Syst Biol. 2022 Jun;18(6):e10558. doi: 10.15252/msb.202110558. Mol Syst Biol. 2022. PMID: 35671075 Free PMC article.
-
Do two machine-learning based prognostic signatures for breast cancer capture the same biological processes?PLoS One. 2011 Mar 14;6(3):e17795. doi: 10.1371/journal.pone.0017795. PLoS One. 2011. PMID: 21423753 Free PMC article.
-
Immune monitoring technology primer.J Immunother Cancer. 2015 Oct 20;3:40. doi: 10.1186/s40425-015-0086-9. eCollection 2015. J Immunother Cancer. 2015. PMID: 26500774 Free PMC article.
-
Repeatability of radiomics studies in colorectal cancer: a systematic review.BMC Gastroenterol. 2023 Apr 14;23(1):125. doi: 10.1186/s12876-023-02743-1. BMC Gastroenterol. 2023. PMID: 37059990 Free PMC article.
References
-
- Thompson IM, Ankerst DP, Chi C, et al. Assessing prostate cancer risk: Results from the Prostate Cancer Prevention Trial. J Natl Cancer Inst. 2006;98:529–34. - PubMed
-
- Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989;81:1879–86. - PubMed
-
- Kattan MW, Eastham JA, Stapleton AMF, Wheeler TM, Scardino PT. A preoperative nomogram for disease recurrence following radical prostatectomy for prostate cancer. J Natl Cancer Inst. 1998;90:766–71. - PubMed
-
- Paik S, Shak S, Tang G, et al. A multi-gene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med. 2004;351:2817–26. - PubMed
-
- Skates SJ, Pauler DK, Jacobs IJ. Screening based on the risk of cancer calculation from Bayesian hierarchical change point and mixture models of longitudinal markers. J Am Stat Assoc. 2001;96:429–39.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical