Performance of reclassification statistics in comparing risk prediction models

Nancy R Cook¹, Nina P Paynter

Affiliations

PMID: 21294152
PMCID: PMC3395053
DOI: 10.1002/bimj.201000078

Performance of reclassification statistics in comparing risk prediction models

Nancy R Cook et al. Biom J. 2011 Mar.

. 2011 Mar;53(2):237-58.

doi: 10.1002/bimj.201000078. Epub 2011 Feb 3.

Authors

Nancy R Cook¹, Nina P Paynter

Affiliation

¹ Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA. ncook@rics.bwh.harvard.edu

PMID: 21294152
PMCID: PMC3395053
DOI: 10.1002/bimj.201000078

Abstract

Concerns have been raised about the use of traditional measures of model fit in evaluating risk prediction models for clinical use, and reclassification tables have been suggested as an alternative means of assessing the clinical utility of a model. Several measures based on the table have been proposed, including the reclassification calibration (RC) statistic, the net reclassification improvement (NRI), and the integrated discrimination improvement (IDI), but the performance of these in practical settings has not been fully examined. We used simulations to estimate the type I error and power for these statistics in a number of scenarios, as well as the impact of the number and type of categories, when adding a new marker to an established or reference model. The type I error was found to be reasonable in most settings, and power was highest for the IDI, which was similar to the test of association. The relative power of the RC statistic, a test of calibration, and the NRI, a test of discrimination, varied depending on the model assumptions. These tools provide unique but complementary information.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest

The authors have declared no conflict of interest.

Figures

**Figure 1**
Predictiveness curves for models with and without variable Y (left), high sensitivity C-reactive protein (CRP) (middle), and systolic blood pressure (SBP) (right). Horizontal lines indicate the prevalence of the outcome.

**Figure 2**
Null distribution of reclassification calibration statistics: observed proportions with p-values < 0.10 (top) or <0.05 (bottom) for test of correct model X or XY when ORY=1 and for model XY when ORY=3 with cell size >20 and for average expectation ≥ 5, for P(D)=0.10.

**Figure 3**
Power for measures of model fit, with P(D) = 0.10.

**Figure 4**
Power for IDI (top), NRI (middle) and reclassification calibration test (bottom) by probability of disease P(D) and ORY, using ORX = 8, N=5000, and category cut points of 0.5*P(D). P(D), and 2*P(D).

**Figure 5**
Power for measures of model fit, with ORX = 8, P(D) = 0.10, and varying prevalence (PY) of a binary predictor Y.

See this image and copyright information in PMC

References

1. Adult Treatment Panel III. Executive Summary of The Third Report of The National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, And Treatment of High Blood Cholesterol In Adults (Adult Treatment Panel III) JAMA. 2001;285:2486–2497. - PubMed
1. Ash A, Shwartz M. R2: a useful measure of model performance when predicting a dichotomous outcome. Statistics in Medicine. 1999;18:375–384. - PubMed
1. Baker SG, Cook NR, Vickers A, Kramer BS. Using relative utility curves to evaluate risk prediction. Journal of the Royal Statistics Society, Series A. 2009;172:729–748. - PMC - PubMed
1. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–935. - PubMed
1. Cook NR. Comments on ‘Evaluating the added predictive ability of a new biomarker: from area under the ROC curve to reclassification and beyond’. Statistics in Medicine. 2008;27:191–195. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Performance of reclassification statistics in comparing risk prediction models

Affiliation

Performance of reclassification statistics in comparing risk prediction models

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources