Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Mar;306(3):e222343.
doi: 10.1148/radiol.222343. Epub 2022 Nov 15.

Net Reclassification Index Statistics Do Not Help Assess New Risk Models

Affiliations
Review

Net Reclassification Index Statistics Do Not Help Assess New Risk Models

Kathleen F Kerr. Radiology. 2023 Mar.

Abstract

When evaluating a new risk factor for disease (eg, a measurement from imaging studies), many investigators examine its value above and beyond existing biomarkers and risk factors. They compare the performance of an "old" risk model using established predictors and a "new" risk model that adds the new factor. Net reclassification index (NRI) statistics are a family of metrics for comparing two risk models. NRI statistics became popular in some medical fields and have appeared in high-impact journals. This article reviews NRI statistics and describes several issues with them. Problems include unacceptable statistical behavior, incorrect statistical inferences, and lack of interpretability. NRI statistics are unhelpful (at best) and misleading (at worst).

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: K.F.K. No relevant relationships.

Figures

None
Graphical abstract
Mathematical definition of the net reclassification index (NRI). The NRI
seeks to compare a new risk model to an existing model. “Event”
and “nonevent” refer to patients who would and would not go on to
experience an event without intervention. “Up” refers to patients
who move to a higher risk category with the new risk model. “Down”
refers to patients who move to a lower risk category. The event NRI and nonevent
NRI are defined as shown in the equation. The (overall) NRI is the sum of the
event NRI and nonevent NRI. Note that the NRI combines four proportions but is
not itself a proportion. The quantitative value of an NRI statistic can range
from −2 to 2. P = probability.
Figure 1:
Mathematical definition of the net reclassification index (NRI). The NRI seeks to compare a new risk model to an existing model. “Event” and “nonevent” refer to patients who would and would not go on to experience an event without intervention. “Up” refers to patients who move to a higher risk category with the new risk model. “Down” refers to patients who move to a lower risk category. The event NRI and nonevent NRI are defined as shown in the equation. The (overall) NRI is the sum of the event NRI and nonevent NRI. Note that the NRI combines four proportions but is not itself a proportion. The quantitative value of an NRI statistic can range from −2 to 2. P = probability.
Three-category net reclassification index (NRI) for a sample of 1000
patients from a population with 90% events (patients who would go on to
experience an event without intervention) and 10% nonevents (patients who would
not go on to experience an event without intervention). Overall NRI is higher in
example 2 compared with example 1. Yet, in example 2 more patients have worse
risk classification (orange cells) compared with better risk classification
(green cells) with a switch from the old to the new risk model. In other words,
more patients would be harmed than helped by switching from the old risk model
to the new risk model, even though the NRI statistic is positive. This behavior
arises because NRI ignores the relative proportions of events and nonevents in
the population. In example 3, the percentage of event patients reclassified and
the percentage of nonevent patients reclassified are identical to those in
example 1; the NRI statistics are also identical to those in example 1. However,
compared with example 1, the new risk model in example 3 reclassifies more event
patients from high risk to low risk instead of from high risk to medium risk and
reclassifies more nonevent patients from low risk to high risk instead of from
low risk to medium risk. The NRI statistics are identical for example 1 and
example 3 because the NRI only accounts for the direction of changes and not
their magnitude; thus, the NRI ignores crucial information about the clinical
impact of reclassification.
Figure 2:
Three-category net reclassification index (NRI) for a sample of 1000 patients from a population with 90% events (patients who would go on to experience an event without intervention) and 10% nonevents (patients who would not go on to experience an event without intervention). Overall NRI is higher in example 2 compared with example 1. Yet, in example 2 more patients have worse risk classification (orange cells) compared with better risk classification (green cells) with a switch from the old to the new risk model. In other words, more patients would be harmed than helped by switching from the old risk model to the new risk model, even though the NRI statistic is positive. This behavior arises because NRI ignores the relative proportions of events and nonevents in the population. In example 3, the percentage of event patients reclassified and the percentage of nonevent patients reclassified are identical to those in example 1; the NRI statistics are also identical to those in example 1. However, compared with example 1, the new risk model in example 3 reclassifies more event patients from high risk to low risk instead of from high risk to medium risk and reclassifies more nonevent patients from low risk to high risk instead of from low risk to medium risk. The NRI statistics are identical for example 1 and example 3 because the NRI only accounts for the direction of changes and not their magnitude; thus, the NRI ignores crucial information about the clinical impact of reclassification.

References

    1. American College of Radiology . Breast Imaging Reporting and Data System (BI-RADS) . Reston, Va: : American College of Radiology; , 2013. .
    1. McKearnan SB , Wolfson J , Vock DM , Vazquez-Benitez G , O’Connor PJ . Performance of the Net Reclassification Improvement for Nonnested Models and a Novel Percentile-Based Alternative . Am J Epidemiol 2018. ; 187 ( 6 ): 1327 – 1335 . - PMC - PubMed
    1. Cook NR . Use and misuse of the receiver operating characteristic curve in risk prediction . Circulation 2007. ; 115 ( 7 ): 928 – 935 . - PubMed
    1. Pencina MJ , D’Agostino RB Sr , D’Agostino RB Jr , Vasan RS . Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond . Stat Med 2008. ; 27 ( 2 ): 157 – 172 ; discussion 207–212 . - PubMed
    1. Pencina MJ , Fine JP , D’Agostino RB Sr . Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration . Stat Med 2017. ; 36 ( 28 ): 4482 – 4490 . - PubMed

Publication types

LinkOut - more resources