Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 May 10;62(5):605-611.
doi: 10.2967/jnumed.120.251520. Epub 2021 Feb 12.

Statistical Considerations in the Evaluation of Continuous Biomarkers

Affiliations
Review

Statistical Considerations in the Evaluation of Continuous Biomarkers

Mei-Yin C Polley et al. J Nucl Med. .

Abstract

Discovery of biomarkers has been steadily increasing over the past decade. Although a plethora of biomarkers has been reported in the biomedical literature, few have been sufficiently validated for broader clinical applications. One particular challenge that may have hindered the adoption of biomarkers into practice is the lack of reproducible biomarker cut points. In this article, we attempt to identify some common statistical issues related to biomarker cut point identification and provide guidance on proper evaluation, interpretation, and validation of such cut points. First, we illustrate how discretization of a continuous biomarker using sample percentiles results in significant information loss and should be avoided. Second, we review the popular "minimal-P-value" approach for cut point identification and show that this method results in highly unstable P values and unduly increases the chance of significant findings when the biomarker is not associated with outcome. Third, we critically review a common analysis strategy by which the selected biomarker cut point is used to categorize patients into different risk categories and then the difference in survival curves among these risk groups in the same dataset is claimed as the evidence supporting the biomarker's prognostic strength. We show that this method yields an exaggerated P value and overestimates the prognostic impact of the biomarker. We illustrate that the degree of the optimistic bias increases with the number of variables being considered in a risk model. Finally, we discuss methods to appropriately ascertain the additional prognostic contribution of the new biomarker in disease settings where standard prognostic factors already exist. Throughout the article, we use real examples in oncology to highlight relevant methodologic issues, and when appropriate, we use simulations to illustrate more abstract statistical concepts.

Keywords: area under the ROC curve; biomarker cut point; biomarker discretization; prognostic biomarker; resubstitution statistics; statistics.

PubMed Disclaimer

Figures

FIGURE 1.
FIGURE 1.
Hypothetical relationship between biomarker M and clinical outcome. (A) Green line depicts linear relationship between biomarker and outcome. Risk of outcome increases linearly with increasing biomarker values. Dashed lines illustrate effect of dichotomizing biomarker, assuming that discontinuity in risk occurs at cut point c (patients whose biomarker values are below cut point are conferred the same risk, which is lower by magnitude Δ than that conferred to patients whose biomarker values exceed cut point). c* represents the biomarker value of a patient whose true risk is highest in the high-risk subgroup. (B) Quadratic relationship between biomarker and outcome. Risk of outcome decreases with biomarker up to point m and increases linearly after m.
FIGURE 2.
FIGURE 2.
Dichotomy of continuous biomarker (NLR example). (A) Nonlinear relationship between NLR and patient survival in Mayo Clinic triple-negative breast cancer dataset using restricted spline method. (B) Effect of dichotomizing NLR at its sample median. Association between NLR and survival is no longer significant (log-rank P = 0.27). AIC = Akaike’s information criterion; L.R. = likelihood ratio.
FIGURE 3.
FIGURE 3.
Minimal-P-value approach (NLR example). (A) Highly unstable P values of log-rank test as function of cut point used for NLR in Mayo Clinic triple-negative breast cancer dataset. Top and bottom 20% of NLR values were excluded, and 200 cut points were used. (B) Strong inverse correlation between estimated HRs and log-rank P values for NLR in Mayo Clinic triple-negative breast cancer dataset. Smallest P value corresponds to most extreme HR estimate.
FIGURE 4.
FIGURE 4.
Type I error inflation as function of number of cut points and sample size using minimal-P-value approach. In each simulation, 10% of smallest and largest biomarker values were not considered as potential cut points. Two-sided P value from log-rank test was computed for each cut point applied. Each plotted point represents percentage of 5,000 simulations for which minimal P value is less than nominal 5% level based on assumption that there is no association between biomarker and time-to-event outcome (i.e., type I error). No censoring in outcome was assumed.
FIGURE 5.
FIGURE 5.
Effect of number of covariates (k) in risk model on resubstitution bias in AUC. Population box plot represents true AUC distribution in interested population at large. Sample box plot represents distribution of AUC derived from sample dataset used to construct risk model. Each box plot was based on 1,000 simulations. When k = 5, there is slight upward (optimistic) bias in sample AUC distribution compared with true population. Degree of optimistic bias increases drastically when k increases to 50.
FIGURE 6.
FIGURE 6.
Schema for biomarker cut point analysis and evaluation.

References

    1. Chae S, Kang KM, Kim HJ, et al. . Neutrophil-lymphocyte ratio predicts response to chemotherapy in triple-negative breast cancer. Curr Oncol. 2018;25:e113–e119. - PMC - PubMed
    1. Patel DA, Xi J, Luo J, et al. . Neutrophil-to-lymphocyte ratio as a predictor of survival in patients with triple-negative breast cancer. Breast Cancer Res Treat. 2019;174:443–452. - PubMed
    1. Pistelli M, De Lisa M, Ballatore Z, et al. . Pre-treatment neutrophil to lymphocyte ratio may be a useful tool in predicting survival in early triple negative breast cancer patients. BMC Cancer. 2015;15:195. - PMC - PubMed
    1. Leon-Ferre RA, Polley MY, Liu H, et al. . Impact of histopathology, tumor-infiltrating lymphocytes, and adjuvant chemotherapy on prognosis of triple-negative breast cancer. Breast Cancer Res Treat. 2018;167:89–99. - PMC - PubMed
    1. Harrell FE. Regression Modeling Strategies with Applications to Linear Models, Logistic and Ordinary Regression, and Survival Analysis. 2nd ed. Springer; 2015:22–28.

Publication types