Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 10;40(20):4522-4539.
doi: 10.1002/sim.9077. Epub 2021 Jun 3.

Estimation and construction of confidence intervals for biomarker cutoff-points under the shortest Euclidean distance from the ROC surface to the perfection corner

Affiliations

Estimation and construction of confidence intervals for biomarker cutoff-points under the shortest Euclidean distance from the ROC surface to the perfection corner

Brian R Mosier et al. Stat Med. .

Abstract

Pancreatic ductal adenocarcinoma (PDAC) is an aggressive type of cancer with a 5-year survival rate of less than 5%. As in many other diseases, its diagnosis might involve progressive stages. It is common that in biomarker studies referring to PDAC, recruitment involves three groups: healthy individuals, patients that suffer from chronic pancreatitis, and PDAC patients. Early detection and accurate classification of the state of the disease are crucial for patients' successful treatment. ROC analysis is the most popular way to evaluate the performance of a biomarker and the Youden index is commonly employed for cutoff derivation. The so-called generalized Youden index has a drawback in the three-class case of not accommodating the full data set when estimating the optimal cutoffs. In this article, we explore the use of the Euclidean distance of the ROC to the perfection corner for the derivation of cutoffs in trichotomous settings. We construct an inferential framework that involves both parametric and nonparametric techniques. Our methods can accommodate the full information of a given data set and thus provide more accurate estimates in terms of the decision-making cutoffs compared with a Youden-based strategy. We evaluate our approaches through extensive simulations and illustrate them on a PDAC biomarker study.

Keywords: 3-class; Box-Cox; Euclidean distance; ROC; Youden index; cutoffs; kernels; perfection corner.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Plots comparing the Euclidean distance and the Youden index of a biomarker score viewed from different perspectives for convenience. Euclidean distance is represented by the red line segment and Youden index is represented by the blue line segment. The useless biomarker plane is represented by the diagonal gridded plane.
FIGURE 2
FIGURE 2
Plot of coverage for c1 and c2 derived using the Euclidean method for all scenarios. The four plots, in order from left to right contain coverages for the following methods: (1) the delta method approach based on the assumption of normally distributed biomarker scores, (2) the Box-Cox approach, which assumes biomarker scores can be transformed to normality, (3) the naive kernel-based approach without the coverage correction of added noise for the bootstrap, and (4) the proposed kernel-based method with the coverage correction (denoted kernels w/𝜖). Each subplot contains four shaded regions, representing sample sizes of (n1,n2,n3) = (50,50,50), (n1,n2,n3) = (100,100,100), (n1,n2,n3) = (200,200,200), and (n1,n2,n3) = (150,100,50). Each shaded region contains two vertical bands of points, the left of which corresponds to the coverages of the Euclidean method, and the right of which corresponds to the Youden index, as illustrated in the leftmost subplot.
FIGURE 3
FIGURE 3
Interval widths for biomarker scores for all simulated scenarios. Scenarios include data generated from normal, lognormal, gamma, and normal mixture distributions. Sample sizes of (n1,n2,n3) = (50,50,50), (n1,n2,n3) = (100,100,100), (n1,n2,n3) = (200,200,200), and (n1,n2,n3) = (150,100,50) are included, as well as scenarios corresponding to VUS = 0.4, and VUS = 0.6. We include interval widths for the delta method approach (Del), the Box-Cox approach (BC), and the proposed kernel-based approach with added noise that was described in Section 4 (K w/ є).
FIGURE 4
FIGURE 4
The plots are histograms of the percent difference in total classification (sum of TCRs) for the Euclidean method and Youden index. Optimal cutoff values were estimated using the Euclidean and Youden methods based on training data. The estimated cutoff values were then used to derive true classification rates using the true distributions of the biomarker scores. The area to the right of 0 corresponds to a higher sum of true classification rates for the Euclidean method than for the Youden index. Plots (a)-(d) correspond to biomarkers simulated from normal distributions with equally spaced means of (μ1,μ2,μ3) = (5,5.6592,6.3184), and standard deviations of (σ1,σ2,σ3) = (1,1,1). Plots (e)-(h) correspond to normal distributions with means of (μ1,μ2,μ3) = (10,10.8,12), and standard deviations of (σ1,σ2,σ3) = (1,1,1.7)
FIGURE 5
FIGURE 5
Plots of biomarker scores and ROC surfaces for each of the biomarkers.

Similar articles

Cited by

References

    1. Foucher ED, Ghigo C, Chouaib S, Galon J, Iovanna J, Olive D. Pancreatic ductal adenocarcinoma: a strong imbalance of good and bad immunological cops in the tumor microenvironment. Front Immunol. 2018; 9:1044. - PMC - PubMed
    1. Wingren C, Sandstrom A, Segersvard R, Carlsson A, Andersson R, Lohr M, Borrebaeck CA. identification of serum biomarker signatures associated with pancreatic cancer. Cancer Res. 2012; 72(10): 2481–2490. - PubMed
    1. Gold DV, Goggins M, Modrak DE, et al. Detection of early-stage pancreatic adenocarcinoma. Cancer Epidemiol, Biomarkers and Prevention. 2010; 19(11): 2786–2794. - PMC - PubMed
    1. Chan A, Dimitromanolakis A, Brand RE, Serra S, Diamandis EP, Blasutig IM. Validation of biomarkers that complement CA19.9 in detecting early pancreatic cancer. Clin Cancer Res. 2014; 20(22): 5787–5795. - PMC - PubMed
    1. Capello M, Bantis LE, Ghislaine S, et al. Sequential validation of blood-based protein biomarker candidates for early-stage pancreatic cancer. J Natl Cancer Inst. 2017; 109(4): djw266. - PMC - PubMed

Publication types