Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep;33(9):1577-1594.
doi: 10.1177/09622802241267356. Epub 2024 Aug 8.

Estimation and inference on the partial volume under the receiver operating characteristic surface

Affiliations

Estimation and inference on the partial volume under the receiver operating characteristic surface

Kate J Young et al. Stat Methods Med Res. 2024 Sep.

Abstract

measures of biomarker accuracy that employ the receiver operating characteristic surface have been proposed for biomarkers that classify patients into one of three groups: healthy, benign, or aggressive disease. The volume under the receiver operating characteristic surface summarizes the overall discriminatory ability of a biomarker in such configurations, but includes cutoffs associated with clinically irrelevant true classification rates. Due to the lethal nature of pancreatic cancer, cutoffs associated with a low true classification rate for identifying patients with pancreatic cancer may be undesirable and not appropriate for use in a clinical setting. In this project, we study the properties of a more focused criterion, the partial volume under the receiver operating characteristic surface, that summarizes the diagnostic accuracy of a marker in the three-class setting for regions restricted to only those of clinical interest. We propose methods for estimation and inference on the partial volume under the receiver operating characteristic surface under parametric and non-parametric frameworks and apply these methods to the evaluation of potential biomarkers for the diagnosis of pancreatic cancer.

Keywords: Biomarkers; classification; delta Method; kernels; receiver operating characteristic surface.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interestsThe author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
For a single hypothetical marker this illustrates three (out of many) different partial volumes one could consider depending on the clinical interest. Plots a to c show the pVUS with the ROC surface for the biomarker of interest. Plot a shows the pVUS after forcing a true class rate for group I of 0.4–0.6 and for group 3 a range of 0.1–0.3. The corresponding pVUS is visualized by the purple tube. Analogously, in plot b, we force specific ranges for TC3 (group 3) and TC2 (group 2), and in plot c, we force specific ranges for TC1 and TC2. Plots d to f display the same pVUS regions for the same biomarker as plots a to c, respectively, but with the uninformative ROC surface for comparison. pVUS: partial volume under the receiver operating characteristic surface; ROC: receiver operating characteristic; TC: true classification.
Figure 2.
Figure 2.
The ROC surface for two hypothetical markers and their corresponding feasible regions. Plot a shows the uninformative ROC surface and the ROC surface for hypothetical marker in which the ROC surface does not intersect the uninformative ROC surface, and plots b to d show the marker’s feasible regions for pVUSTC1,TC3,pVUSTC2,TC3, and pVUSTC1,TC2. Plot e shows the uninformative ROC surface and the ROC surface for hypothetical marker in which the ROC surface intersects the uninformative ROC surface, and plots f to h show the marker’s feasible regions for pVUSTC1,TC3,pVUSTC2,TC3, and pVUSTC1,TC2. These plots illustrate the following areas on each plane: A, which consists of all points that fall in the projection of the ROC surface of interest on the given plane but not in the projection of the uninformative surface; B, which consists of all points that fall in the projection of both the uninformative surface and the ROC surface of interest and; C, which consists of all points that fall in the projection of the uninformative surface onto the given plane, but not in the projection of the ROC surface of interest. pVUS: partial volume under the receiver operating characteristic surface; ROC: receiver operating characteristic; TC: true classification.
Figure 3.
Figure 3.
Plots of the estimated coverage of the pVUS CI’s under scenarios involving normal, log-normal, and gamma distributed biomarker scores at various levels of the VUS and pVUS. The colors denote the true VUS and the sizes of the dots denote the true pVUS. The dotted line represents a coverage of 95%. For corresponding scaled pVUS see Supplemental Appendix E. pVUS: partial volume under the receiver operating characteristic surface; CI: confidence interval; VUS: volume under the receiver operating characteristic surface.
Figure 4.
Figure 4.
Plots of the estimated size (at 5% significance) and power of the statistical test of the pVUS involving normal, log-normal, and gamma distributed biomarker scores at various levels of pVUSSC for a restricted region of TC3[0.1,0.3] and TC1[0.1,0.3]. The pVUS values corresponding to the pVUSSC values shown on the x-axis are all taken on the same region of interest. Furthermore, for the selected region of interest in these simulations, an uninformative marker exhibits a pVUS of 0.024. Thus, the size of the test was evaluated at pVUS=0.024, and the power of the test was evaluated at various values of pVUS>0.024. The dotted lines represent powers of 5%, 80%, and 90%. We note that the corresponding pVUSSC values in these simulations range from 0.167 to 0.717. pVUS: partial volume under the receiver operating characteristic surface; TC: true classification.
Figure 5.
Figure 5.
The estimated feasible regions and feasible regions for inference of the seven pancreatic ductal adenocarcinoma (PDAC) biomarkers. The feasible regions are represented by the unions of regions A and B (AB) and region B is the feasible region for inference.
Figure 6.
Figure 6.
The estimates and 95% Cl’s for the VUS, scaled pVUS, and pVUS for seven potential PDAC biomarkers. The pVUS is defined by TC1 boundaries p1=0,p2=0.2, and TC3 boundaries, q1=0.6,q2=0.8.
Figure 7.
Figure 7.
The estimated distributions, ROC surfaces, and pVUS regions (defined by TC1 boundaries p1=0,p2=0.2, and TC3 boundaries, q1=0.6,q2=0.8) for three PDAC biomarkers: LRG1, CA19.9, and MRPS24. The top three panels display the kernel-based probability density estimates for the Box-Cox transformed biomarker measurements. The shaded areas in these plots represent the range of cutoff points c1 and c2 that correspond to the restricted region. This restricted region is illustrated in the bottom three panels with the estimated ROC surfaces for each biomarker. Similar figures for the other four biomarkers studied are provided in Supplemental Appendix F. pVUS: partial volume under the receiver operating characteristic surface; PDAC: pancreatic ductal adenocarcinoma; ROC: receiver operating characteristic.

References

    1. Capello M, Bantis LE, Scelo G et al. Sequential validation of blood-based protein biomarker candidates for early-stage pancreatic cancer. J Natl Cancer Inst 2017; 109: djw266. - PMC - PubMed
    1. Deshwar AB, Sugar E, Torto D et al. Diagnostic intervals and pancreatic ductal adenocarcinoma (PDAC) resectability: a single-center retrospective analysis. Ann Pancreat Cancer 2018; 1: 13–13. - PMC - PubMed
    1. Vujasinovic M, Dugic A, Maisonneuve P et al. Risk of developing pancreatic cancer in patients with chronic pancreatitis. J Clin Med 2020; 9: 3720. - PMC - PubMed
    1. Nakas CT, Dalrymple-Alford JC, Anderson T et al. Generalization of Youden index for multiple-class classification problems applied to the assessment of externally validated cognition in Parkinson disease screening. Stat Med 2013; 32: 995–1003. - PubMed
    1. Wang Z and Chang YCI. Marker selection via maximizing the partial area under the ROC curve of linear risk scores. Biostatistics 2011; 12: 369–385. - PubMed

LinkOut - more resources