Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Feb;24(1):68-106.
doi: 10.1177/0962280214537390. Epub 2014 Jun 11.

Quantitative imaging biomarkers: a review of statistical methods for computer algorithm comparisons

Affiliations
Review

Quantitative imaging biomarkers: a review of statistical methods for computer algorithm comparisons

Nancy A Obuchowski et al. Stat Methods Med Res. 2015 Feb.

Abstract

Quantitative biomarkers from medical images are becoming important tools for clinical diagnosis, staging, monitoring, treatment planning, and development of new therapies. While there is a rich history of the development of quantitative imaging biomarker (QIB) techniques, little attention has been paid to the validation and comparison of the computer algorithms that implement the QIB measurements. In this paper we provide a framework for QIB algorithm comparisons. We first review and compare various study designs, including designs with the true value (e.g. phantoms, digital reference images, and zero-change studies), designs with a reference standard (e.g. studies testing equivalence with a reference standard), and designs without a reference standard (e.g. agreement studies and studies of algorithm precision). The statistical methods for comparing QIB algorithms are then presented for various study types using both aggregate and disaggregate approaches. We propose a series of steps for establishing the performance of a QIB algorithm, identify limitations in the current statistical literature, and suggest future directions for research.

Keywords: agreement; bias; image metrics; imaging biomarkers; precision; quantitative imaging; repeatability; reproducibility.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The role of quantitative medical imaging algorithms and dependency of the estimated QIB on sources of bias and precision.
Figure 2
Figure 2
Trade-offs between different study designs which can be used for algorithm characterization and validation.
Figure 3
Figure 3
Decision tree for identifying statistical methods for a QIB algorithm comparison study. *Reference standard, defined as a well-accepted or commonly used method for measuring the biomarker but with recognized bias and/or measurement error. Examples of reference standards are histology, expert human readers, or a state-of-the-art QIB algorithm.
Figure 3
Figure 3
Decision tree for identifying statistical methods for a QIB algorithm comparison study. *Reference standard, defined as a well-accepted or commonly used method for measuring the biomarker but with recognized bias and/or measurement error. Examples of reference standards are histology, expert human readers, or a state-of-the-art QIB algorithm.
Figure 4
Figure 4
Histograms of the maximum likelihood estimators of the ICC of two QIB algorithms (left and center columns) and of the difference in their ICC (right column), estimated using an imperfect reference standard (top row, ICC of reference standard 0.8), a nearly perfect reference standard (center row, ICC of 0.999), and a perfect reference standard (bottom row). The red line denotes the true value. Bias in the maximum likelihood estimators is negligible when we use the nearly perfect reference standard or true value, but is significant when we use imperfect reference standards.

References

    1. Biomarkers Definitions Working Group. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther 2001. 2001;(69):89–95. - PubMed
    1. Woodcock J, Woosley R. The FDA critical path initiative and its influence on new drug development. Annu Rev Med. 2008;(59):1–12. - PubMed
    1. QIBA Metrology Metrology Working Group. The Emerging Science of Quantitative Imaging Biomarkers: Terminology and Definitions for Scientific Studies and for Regulatory Submissions. Submitted to SMMR - PubMed
    1. Performance Working Group. Quantitative Imaging Biomarkers: A Review of Statistical Methods for Technical Performance Assessment. Submitted to SMMR - PMC - PubMed
    1. Yankelevitz DF. Quantifying “Overdiagnosis” in Lung Cancer Screening. Radiology. 2008;246(1):332–333. - PubMed

Publication types