Comparative Study

Med Phys. 2007 Jun;34(6):2024-38. doi: 10.1118/1.2736289.

Evaluating computer-aided detection algorithms

Hong Jun Yoon et al.

Abstract

Computer-aided detection (CAD) has attracted extensive research interest over the last two decades. It is recognized that the full potential of CAD can be realized only by improving the performance and robustness of CAD algorithms, and this requires good evaluation methodology that permits CAD designers to optimize their algorithms. Free-response receiver operating characteristic (FROC) curves are widely used to assess CAD performance; however, evaluation rarely proceeds beyond determining the lesion localization fraction (sensitivity) at an arbitrarily selected number of non-lesion localizations (false marks) per image. This work describes a FROC curve-fitting procedure that uses a recent model of visual search as a framework for the free-response task. A maximum likelihood procedure for estimating the parameters of the model from free-response data and fitting CAD-generated FROC curves was implemented. Procedures were also implemented to estimate two figures of merit and associated statistics, such as 95% confidence intervals and goodness of fit. One of the figures of merit does not require the arbitrary specification of an operating point at which to evaluate CAD performance. For comparison, a related method termed initial detection and candidate analysis (IDCA) was also implemented; it is applicable when all suspicious regions are reported. The two methods were tested on seven mammography CAD data sets, and both yielded good to excellent fits. The search model approach has the advantage that it can potentially be applied to radiologist-generated free-response data, where not all suspicious regions are reported, only those deemed sufficiently suspicious to warrant clinical follow-up. This work represents the first practical application of the search model to an important evaluation problem in diagnostic radiology. Software based on this work is expected to benefit CAD developers working in diverse areas of medical imaging.
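As a rough illustration of the fitted FROC curve, the sketch below assumes the usual search-model parameterization (λ governing non-lesion localizations per image, ν the lesion localization plateau, μ the separation of the two rating distributions, ζ the reporting cutoff); the specific functional form here is an assumption for illustration, not the paper's exact implementation.

```python
import numpy as np
from scipy.stats import norm

def search_model_froc(mu, lam, nu, zeta):
    """FROC operating point implied by the search model at cutoff zeta.

    lam: mean number of latent non-lesion localizations per image,
    nu:  fraction of lesions flagged as latent localizations,
    mu:  separation of the lesion and non-lesion rating distributions.
    """
    nlf = lam * norm.sf(zeta)        # non-lesion localizations per image
    llf = nu * norm.sf(zeta - mu)    # lesion localization fraction
    return nlf, llf

# Trace the predicted FROC curve over a range of cutoffs; as zeta rises,
# both coordinates fall from the end-point (lam, nu) toward the origin.
zetas = np.linspace(-3.0, 5.0, 9)
curve = [search_model_froc(2.0, 1.5, 0.9, z) for z in zetas]
```

Maximum likelihood estimation would then adjust (μ, λ, ν) and the cutoffs so the predicted operating points best match the binned counts.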


Figures

Figure 1
This figure illustrates the IDCA approach to fitting FROC operating points. IDCA regards the lesion and non-lesion localization counts as arising from normal and abnormal “cases” in a pseudo-ROC study. The counts are analyzed by ROC curve-fitting software yielding the fitted curve shown in the upper panel. The FROC curve, shown in the lower panel, is obtained by a mapping operation indicated by the arrow, consisting of a point-by-point multiplication of the pseudo-ROC curve along the y-axis by ν′, and along the x-axis by λ′, where (λ′,ν′) are the coordinates of the observed end-point of the FROC curve. Therefore the corner (1, 1) maps to the end-point (λ′,ν′) and each pseudo-ROC point maps to a unique FROC point. Four pseudo-ROC and four FROC operating points and the corresponding cutoffs ζi (i = 1, 2, 3, 4) are shown.
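The mapping described in the caption can be sketched directly: each pseudo-ROC point (FPF, TPF) is scaled by the observed end-point coordinates (λ′, ν′). The point values below are hypothetical, chosen only to mirror the four operating points in the figure.

```python
import numpy as np

def idca_map(fpf, tpf, lam_prime, nu_prime):
    """Map pseudo-ROC points (FPF, TPF) to FROC points (NLF, LLF).

    The IDCA mapping multiplies the x-axis by lambda' and the y-axis
    by nu', so the ROC corner (1, 1) lands on the observed FROC
    end-point (lambda', nu').
    """
    nlf = lam_prime * np.asarray(fpf, dtype=float)
    llf = nu_prime * np.asarray(tpf, dtype=float)
    return nlf, llf

# Four pseudo-ROC operating points (illustrative values)
fpf = [0.05, 0.15, 0.40, 1.00]
tpf = [0.60, 0.75, 0.90, 1.00]
nlf, llf = idca_map(fpf, tpf, lam_prime=1.2, nu_prime=0.85)
# The corner (1, 1) maps to the end-point (1.2, 0.85)
```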
Figure 2
This figure compares the cutoffs implicit in the two models; panel (a) corresponds to IDCA and panel (b) corresponds to the search model. For an R-rating free-response study there are R ordered cutoffs ζi (i = 1, 2, …, R). The observed non-lesion and lesion localization counts are Fi and Ti, respectively. Since the pseudo-ROC point corresponding to the lowest bin is (1, 1), the corresponding cutoff ζ1 is negative infinity and no counts are possible below it, whereas in the search model ζ1 is finite and an unknown number of counts are possible below it. It is this difference that makes search model parameter estimation more difficult.
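The counts F_i and T_i in the caption come from binning rated marks by rating bin. A minimal sketch, assuming each mark carries an integer rating in 1..R and a flag for whether it localized a lesion (the data layout is hypothetical):

```python
import numpy as np

def bin_counts(ratings, is_lesion_loc, R):
    """Bin mark ratings into per-bin non-lesion (F_i) and lesion (T_i)
    localization counts for an R-rating free-response study."""
    ratings = np.asarray(ratings)
    is_lesion_loc = np.asarray(is_lesion_loc, dtype=bool)
    T = np.array([(is_lesion_loc & (ratings == r)).sum()
                  for r in range(1, R + 1)])
    F = np.array([(~is_lesion_loc & (ratings == r)).sum()
                  for r in range(1, R + 1)])
    return F, T
```

IDCA treats the total counts as fixed (the lowest cutoff is effectively negative infinity), whereas the search model must additionally estimate how many latent localizations fell below the finite ζ1 and were never recorded.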
Figure 3
A typical plot of −LLλ(λ) vs. λ, where the ordinate is the negative of the log likelihood after minimization with respect to all parameters except λ. −LLλ(λ) has a minimum at λ = λM; the interval [λL, λU] illustrates the construction of an asymmetric confidence interval for λ, see text.
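The asymmetric interval [λL, λU] can be found by profile likelihood: locate the points on either side of λM where −LLλ(λ) rises by half the 95% chi-square critical value (≈1.92) above its minimum. The profile function below is a toy stand-in, not the paper's actual likelihood.

```python
import numpy as np
from scipy.optimize import brentq, minimize_scalar

def toy_profile_nll(lam):
    """Stand-in for -LL_lambda(lambda): asymmetric about its minimum."""
    return 8.0 * (np.log(lam) - np.log(1.5)) ** 2

# Locate the minimum lambda_M of the profile
res = minimize_scalar(toy_profile_nll, bounds=(0.1, 10.0), method="bounded")
lam_m, nll_min = res.x, res.fun

# 95% CI bounds: where the profile NLL crosses nll_min + 1.92
target = nll_min + 1.92
lam_lo = brentq(lambda l: toy_profile_nll(l) - target, 0.1, lam_m)
lam_hi = brentq(lambda l: toy_profile_nll(l) - target, lam_m, 10.0)
```

Because the toy profile is asymmetric in λ, the resulting interval is wider above λM than below it, matching the construction shown in the figure.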
Figure 4
IDCA (upper panel) and search model (lower panel) fits to data set CAD_A. The dashed curves are the model fits (IDCA in the upper panel, search model in the lower panel) and the solid curves are the raw data (the same raw data are plotted in both panels). Since the fits lie close to the raw data, the two curves can be hard to distinguish; the "staircase" pattern identifies the raw data. The solid circles are operating points obtained by binning the data; the binned data were used by the fitting procedures, and the points are constrained to lie exactly on the raw curve. Shown are 95% confidence intervals for an intermediate operating point. Both fits are visually excellent, although the statistical measures of quality of fit are not as good (p-values 0.0074 and 0.0073; see text for more discussion of this aspect).
Figure 5
As in Figure 4, except this is for data set CAD_B. Both fits are excellent (p-values 0.36 and 0.44).
Figure 6
As in Figure 4, except this is for data set CAD_F. Both fits are excellent (p-values 0.28 and 0.21).
