Med Phys. 2015 Jul;42(7):4241-9. doi: 10.1118/1.4922681.

Assessment of performance and reproducibility of applying a content-based image retrieval scheme for classification of breast lesions

Rohith Reddy Gundreddy et al.

Abstract

Purpose: To develop a new computer-aided diagnosis (CAD) scheme using a content-based image retrieval (CBIR) approach for classification between malignant and benign breast lesions depicted on digital mammograms, and to assess CAD performance and reproducibility.

Methods: An image dataset including 820 regions of interest (ROIs) was used. Among them, 431 ROIs depict malignant lesions and 389 depict benign lesions. After applying an image preprocessing step to define the lesion center, two image features were computed from each ROI. The first feature is the average pixel value of a mapped region generated using a watershed algorithm. The second feature is the average pixel value difference between an ROI's center region and the rest of the image. A two-step CBIR approach uses these two features sequentially to search for the ten most similar reference ROIs for each queried ROI. A similarity-based classification score was then computed to predict the likelihood of the queried ROI depicting a malignant lesion. To assess the reproducibility of the CAD scheme, we selected another independent testing dataset of 100 ROIs. For each ROI in the testing dataset, we added four additional randomly selected lesion center seed pixels and examined the variation of the resulting classification scores.
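The two-feature, two-step retrieval described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the watershed-based feature 1 is treated as a precomputed value per ROI, and the center-region radius, candidate-pool size, and similarity weighting are all assumptions, with hypothetical function names.

```python
import numpy as np

def feature_center_diff(roi, radius):
    """Feature 2 (as described in the abstract): mean pixel value of a
    circular central region minus the mean of the remaining pixels."""
    h, w = roi.shape
    yy, xx = np.ogrid[:h, :w]
    center = (yy - h // 2) ** 2 + (xx - w // 2) ** 2 <= radius ** 2
    return roi[center].mean() - roi[~center].mean()

def cbir_score(query_feats, ref_feats, ref_labels, k=10, n_candidates=50):
    """Two-step retrieval: prefilter references by feature 1 distance,
    then rank the candidates by feature 2 and keep the k nearest.
    The score is a similarity-weighted fraction of malignant references
    (the weighting scheme here is an assumption)."""
    d1 = np.abs(ref_feats[:, 0] - query_feats[0])
    cand = np.argsort(d1)[:n_candidates]           # step 1: feature 1
    d2 = np.abs(ref_feats[cand, 1] - query_feats[1])
    top = cand[np.argsort(d2)[:k]]                 # step 2: feature 2
    sim = 1.0 / (1.0 + np.abs(ref_feats[top] - query_feats).sum(axis=1))
    return float((sim * ref_labels[top]).sum() / sim.sum())
```

Note that, consistent with the abstract, no lesion segmentation is needed: both features are computed directly from pixel values around a queried lesion center.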

Results: An area under the ROC curve (AUC) of 0.962 ± 0.006 was obtained when applying a leave-one-out validation method to the 820 ROIs. On the independent testing dataset, the initial AUC was 0.832 ± 0.040; using the median classification score of each ROI over the five queried seeds, the AUC increased to 0.878 ± 0.035.
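The reproducibility step reported here, re-querying each ROI with extra random lesion-center seeds and keeping the median classification score, reduces to a small aggregation; a minimal sketch with hypothetical names:

```python
import numpy as np

def median_seed_score(roi, seeds, score_fn):
    """Classify the same ROI once per queried lesion-center seed
    (the original seed plus four random re-seeds in this study)
    and keep the median score as the final output."""
    return float(np.median([score_fn(roi, s) for s in seeds]))

# Toy usage: with a scoring function that just echoes the seed,
# five seeds yield their median value.
median_seed_score(None, [0.2, 0.9, 0.5, 0.4, 0.7], lambda roi, s: s)  # -> 0.5
```

The median is a natural choice here because it suppresses the occasional outlier score produced by a poorly placed seed, which is consistent with the AUC improvement reported above.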

Conclusions: The authors demonstrated that (1) a simple and efficient CBIR scheme using two lesion density distribution related features achieved high performance in classifying breast lesions without actual lesion segmentation, and (2) as with conventional CAD schemes using global optimization approaches, improving reproducibility remains one of the challenges in developing CAD schemes using a CBIR-based regional optimization approach.


Figures

FIG. 1. Illustration of 50 malignant ROIs extracted in our independent testing dataset.

FIG. 2. Illustration of 50 benign ROIs extracted in our independent testing dataset.

FIG. 3. An example of a matrix extracted from "a lesion center" (a) and the corresponding output of applying watershed processing (b).

FIG. 4. A diagram showing the variation of AUC values versus increasing kernel size of the watershed matrix used to compute feature 1 (F1).

FIG. 5. A scatter diagram showing the distribution of the two image features computed from 820 ROIs. The red square marks indicate malignant ROIs, and the blue diamond marks represent benign ROIs.

FIG. 6. Histograms of the number of malignant reference ROIs retrieved using feature 1 (F1) among the 50 malignant and 50 benign testing ROIs.

FIG. 7. Comparison of three ROC curves generated using the ROCKIT program. The computed AUC values are 0.962 ± 0.006, 0.603 ± 0.020, and 0.515 ± 0.020 when using our CBIR-based classification scores (a leave-one-ROI-out validation method), the average of the two image features, and the first feature computed from the watershed-generated maps, respectively.

FIG. 8. Distribution of classification scores computed using the original queried lesion center seeds (marked by *) and the median classification scores among 50 malignant ROIs (a) and 50 benign ROIs (b). In both diagrams (a) and (b), the solid red line and dashed blue line indicate the average classification score level of the 50 ROIs using the original seeds and the median classification scores using the five seeds, respectively.

FIG. 9. Comparison of two ROC curves generated using two different sets of classification scores of the 100 testing ROIs. In these two ROC curves, AUC = 0.832 ± 0.040 and 0.878 ± 0.035 when using the classification scores computed from the original queried lesion center seed and the median classification scores computed from five randomly placed lesion center seeds, respectively.
