Needs assessment for next generation computer-aided mammography reference image databases and evaluation studies
- PMID: 21448711
- DOI: 10.1007/s11548-011-0553-9
Needs assessment for next generation computer-aided mammography reference image databases and evaluation studies
Abstract
Introduction: Breast cancer is globally a major threat for women's health. Screening and adequate follow-up can significantly reduce the mortality from breast cancer. Human second reading of screening mammograms can increase breast cancer detection rates, whereas this has not been proven for current computer-aided detection systems as "second reader". Critical factors include the detection accuracy of the systems and the screening experience and training of the radiologist with the system. When assessing the performance of systems and system components, the choice of evaluation methods is particularly critical. Core assets herein are reference image databases and statistical methods.
Methods: We have analyzed characteristics and usage of the currently largest publicly available mammography database, the Digital Database for Screening Mammography (DDSM) from the University of South Florida, in literature indexed in Medline, IEEE Xplore, SpringerLink, and SPIE, with respect to type of computer-aided diagnosis (CAD) (detection, CADe, or diagnostics, CADx), selection of database subsets, choice of evaluation method, and quality of descriptions.
Results: 59 publications presenting 106 evaluation studies met our selection criteria. In 54 studies (50.9%), the selection of test items (cases, images, regions of interest) extracted from the DDSM was not reproducible. Only 2 CADx studies, not any CADe studies, used the entire DDSM. The number of test items varies from 100 to 6000. Different statistical evaluation methods are chosen. Most common are train/test (34.9% of the studies), leave-one-out (23.6%), and N-fold cross-validation (18.9%). Database-related terminology tends to be imprecise or ambiguous, especially regarding the term "case".
Discussion: Overall, both the use of the DDSM as data source for evaluation of mammography CAD systems, and the application of statistical evaluation methods were found highly diverse. Results reported from different studies are therefore hardly comparable. Drawbacks of the DDSM (e.g. varying quality of lesion annotations) may contribute to the reasons. But larger bias seems to be caused by authors' own decisions upon study design. RECOMMENDATIONS/CONCLUSION: For future evaluation studies, we derive a set of 13 recommendations concerning the construction and usage of a test database, as well as the application of statistical evaluation methods.
Similar articles
-
Image feature evaluation in two new mammography CAD prototypes.Int J Comput Assist Radiol Surg. 2011 Nov;6(6):721-35. doi: 10.1007/s11548-011-0549-5. Epub 2011 Mar 5. Int J Comput Assist Radiol Surg. 2011. PMID: 21380554
-
Computer-aided detection; the effect of training databases on detection of subtle breast masses.Acad Radiol. 2010 Nov;17(11):1401-8. doi: 10.1016/j.acra.2010.06.009. Epub 2010 Jul 22. Acad Radiol. 2010. PMID: 20650667 Free PMC article.
-
A curated mammography data set for use in computer-aided detection and diagnosis research.Sci Data. 2017 Dec 19;4:170177. doi: 10.1038/sdata.2017.177. Sci Data. 2017. PMID: 29257132 Free PMC article.
-
CADx of mammographic masses and clustered microcalcifications: a review.Med Phys. 2009 Jun;36(6):2052-68. doi: 10.1118/1.3121511. Med Phys. 2009. PMID: 19610294 Review.
-
The efficacy of using computer-aided detection (CAD) for detection of breast cancer in mammography screening: a systematic review.Acta Radiol. 2019 Jan;60(1):13-18. doi: 10.1177/0284185118770917. Epub 2018 Apr 17. Acta Radiol. 2019. PMID: 29665706
Cited by
-
Independent component analysis to detect clustered microcalcification breast cancers.ScientificWorldJournal. 2012;2012:540457. doi: 10.1100/2012/540457. Epub 2012 Apr 24. ScientificWorldJournal. 2012. PMID: 22654626 Free PMC article.
-
Characterizing Architectural Distortion in Mammograms by Linear Saliency.J Med Syst. 2017 Feb;41(2):26. doi: 10.1007/s10916-016-0672-5. Epub 2016 Dec 22. J Med Syst. 2017. PMID: 28005248
-
Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model.Int J Comput Assist Radiol Surg. 2014 Nov;9(6):1005-20. doi: 10.1007/s11548-014-0992-1. Epub 2014 Mar 25. Int J Comput Assist Radiol Surg. 2014. PMID: 24664267 Free PMC article.
-
Identification of breast lesion through integrated study of gorilla troops optimization and rotation-based learning from MRI images.Sci Rep. 2023 Jul 18;13(1):11577. doi: 10.1038/s41598-023-36300-3. Sci Rep. 2023. PMID: 37463919 Free PMC article.
-
Optimization of Network Topology in Computer-Aided Detection Schemes Using Phased Searching with NEAT in a Time-Scaled Framework.Cancer Inform. 2014 Oct 13;13(Suppl 1):17-27. doi: 10.4137/CIN.S13885. eCollection 2014. Cancer Inform. 2014. PMID: 25392680 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous