Computer-aided classification of breast masses: performance and interobserver variability of expert radiologists versus residents
- PMID: 20971779
- PMCID: PMC3009385
- DOI: 10.1148/radiol.10081308
Computer-aided classification of breast masses: performance and interobserver variability of expert radiologists versus residents
Abstract
Purpose: To evaluate the interobserver variability in descriptions of breast masses by dedicated breast imagers and radiology residents and determine how any differences in lesion description affect the performance of a computer-aided diagnosis (CAD) computer classification system.
Materials and methods: Institutional review board approval was obtained for this HIPAA-compliant study, and the requirement to obtain informed consent was waived. Images of 50 breast lesions were individually interpreted by seven dedicated breast imagers and 10 radiology residents, yielding 850 lesion interpretations. Lesions were described with use of 11 descriptors from the Breast Imaging Reporting and Data System, and interobserver variability was calculated with the Cohen κ statistic. Those 11 features were selected, along with patient age, and merged together by a linear discriminant analysis (LDA) classification model trained by using 1005 previously existing cases. Variability in the recommendations of the computer model for different observers was also calculated with the Cohen κ statistic.
Results: A significant difference was observed for six lesion features, and radiology residents had greater interobserver variability in their selection of five of the six features than did dedicated breast imagers. The LDA model accurately classified lesions for both sets of observers (area under the receiver operating characteristic curve = 0.94 for residents and 0.96 for dedicated imagers). Sensitivity was maintained at 100% for residents and improved from 98% to 100% for dedicated breast imagers. For residents, the computer model could potentially improve the specificity from 20% to 40% (P < .01) and the κ value from 0.09 to 0.53 (P < .001). For dedicated breast imagers, the computer model could increase the specificity from 34% to 43% (P = .16) and the κ value from 0.21 to 0.61 (P < .001).
Conclusion: Among findings showing a significant difference, there was greater interobserver variability in lesion descriptions among residents; however, an LDA model using data from either dedicated breast imagers or residents yielded a consistently high performance in the differentiation of benign from malignant breast lesions, demonstrating potential for improving specificity and decreasing interobserver variability in biopsy recommendations.
© RSNA, 2010
Conflict of interest statement
Authors stated no financial relationship to disclose.
Figures
Comment in
-
[Experts versus beginners, 1:1? : Can CAD achieve the equalizer in the classification of breast lesions?].Radiologe. 2011 Jun;51(6):453-4. doi: 10.1007/s00117-011-2173-3. Radiologe. 2011. PMID: 21487797 German. No abstract available.
References
-
- Lee CH. Screening mammography: proven benefit, continued controversy. Radiol Clin North Am 2002;40(3):395–407 - PubMed
-
- Pisano ED, Gatsonis C, Hendrick E, et al. Diagnostic performance of digital versus film mammography for breast-cancer screening. N Engl J Med 2005;353(17):1773–1783 - PubMed
-
- Helvie MA, Ikeda DM, Adler DD. Localization and needle aspiration of breast lesions: complications in 370 cases. AJR Am J Roentgenol 1991;157(4):711–714 - PubMed
-
- Dixon JM, John TG. Morbidity after breast biopsy for benign disease in a screened population. Lancet 1992;339(8785):128. - PubMed
-
- Hall FM, Storella JM, Silverstone DZ, Wyshak G. Nonpalpable breast lesions: recommendations for biopsy based on suspicion of carcinoma at mammography. Radiology 1988;167(2):353–358 - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous
