Med Phys. 2012 Aug;39(8):4903-17.

doi: 10.1118/1.4736530.

Estimation of breast percent density in raw and processed full field digital mammography images via adaptive fuzzy c-means clustering and support vector machine segmentation

Brad M Keller¹, Diane L Nathan, Yan Wang, Yuanjie Zheng, James C Gee, Emily F Conant, Despina Kontos

PMID: 22894417
PMCID: PMC3416877
DOI: 10.1118/1.4736530

Brad M Keller et al. Med Phys. 2012 Aug.

Affiliation

¹ Department of Radiology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA. brad.keller@uphs.upenn.edu

Abstract

Purpose: The amount of fibroglandular tissue in the breast as estimated mammographically, commonly referred to as breast percent density (PD%), is one of the most significant risk factors for developing breast cancer. Approaches to quantifying breast density commonly rely on either semiautomated methods or visual assessment, both of which are highly subjective. Furthermore, most studies published to date on computer-aided assessment of breast PD% have used digitized screen-film mammograms, while digital mammography is increasingly replacing screen-film mammography in breast cancer screening protocols. Digital mammography generates two types of images for analysis, raw (i.e., "FOR PROCESSING") and vendor postprocessed (i.e., "FOR PRESENTATION"), of which postprocessed images are commonly used in clinical practice. An algorithm that effectively estimates breast PD% in both raw and postprocessed digital mammography images would benefit both direct clinical application and retrospective analysis.

Methods: This work proposes a new algorithm for fully automated quantification of breast PD% based on adaptive multiclass fuzzy c-means (FCM) clustering and support vector machine (SVM) classification, optimized for the imaging characteristics of both raw and processed digital mammography images as well as for individual patient and image characteristics. Our algorithm first delineates the breast region within the mammogram via an automated thresholding scheme that identifies background air, followed by a straight-line Hough transform that extracts the pectoral muscle region. The algorithm then applies adaptive FCM clustering, with the number of clusters derived from image properties of the specific mammogram, to subdivide the breast into regions of similar gray-level intensity. Finally, an SVM classifier is trained to identify which clusters within the breast tissue are likely fibroglandular; these clusters are then aggregated into a final dense tissue segmentation that is used to compute breast PD%. Our method is validated on a group of 81 women for whom bilateral, mediolateral oblique, raw and processed screening digital mammograms were available, and agreement is assessed with both continuous and categorical density estimates made by a trained breast-imaging radiologist.
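
The clustering and aggregation steps described above can be illustrated with a short sketch. This is not the authors' implementation: it runs standard fuzzy c-means on a toy list of 1-D gray-level values with a fixed cluster count `k` and fuzzifier `m = 2`, whereas the paper derives the number of clusters adaptively from each image. The function names, the toy intensities, the evenly spaced initialization, and the hand-picked set of dense clusters (a stand-in for the trained SVM classifier) are all assumptions made for illustration.

```python
def fuzzy_c_means_1d(values, k, m=2.0, iters=100, tol=1e-6):
    """Standard fuzzy c-means on scalar values; returns sorted centroids."""
    lo, hi = min(values), max(values)
    # Assumption: start with centroids evenly spaced over the intensity range.
    centroids = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    power = 2.0 / (m - 1.0)
    for _ in range(iters):
        # Membership update: u_ij is proportional to d_ij^(-2/(m-1)).
        memberships = []
        for x in values:
            dists = [abs(x - c) for c in centroids]
            if min(dists) == 0.0:
                # The point sits exactly on a centroid: full membership there.
                hit = dists.index(0.0)
                row = [1.0 if j == hit else 0.0 for j in range(k)]
            else:
                inv = [d ** -power for d in dists]
                total = sum(inv)
                row = [v / total for v in inv]
            memberships.append(row)
        # Centroid update: mean of the values weighted by u_ij^m.
        updated = []
        for j in range(k):
            weights = [row[j] ** m for row in memberships]
            updated.append(sum(w * x for w, x in zip(weights, values)) / sum(weights))
        shift = max(abs(a - b) for a, b in zip(centroids, updated))
        centroids = updated
        if shift < tol:
            break
    return sorted(centroids)


def hard_labels(values, centroids):
    """Assign each value to its nearest centroid (the highest-membership cluster)."""
    return [min(range(len(centroids)), key=lambda j: abs(x - centroids[j]))
            for x in values]


def percent_density(labels, dense_clusters):
    """PD% = 100 * (pixels in dense clusters) / (all breast pixels)."""
    dense = sum(1 for lab in labels if lab in dense_clusters)
    return 100.0 * dense / len(labels)


# Toy breast-region intensities (hypothetical values, three obvious groups).
pixels = [0.10, 0.12, 0.15, 0.11, 0.50, 0.52, 0.55, 0.90, 0.92, 0.95]
centroids = fuzzy_c_means_1d(pixels, k=3)
labels = hard_labels(pixels, centroids)
# Assumption: only the brightest cluster is treated as fibroglandular here;
# in the paper this decision is made by the SVM classifier.
pd_percent = percent_density(labels, dense_clusters={2})
print(labels, pd_percent)
```

Sorting the centroids makes the cluster index increase with brightness, so the dense clusters can be named by index, much as the Figure 6 caption combines clusters 5–6 of a k = 6 mammogram.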

Results: Strong association between algorithm-estimated and radiologist-provided breast PD% was detected for both raw (r = 0.82, p < 0.001) and processed (r = 0.85, p < 0.001) digital mammograms on a per-breast basis. Stronger agreement was found when overall breast density was assessed on a per-woman basis for both raw (r = 0.85, p < 0.001) and processed (r = 0.89, p < 0.001) mammograms. Strong agreement between categorical density estimates was also seen (weighted Cohen's κ ≥ 0.79). Repeated measures analysis of variance demonstrated no statistically significant differences between the PD% estimates (p > 0.1) due to either presentation of the image (raw vs processed) or method of PD% assessment (radiologist vs algorithm).
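
For readers unfamiliar with the two agreement statistics reported above, the following sketch computes a Pearson correlation on continuous PD% values and a weighted Cohen's κ on the corresponding density categories. It rests on several assumptions: the PD% values are invented toy data, the function names are hypothetical, linear weights are used because the abstract does not state the weighting scheme, and the category cut-offs follow the thresholds listed in the Figure 9 caption, with values between 50% and 51% assigned to category II.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def density_category(pd_percent):
    """Map PD% to a category index 0..3 (I..IV); boundary handling is assumed."""
    if pd_percent < 25:
        return 0
    if pd_percent <= 50:
        return 1
    if pd_percent <= 75:
        return 2
    return 3


def weighted_kappa(rater_a, rater_b, n_categories):
    """Linearly weighted Cohen's kappa for categories coded 0..n_categories-1."""
    n = len(rater_a)
    observed = [[0.0] * n_categories for _ in range(n_categories)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(n_categories)]
    col = [sum(observed[i][j] for i in range(n_categories))
           for j in range(n_categories)]

    def weight(i, j):
        # Disagreement weight grows linearly with category distance.
        return abs(i - j) / (n_categories - 1)

    cells = [(i, j) for i in range(n_categories) for j in range(n_categories)]
    observed_disagreement = sum(weight(i, j) * observed[i][j] for i, j in cells)
    expected_disagreement = sum(weight(i, j) * row[i] * col[j] for i, j in cells)
    return 1.0 - observed_disagreement / expected_disagreement


# Toy PD% estimates (hypothetical, not data from the study).
algorithm_pd = [10, 30, 55, 80, 20, 45, 60, 90]
radiologist_pd = [12, 28, 50, 85, 30, 40, 65, 88]

r = pearson_r(algorithm_pd, radiologist_pd)
kappa = weighted_kappa([density_category(v) for v in algorithm_pd],
                       [density_category(v) for v in radiologist_pd],
                       n_categories=4)
print(round(r, 3), round(kappa, 3))
```

Weighted κ penalizes a disagreement by how many categories apart the two ratings are, which suits ordinal density scores better than the unweighted statistic.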

Conclusions: The proposed fully automated algorithm was successful in estimating breast percent density from both raw and processed digital mammographic images. Accurate assessment of a woman's breast density is critical if the estimate is to be incorporated into risk assessment models. These results show promise for the clinical application of the algorithm in quantifying breast density in a repeatable manner, both at the time of imaging and in retrospective studies.

Figures

**Figure 1**
Sample digital mammograms of BIRADS categories I–IV in order of increasing percent density. (I) <25%; (II) 26%–50%; (III) 51%–75%; (IV) >75%.

**Figure 2**
Flowchart of the proposed algorithm.

**Figure 3**
Comparison of gray-level intensity distributions of the breast region in “For Processing” (i.e., “Raw”; images a and b), histogram-equalized raw (images c and d), and “For Presentation” (i.e., “Processed”; images e and f) digital mammograms of a woman in BIRADS category III.

**Figure 4**
Illustration of adaptive air threshold detection on a digital mammogram with nonzero air pixels. (Left) Histogram showing the location of the first major rise in gray-level values, E, (long-dashed line) and the computed air threshold, th, (short-dashed). (Right) Identified breast-air interface contour (white line).

**Figure 5**
Effect of gray-level histogram smoothing. (Left) Original mammogram with breast area outlined in white. (Center) Z-score normalized gray-level intensity histogram constructed with a bin width of 0.01. (Right) Histogram after smoothing with a Gaussian kernel of width = 50, alpha = 5.

**Figure 6**
Segmentation algorithm for a k = 6 mammogram. (a) Original mammogram; (b) normalized, smoothed breast-pixel intensity histogram with FCM cluster centroids (vertical lines); (c) pixel cluster-membership represented by shading; (d) final dense tissue segmentation combining clusters 5–6.

**Figure 7**
Scatter plots of per-breast (top row) and per-woman (bottom row) algorithm-estimated (x axis) and radiologist-provided (y axis) PD% for raw (left column) and processed (right column) DM image sets. Regression (solid) and unity (dashed) lines are provided for reference.

**Figure 8**
Distributions of per-breast (left) and per-woman (right) assessed PD% as a function of image presentation and assessment-method. Two-way ANOVA indicated no significant groupwise differences (p > 0.1).

**Figure 9**
Per-breast (top) and per-woman (bottom) box-plots of algorithm-estimated PD% in raw (left) and processed (right) DM images vs radiologist-provided categorical ACR BIRADS density scores. BIRADS categories were assigned using the standard thresholds on continuous PD%: (I) < 25%; (II) 25%–50%; (III) 51%–75%; (IV) >75%.

**Figure 10**
Cross-validation performance as a function of histogram-construction parameters (i.e., bin width, b, Gaussian kernel width, w, and kernel variance, α) for raw (top) and processed (bottom) digital mammograms.
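
The histogram-construction parameters named in the Figure 5 and Figure 10 captions (bin width b, Gaussian kernel width w, and α) can be made concrete with a small sketch. The kernel definition is not given in the text above, so a `gausswin`-style window, exp(-0.5 (α n / ((w - 1) / 2))²), is assumed here purely for illustration. The function names, the zero-padded smoothing, and the toy pixel values are likewise assumptions.

```python
import math


def zscore(values):
    """Z-score normalize intensities (population standard deviation)."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]


def histogram(values, bin_width):
    """Count values in consecutive bins of the given width, starting at the minimum."""
    lo = min(values)
    n_bins = int((max(values) - lo) / bin_width) + 1
    counts = [0] * n_bins
    for v in values:
        counts[int((v - lo) / bin_width)] += 1
    return counts


def gaussian_window(width, alpha):
    """Assumed gausswin-style kernel (width >= 2), normalized to sum to 1."""
    half = (width - 1) / 2.0
    win = [math.exp(-0.5 * (alpha * (i - half) / half) ** 2) for i in range(width)]
    total = sum(win)
    return [w / total for w in win]


def smooth(counts, window):
    """Same-length smoothing with zero padding beyond the histogram edges."""
    half = len(window) // 2
    out = []
    for i in range(len(counts)):
        acc = 0.0
        for j, w in enumerate(window):
            idx = i + j - half
            if 0 <= idx < len(counts):
                acc += w * counts[idx]
        out.append(acc)
    return out


# Toy breast-pixel intensities (hypothetical): a darker and a brighter group.
pixels = [100 + (i % 7) for i in range(200)] + [160 + (i % 5) for i in range(100)]

counts = histogram(zscore(pixels), bin_width=0.01)  # b = 0.01 as in Figure 5
window = gaussian_window(width=50, alpha=5)         # w = 50, alpha = 5
smoothed = smooth(counts, window)
print(len(counts), max(counts), round(max(smoothed), 2))
```

With a fine bin width the raw histogram is a series of isolated spikes; smoothing merges them into broader modes, which is the effect the Figure 5 caption describes.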
