Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 26;21(7):e14464.
doi: 10.2196/14464.

Breast Cancer Detection and Diagnosis Using Mammographic Data: Systematic Review

Affiliations

Breast Cancer Detection and Diagnosis Using Mammographic Data: Systematic Review

Syed Jamal Safdar Gardezi et al. J Med Internet Res. .

Abstract

Background: Machine learning (ML) has become a vital part of medical imaging research. ML methods have evolved over the years from manual seeded inputs to automatic initializations. The advancements in the field of ML have led to more intelligent and self-reliant computer-aided diagnosis (CAD) systems, as the learning ability of ML methods has been constantly improving. More and more automated methods are emerging with deep feature learning and representations. Recent advancements of ML with deeper and extensive representation approaches, commonly known as deep learning (DL) approaches, have made a very significant impact on improving the diagnostics capabilities of the CAD systems.

Objective: This review aimed to survey both traditional ML and DL literature with particular application for breast cancer diagnosis. The review also provided a brief insight into some well-known DL networks.

Methods: In this paper, we present an overview of ML and DL techniques with particular application for breast cancer. Specifically, we search the PubMed, Google Scholar, MEDLINE, ScienceDirect, Springer, and Web of Science databases and retrieve the studies in DL for the past 5 years that have used multiview mammogram datasets.

Results: The analysis of traditional ML reveals the limited usage of the methods, whereas the DL methods have great potential for implementation in clinical analysis and improve the diagnostic capability of existing CAD systems.

Conclusions: From the literature, it can be found that heterogeneous breast densities make masses more challenging to detect and classify compared with calcifications. The traditional ML methods present confined approaches limited to either particular density type or datasets. Although the DL methods show promising improvements in breast cancer diagnosis, there are still issues of data scarcity and computational cost, which have been overcome to a significant extent by applying data augmentation and improved computational power of DL algorithms.

Keywords: breast cancer; convolutional neural networks; deep learning; lesion classification; machine learning; malignant tumor.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Figure 1
Multiview breast mammogram of a patient. The first column presents two views of the right breast: right craniocaudal (RCC) view and right mediolateral oblique (RMLO) view. The second column presents two views of the left breast: left craniocaudal (LCC) view and left mediolateral oblique (LMLO) view.
Figure 2
Figure 2
Difference between 2 pipelines: conventional machine learning pipeline (left) and deep learning pipeline (right).
Figure 3
Figure 3
(a) Original mammogram image 1024×1024. (b) Preprocessing to remove annotations. (c) pectoral muscle (PM) removal by region growing. (d) PM removal by adaptive segmentation.
Figure 4
Figure 4
Pixel-level illustration of true positive, false positive, and false negative compared with ground truth.
Figure 5
Figure 5
An overview of mammogram processing using computer-aided diagnosis based on machine learning algorithms.
Figure 6
Figure 6
Sample results from the study by Ribli et al for mass detection and classification.
Figure 7
Figure 7
An overview of conditional generative adversarial network adapted from the study by Singh et al for mass segmentation and shape classification. CNN: convolutional neural network.
Figure 8
Figure 8
Sample results from the study by Wu et al for synthetic generation of data using conditional generative adversarial network. GAN: generative adversarial network.

References

    1. American Cancer Society. 2018. Global Cancer: Facts & Figures, 4th edition http://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-s....
    1. Blakely T, Shaw C, Atkinson J, Cunningham R, Sarfati D. Social inequalities or inequities in cancer incidence? Repeated census-cancer cohort studies, New Zealand 1981-1986 to 2001-2004. Cancer Causes Control. 2011 Sep;22(9):1307–18. doi: 10.1007/s10552-011-9804-x. - DOI - PubMed
    1. Smigal C, Jemal A, Ward E, Cokkinides V, Smith R, Howe HL, Thun M. Trends in breast cancer by race and ethnicity: update 2006. CA Cancer J Clin. 2006;56(3):168–83. doi: 10.3322/canjclin.56.3.168. https://onlinelibrary.wiley.com/resolve/openurl?genre=article&sid=nlm:pu... - DOI - PubMed
    1. Guide to Mammography And Other Breast Imaging Procedures. New York: Natl Council on Radiation; 2012.
    1. Ponraj DN, Jenifer ME, Poongodi DP, Manoharan JS. A survey on the preprocessing techniques of mammogram for the detection of breast cancer. J Emerg Trends Comput Inf Sci. 2011;2(12):656–64. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.651.592

Publication types