J Med Imaging (Bellingham). 2025 Nov;12(Suppl 2):S22009. doi: 10.1117/1.JMI.12.S2.S22009. Epub 2025 May 14.

Breast tumor diagnosis via multimodal deep learning using ultrasound B-mode and Nakagami images


Sabiq Muhtadi et al. J Med Imaging (Bellingham). 2025 Nov.

Abstract

Purpose: We propose and evaluate multimodal deep learning (DL) approaches that combine ultrasound (US) B-mode and Nakagami parametric images for breast tumor classification. It is hypothesized that integrating tissue brightness information from B-mode images with scattering properties from Nakagami images will enhance diagnostic performance compared with single-input approaches.
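
For context, a Nakagami parametric image is typically formed by estimating the Nakagami shape parameter m from the beamformed echo envelope R within a sliding window, using the moment-based estimator m = (E[R^2])^2 / Var(R^2). The estimation settings used in the paper are not given in this abstract; the NumPy sketch below assumes a square window whose side length is a free parameter:

    import numpy as np

    def nakagami_map(envelope, win=15):
        # Moment-based Nakagami shape-parameter map from a 2D echo-envelope
        # image. The window side length `win` is an assumed value, not a
        # setting taken from the paper.
        r2 = envelope.astype(np.float64) ** 2
        h, w = r2.shape
        k = win // 2
        m_map = np.full((h, w), np.nan)
        for i in range(k, h - k):
            for j in range(k, w - k):
                patch = r2[i - k:i + k + 1, j - k:j + k + 1]
                mean, var = patch.mean(), patch.var()
                if var > 0:
                    m_map[i, j] = mean ** 2 / var  # m = E[R^2]^2 / Var(R^2)
        return m_map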

Approach: An EfficientNetV2B0 network was used to develop multimodal DL frameworks that took as input (i) numerical two-dimensional (2D) maps or (ii) rendered red-green-blue (RGB) representations of both B-mode and Nakagami data. The diagnostic performance of these frameworks was compared with that of single-input counterparts using 831 US acquisitions from 264 patients. In addition, gradient-weighted class activation mapping (Grad-CAM) was applied to evaluate the diagnostically relevant information utilized by the different networks.
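
The abstract does not specify the fusion design; one plausible realization is a two-input Keras model in which an EfficientNetV2B0 backbone embeds each modality and the pooled features are concatenated before a binary classification head. Sharing a single backbone between the two inputs is an assumption made here to keep the sketch small, not the paper's stated architecture:

    import tensorflow as tf
    from tensorflow.keras import layers, Model
    from tensorflow.keras.applications import EfficientNetV2B0

    def build_multimodal(input_shape=(224, 224, 3)):
        # One EfficientNetV2B0 backbone applied to both modalities
        # (weight sharing is an assumption, not the paper's stated design).
        backbone = EfficientNetV2B0(include_top=False, weights="imagenet",
                                    input_shape=input_shape, pooling="avg")
        bmode = layers.Input(shape=input_shape, name="bmode_rgb")
        nakagami = layers.Input(shape=input_shape, name="nakagami_rgb")
        fused = layers.Concatenate()([backbone(bmode), backbone(nakagami)])
        out = layers.Dense(1, activation="sigmoid", name="malignancy")(fused)
        return Model(inputs=[bmode, nakagami], outputs=out)

    model = build_multimodal()
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=[tf.keras.metrics.AUC(name="auc")])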

Results: The multimodal architectures demonstrated significantly higher area under the receiver operating characteristic curve (AUC) values (p < 0.05) than their monomodal counterparts, achieving an average improvement of 10.75%. In addition, the multimodal networks incorporated, on average, 15.70% more diagnostically relevant tissue information. Among the multimodal models, those using RGB representations as input outperformed those that utilized 2D numerical data maps (p < 0.05). The top-performing multimodal architecture achieved a mean AUC of 0.896 [95% confidence interval (CI): 0.813 to 0.959] when performance was assessed at the image level and 0.848 (95% CI: 0.755 to 0.903) when assessed at the lesion level.
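
The abstract does not state how the 95% CIs were computed; a common choice for image-level AUC is a percentile bootstrap over cases, sketched below. The number of resamples and the resampling unit are assumptions, and lesion-level evaluation would additionally require aggregating image scores per lesion (e.g., by averaging) before resampling:

    import numpy as np
    from sklearn.metrics import roc_auc_score

    def bootstrap_auc_ci(y_true, y_score, n_boot=2000, alpha=0.05, seed=0):
        # Percentile-bootstrap CI for the AUC: resample cases with
        # replacement, skipping resamples that contain only one class.
        rng = np.random.default_rng(seed)
        y_true, y_score = np.asarray(y_true), np.asarray(y_score)
        n, aucs = len(y_true), []
        while len(aucs) < n_boot:
            idx = rng.integers(0, n, n)
            if np.unique(y_true[idx]).size < 2:
                continue
            aucs.append(roc_auc_score(y_true[idx], y_score[idx]))
        lo, hi = np.percentile(aucs, [100 * alpha / 2, 100 * (1 - alpha / 2)])
        return roc_auc_score(y_true, y_score), lo, hi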

Conclusions: Incorporating B-mode and Nakagami information together in a multimodal DL framework improved classification outcomes and increased the amount of diagnostically relevant information accessed by networks, highlighting the potential for automating and standardizing US breast cancer diagnostics to enhance clinical outcomes.

Keywords: breast cancer; deep learning; multimodal deep learning; quantitative ultrasound; ultrasound imaging.
