J Med Imaging (Bellingham). 2025 Nov;12(Suppl 2):S22009. doi: 10.1117/1.JMI.12.S2.S22009. Epub 2025 May 14.

Breast tumor diagnosis via multimodal deep learning using ultrasound B-mode and Nakagami images


Sabiq Muhtadi et al. J Med Imaging (Bellingham). 2025 Nov.

Abstract

Purpose: We propose and evaluate multimodal deep learning (DL) approaches that combine ultrasound (US) B-mode and Nakagami parametric images for breast tumor classification. It is hypothesized that integrating tissue brightness information from B-mode images with scattering properties from Nakagami images will enhance diagnostic performance compared with single-input approaches.
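
For context, a Nakagami parametric image is typically formed by estimating the Nakagami shape parameter m from the beamformed echo envelope R within a sliding window, using the moment-based estimator m = (E[R^2])^2 / Var(R^2). The estimation settings used in the paper are not given in this abstract; the NumPy sketch below assumes a square window whose side length is a free parameter:

    import numpy as np

    def nakagami_map(envelope, win=15):
        # Moment-based Nakagami shape-parameter map from a 2D echo-envelope
        # image. The window side length `win` is an assumed value, not a
        # setting taken from the paper.
        r2 = envelope.astype(np.float64) ** 2
        h, w = r2.shape
        k = win // 2
        m_map = np.full((h, w), np.nan)
        for i in range(k, h - k):
            for j in range(k, w - k):
                patch = r2[i - k:i + k + 1, j - k:j + k + 1]
                mean, var = patch.mean(), patch.var()
                if var > 0:
                    m_map[i, j] = mean ** 2 / var  # m = E[R^2]^2 / Var(R^2)
        return m_map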

Approach: An EfficientNetV2B0 network was used to develop multimodal DL frameworks that took as input (i) numerical two-dimensional (2D) maps or (ii) rendered red-green-blue (RGB) representations of both B-mode and Nakagami data. The diagnostic performance of these frameworks was compared with that of single-input counterparts using 831 US acquisitions from 264 patients. In addition, gradient-weighted class activation mapping (Grad-CAM) was applied to evaluate the diagnostically relevant information utilized by the different networks.
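
The abstract does not specify the fusion design; one plausible realization is a two-input Keras model in which an EfficientNetV2B0 backbone embeds each modality and the pooled features are concatenated before a binary classification head. Sharing a single backbone between the two inputs is an assumption made here to keep the sketch small, not the paper's stated architecture:

    import tensorflow as tf
    from tensorflow.keras import layers, Model
    from tensorflow.keras.applications import EfficientNetV2B0

    def build_multimodal(input_shape=(224, 224, 3)):
        # One EfficientNetV2B0 backbone applied to both modalities
        # (weight sharing is an assumption, not the paper's stated design).
        backbone = EfficientNetV2B0(include_top=False, weights="imagenet",
                                    input_shape=input_shape, pooling="avg")
        bmode = layers.Input(shape=input_shape, name="bmode_rgb")
        nakagami = layers.Input(shape=input_shape, name="nakagami_rgb")
        fused = layers.Concatenate()([backbone(bmode), backbone(nakagami)])
        out = layers.Dense(1, activation="sigmoid", name="malignancy")(fused)
        return Model(inputs=[bmode, nakagami], outputs=out)

    model = build_multimodal()
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=[tf.keras.metrics.AUC(name="auc")])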

Results: The multimodal architectures demonstrated significantly higher area under the receiver operating characteristic curve (AUC) values (p < 0.05) than their monomodal counterparts, achieving an average improvement of 10.75%. In addition, the multimodal networks incorporated, on average, 15.70% more diagnostically relevant tissue information. Among the multimodal models, those using RGB representations as input outperformed those that utilized 2D numerical data maps (p < 0.05). The top-performing multimodal architecture achieved a mean AUC of 0.896 [95% confidence interval (CI): 0.813 to 0.959] when performance was assessed at the image level and 0.848 (95% CI: 0.755 to 0.903) when assessed at the lesion level.
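
The abstract does not state how the 95% CIs were computed; a common choice for image-level AUC is a percentile bootstrap over cases, sketched below. The number of resamples and the resampling unit are assumptions, and lesion-level evaluation would additionally require aggregating image scores per lesion (e.g., by averaging) before resampling:

    import numpy as np
    from sklearn.metrics import roc_auc_score

    def bootstrap_auc_ci(y_true, y_score, n_boot=2000, alpha=0.05, seed=0):
        # Percentile-bootstrap CI for the AUC: resample cases with
        # replacement, skipping resamples that contain only one class.
        rng = np.random.default_rng(seed)
        y_true, y_score = np.asarray(y_true), np.asarray(y_score)
        n, aucs = len(y_true), []
        while len(aucs) < n_boot:
            idx = rng.integers(0, n, n)
            if np.unique(y_true[idx]).size < 2:
                continue
            aucs.append(roc_auc_score(y_true[idx], y_score[idx]))
        lo, hi = np.percentile(aucs, [100 * alpha / 2, 100 * (1 - alpha / 2)])
        return roc_auc_score(y_true, y_score), lo, hi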

Conclusions: Incorporating B-mode and Nakagami information together in a multimodal DL framework improved classification outcomes and increased the amount of diagnostically relevant information accessed by networks, highlighting the potential for automating and standardizing US breast cancer diagnostics to enhance clinical outcomes.

Keywords: breast cancer; deep learning; multimodal deep learning; quantitative ultrasound; ultrasound imaging.
