Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 23;20(1):72.
doi: 10.1186/s13062-025-00661-8.

Multimodal deep learning for predicting neoadjuvant treatment outcomes in breast cancer: a systematic review

Affiliations

Multimodal deep learning for predicting neoadjuvant treatment outcomes in breast cancer: a systematic review

Eriseld Krasniqi et al. Biol Direct. .

Abstract

Background: Pathological complete response (pCR) to neoadjuvant systemic therapy (NAST) is an established prognostic marker in breast cancer (BC). Multimodal deep learning (DL), integrating diverse data sources (radiology, pathology, omics, clinical), holds promise for improving pCR prediction accuracy. This systematic review synthesizes evidence on multimodal DL for pCR prediction and compares its performance against unimodal DL.

Methods: Following PRISMA, we searched PubMed, Embase, and Web of Science (January 2015-April 2025) for studies applying DL to predict pCR in BC patients receiving NAST, using data from radiology, digital pathology (DP), multi-omics, and/or clinical records, and reporting AUC. Data on study design, DL architectures, and performance (AUC) were extracted. A narrative synthesis was conducted due to heterogeneity.

Results: Fifty-one studies, mostly retrospective (90.2%, median cohort 281), were included. Magnetic resonance imaging and DP were common primary modalities. Multimodal approaches were used in 52.9% of studies, often combining imaging with clinical data. Convolutional neural networks were the dominant architecture (88.2%). Longitudinal imaging improved prediction over baseline-only (median AUC 0.91 vs. 0.82). Overall, the median AUC across studies was 0.88, with 35.3% achieving AUC ≥ 0.90. Multimodal models showed a modest but consistent improvement over unimodal approaches (median AUC 0.88 vs. 0.83). Omics and clinical text were rarely primary DL inputs.

Conclusion: DL models demonstrate promising accuracy for pCR prediction, especially when integrating multiple modalities and longitudinal imaging. However, significant methodological heterogeneity, reliance on retrospective data, and limited external validation hinder clinical translation. Future research should prioritize prospective validation, integration underutilized data (multi-omics, clinical), and explainable AI to advance DL predictors to the clinical setting.

Keywords: Breast cancer; Deep learning; Multimodal prediction; Neoadjuvant treatment.

PubMed Disclaimer

Conflict of interest statement

Declarations: Ethics approval and consent to participate. Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow diagram of preferred reporting items for systematic reviews and meta-analyses (PRISMA). DL, deep learning
Fig. 2
Fig. 2
Sankey diagram showing the distribution of primary, secondary and tertiary modalities among the 51 studies included in this review. The left column indicates each study’s primary data modality, the middle column its secondary data modality, and the right column its tertiary data modality. Flow widths are proportional to the number of studies integrating a data modality level with the next. MRI, Magnetic Resonance Imaging; DP, Digital pathology; US, Ultrasonography (Ultrasound); CT, Computed tomography; MG, Mammography; PET, Positron emission tomography; RNA, RNA Sequencing data; Clinical: Clinical data; None, No additional data modality. Numbers in parentheses denote study counts (n) for each modality at the given level
Fig. 3
Fig. 3
Bubble plot displaying the median Area Under the Receiver Operating Characteristic Curve (AUC) of studies grouped based on the deep learning (DL) architecture utilized. Each circle represents a set of studies, with relative dimension representing the number of studies included. CNN, Convolutional neural network; CNN + , Convolutional neural network augmented with additional modules like Attention mechanisms, etc.; TRA, Transformer architecture; CML, Classical machine learning

Similar articles

References

    1. Cortazar P, Zhang L, Untch M, Mehta K, Costantino JP, Wolmark N, et al. Pathological complete response and long-term clinical benefit in breast cancer: the CTNeoBC pooled analysis. Lancet. 2014;384:164–72. - PubMed
    1. Symmans WF, Peintinger F, Hatzis C, Rajan R, Kuerer H, Valero V, et al. Measurement of residual breast cancer burden to predict survival after neoadjuvant chemotherapy. J Clin Oncol. 2007;25:4414–22. - PubMed
    1. Spring LM, Fell G, Arfe A, Sharma C, Greenup R, Reynolds KL, et al. Pathologic complete response after neoadjuvant chemotherapy and impact on breast cancer recurrence and survival: A comprehensive meta-analysis. Clin Cancer Res. 2020;26:2838–48. - PMC - PubMed
    1. von Minckwitz G, Untch M, Blohmer J-U, Costa SD, Eidtmann H, Fasching PA, et al. Definition and impact of pathologic complete response on prognosis after neoadjuvant chemotherapy in various intrinsic breast cancer subtypes. J Clin Oncol. 2012;30:1796–804. - PubMed
    1. Houssami N, Macaskill P, von Minckwitz G, Marinovich ML, Mamounas E. Meta-analysis of the association of breast cancer subtype and pathologic complete response to neoadjuvant chemotherapy. Eur J Cancer. 2012;48:3342–54. - PubMed

Publication types

LinkOut - more resources