Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;16(1):5876.
doi: 10.1038/s41467-025-60824-z.

Deep learning assessment of metastatic relapse risk from digitized breast cancer histological slides

Affiliations

Deep learning assessment of metastatic relapse risk from digitized breast cancer histological slides

I Garberis et al. Nat Commun. .

Abstract

Accurate risk stratification is critical for guiding treatment decisions in early breast cancer. We present an artificial intelligence (AI)-based tool that analyzes digitized tumor slides to predict 5-year metastasis-free survival (MFS) in patients with estrogen receptor-positive, HER2-negative (ER + /HER2 - ) early breast cancer (EBC). Our deep learning model, RlapsRisk BC, independently predicts MFS and provides significant prognostic value beyond traditional clinico-pathological variables (C-index 0.81 vs 0.76, p < 0.05). Applying a 5% MFS event probability threshold stratifies patients into low- and high-risk groups. After dichotomization, combining RlapsRisk BC with clinico-pathological factors increases cumulative sensitivity (0.69 vs 0.63) and dynamic specificity (0.80 vs 0.76) compared to clinical factors alone. Expert analysis of high-impact regions identified by the model highlights well-established morphological features, supporting its interpretability and biological relevance.

PubMed Disclaimer

Conflict of interest statement

Competing interests: V.G., C.S., K.E., B.S., A.J., L.H., R.D., M.A., L.G., M.S., A.S., J.R., F.B., J.D., and V.A. are employees of Owkin Inc. S.D. reports grants and non-financial support from Pfizer, grants from Novartis, grants and non-financial support from AstraZeneca, grants from Roche Genentech, grants from Lilly, grants from Orion, grants from Amgen, grants from Sanofi, grants from Exact Sciences, grants from Servier, grants from MSD, grants from BMS, grants from Pierre Fabre, grants from Exact Sciences, grants from Besins, grants from European Commission grants, grants from French government grants, grants from Fondation ARC grants, grants from Taiho, grants from Elsan, outside the submitted work. F.A. declares institutional financial interests, research grants with Novartis, Pfizer, AstraZeneca, Eli Lilly, Daiichi, Roche, and Sanofi. B.P. reports Consulting fees from Astra Zeneca (institutional), Seagen (institutional), Gilead (institutional), Novartis (institutional), Lilly (institutional), MSD (institutional), Pierre Fabre (personal), Daiichi-Sankyo (institutional/personal); research funding (institutional) from Astra Zeneca, Daiichi-Sankyo, Gilead, Seagen, MSD, and Fondation ARC. Travel support: Astra Zeneca; Pierre Fabre; MSD; Daiichi-Sankyo. M.L.-T. reports consulting fees from Astra Zeneca (institutional/personal), Seagen (personal), Lilly (personal), MSD (institutional/personal), Pierre Fabre (personal), Daiichi-Sankyo (institutional/personal), Myriad Genetics (personal), Exact Sciences (personal), Roche Diagnostics ((institutional/personal); research funding (institutional) from Roche Diagnostics, Daiichi-Sankyo, and Pierre Fabre. Travel support: AstraZeneca, Seagen, and Daiichi-Sankyo. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. RlapsRisk BC algorithm overview.
The left panel illustrates the algorithm’s processing steps, while the right panel details the training procedure and the overall model architecture designed to predict the risk group.
Fig. 2
Fig. 2. Forest plot of the adjusted RlapsRisk BC score HRs on the prediction of 5-year metastasis-free survival on the CANTO cohort.
Each square of the forest plot represents the HR of the RlapsRisk BC score (a continuous variable) adjusted for the prognostic clinico-pathological factors in the subgroup of patients defined by the variable category in the first column of the table. The 95% HR confidence intervals, computed using the Wald method, are represented by the horizontal lines. Histological tumor grade 1 and low clinical risk group were removed as no MFS events were recorded in these subgroups. Hazard ratios (HRs) and 95% confidence intervals were estimated using a Cox proportional hazards model. p Values correspond to two-sided Wald tests evaluating the null hypothesis HR = 1 and were adjusted for multiple comparisons using the Benjamini–Hochberg procedure. *p < 0.1, **p < 0.05, ***p < 0.01.
Fig. 3
Fig. 3. Metastases-free survival of patients stratified according to RlapsRisk BC classifier (left), Clinical score classifier (middle), and RR Combined classifier (right) among patients from the CANTO validation cohort.
p Values were computed using the two-sided log-rank test to compare survival distributions between groups. Shaded areas represent 95% confidence intervals estimated using the Greenwood formula.
Fig. 4
Fig. 4. Metastases-free survival of patients stratified according to RlapsRisk BC classifier (left), Clinical score classifier (center), and RR Combined classifier (right) among patients from the CANTO intermediate clinical risk validation cohort.
p Values were computed using the two-sided log-rank test to compare survival distributions between groups. Shaded areas represent 95% confidence intervals estimated using the Greenwood formula.
Fig. 5
Fig. 5. Interpretability of the RlapsRisk BC model.
The methodology applied to generate the data used for this analysis is outlined. To streamline pathologist review, slides were sampled to express as much diversity as possible in the high, low, and intermediary predicted risks. Shapley values were computed to evaluate the positive or negative contributions of clusters of similar tiles. A selection of the most contributing clusters of tiles was made, and random clusters were also sampled from the distribution of Shapley values. An expert pathologist annotated the data, blinded to any information about contribution or risk.
Fig. 6
Fig. 6. A selection of the histological features annotated by an expert pathologist.
The proportion of tiles annotated as containing a particular histological feature within clusters of tiles linked to high and low risk, as well as randomly selected ones, is presented alongside the significance of p values from the Chi-square test of independence computed between each group, adjusted for multiple testing using the Benjamini–Hochberg procedure. For each histological feature, the region of a WSI containing the annotated cluster of tiles is displayed, extracted both from the CANTO H&E dataset (upper histological image) and from the CANTO HES dataset (lower histological image). ns p > 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

References

    1. Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.71, 209–249 (2021). - PubMed
    1. Ferreira, A. R. et al. Differential impact of endocrine therapy (ET) and chemotherapy (CT) on quality of life (QoL) of 4,262 breast cancer (BC) survivors: a prospective patient-reported outcomes (PRO) analysis. J. Clin. Oncol.37, 512–512 (2019). - PubMed
    1. Mastro, L. D. et al. Extended therapy with letrozole as adjuvant treatment of postmenopausal patients with early-stage breast cancer: a multicentre, open-label, randomised, phase 3 trial. Lancet Oncol.22, 1458–1467 (2021). - PubMed
    1. Harbeck, N. et al. Adjuvant abemaciclib combined with endocrine therapy for high-risk early breast cancer: updated efficacy and Ki-67 analysis from the monarchE study. Ann. Oncol.32, 1571–1581 (2021). - PubMed
    1. Sparano, J. A. et al. Prospective validation of a 21-gene expression assay in breast cancer. N. Engl. J. Med.373, 2005–2014 (2015). - PMC - PubMed