Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 2;16(1):2106.
doi: 10.1038/s41467-025-57283-x.

Multimodal histopathologic models stratify hormone receptor-positive early breast cancer

Affiliations

Multimodal histopathologic models stratify hormone receptor-positive early breast cancer

Kevin M Boehm et al. Nat Commun. .

Abstract

The Oncotype DX® Recurrence Score (RS) is an assay for hormone receptor-positive early breast cancer with extensively validated predictive and prognostic value. However, its cost and lag time have limited global adoption, and previous attempts to estimate it using clinicopathologic variables have had limited success. To address this, we assembled 6172 cases across three institutions and developed Orpheus, a multimodal deep learning tool to infer the RS from H&E whole-slide images. Our model identifies TAILORx high-risk cases (RS > 25) with an area under the curve (AUC) of 0.89, compared to a leading clinicopathologic nomogram with 0.73. Furthermore, in patients with RS ≤ 25, Orpheus ascertains risk of metastatic recurrence more accurately than the RS itself (0.75 vs 0.49 mean time-dependent AUC). These findings have the potential to guide adjuvant therapy for high-risk cases and tailor surveillance for patients at elevated metastatic recurrence risk.

PubMed Disclaimer

Conflict of interest statement

Competing interests: J.S.R.-F. reports receiving personal/consultancy fees from Goldman Sachs, Bain Capital, REPARE Therapeutics, Saga Diagnostics, and Paige.AI, membership of the scientific advisory boards of VolitionRx, REPARE Therapeutics and Paige.AI, membership of the Board of Directors of Grupo Oncoclinicas, and ad hoc membership of the scientific advisory boards of AstraZeneca, Merck, Daiichi Sankyo, Roche Tissue Diagnostics and Personalis, outside the scope of this study. He is also currently employed at AstraZeneca. OSMEN holds shares in StratifAI GmbH. JNK declares consulting services for Owkin, France, DoMore Diagnostics, Norway, Panakeia, UK, Scailyte, Switzerland, Cancilico, Germany, Mindpeak, Germany, MultiplexDx, Slovakia, and Histofy, UK; furthermore, he holds shares in StratifAI GmbH, Germany, has received a research grant by GSK, and has received honoraria by AstraZeneca, Bayer, Eisai, Janssen, MSD, BMS, Roche, Pfizer and Fresenius. The mentioned competing interests are related to cancer and the computational analysis of histopathology slides, which is the main topic of this research. SPS conducts collaborative research with AstraZeneca Inc., outside the scope of this study. FP reports receiving consultancy fees from AstraZeneca and reports membership of advisory boards of MultiplexDx and AstraZeneca. EDP and PC are employed at MultiplexDX. A.M. has received honoraria as a consultant, advisor, or speaker from Roche and Menarini/Stemline and has received travel and accommodation support from AstraZeneca. E.M. received honoraria for Consulting or Advisory Role: Exact Sciences, MSD Oncology, Daiichi Sankyo/AstraZeneca, Pfizer, Seagen, Ipsen. Travel, Accommodations, Expenses: Roche, Pfizer, Lilly, Novartis, Gilead Sciences, AstraZeneca, Pierre Fabre. R.E. has received honoraria from Roche, Eisai, Pfizer, BioNTech, Veracyte (PROCURE), Diaceutics, Mindpeak, AstraZeneca, MEDAC, and Novartis. The institution of R.E. conducts research for AstraZeneca, Roche, Janssen-Cilag, NanoString Technologies, Biocartis, ZytoVision, Novartis, Cepheid, Mindpeak, MSD, Gilead, Palleos Healthcare, Owkin and BioNTech. E.D.P. and P.C. are current employees of and receive research funding from MultiplexDX. P.C. is an equity holder in MultiplexDX. BW reports research funding from Repare Therapeutics. SC discloses compensation for professional services to Boxer Capital, LLC, Eli Lilly and Company, Encore Medical Education, Genesis Therapeutics, Gerson Lehrman Group, Novartis, Nuvalent, Inc., Prelude Therapeutics, SAGA Diagnostics, and eFFECTOR Therapeutics. He also discloses equity and IP rights in Odyssey Biosciences and equity in Totus Medicines, Inc. H.W. is an advisor for AstraZeneca and Daiichi Sankyo. P.R. reports receiving funding from Grail, Illumina, Novartis, AstraZeneca, Epic Sciences, Invitae/ArcherDx, Biothernostics, Tempus, Inivata, Biovica, Guardant, Personalis, and Myriad. P.R. also is an advisor to Novartis, AstraZeneca, Pfizer, Lilly/Loxo, Prelude Therapeutics, Epic Sciences, Foundation Medicine, Inivata, Natera, Tempus, SAGA Diagnostics, Paige.ai, Guardant, and Myriad. P.R. is also a co-founder and board member of Odyssey Biosciences. LZB reports providing professional services to the Cancer Prevention & Research Institute of Texas. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Developing a multimodal transformer model for breast cancer risk.
Early-stage breast tumors are a resected, b profiled histologically, c digitized, and d used for downstream modeling of recurrence risk. e Number of pathologic slides, pathology reports, and patients included in each split. f Histogram depicting number of slides with a given number of tiles. g Tissue detection, tessellation, transformer-based modeling of CTransPath-derived tile embeddings, pathology report scraping, tokenization and transformer-based modeling, nuclear segmentation for interpretation, tensor fusion for multimodal integration. Created in BioRender. Boehm, K. (2025) https://biorender.com/s94u717.
Fig. 2
Fig. 2. Orpheus performance for TAILORx risk stratification.
ac The whole-slide image (WSI)-based model reliably identifies high-risk disease (RS > 25) as defined by TAILORx across the MSK-BRCA (n = 1029), IEO-BRCA (n = 452), and MDX-BRCA (n = 572) test cohorts. d The multimodal model outperforms the WSI- and text-based unimodal models. Error bars by 1000-fold bootstrapping. e, f The multimodal model outperforms a clinicogenomic nomogram in identifying high-risk diseases. g Calibration plot and h predicted score frequencies. All results are shown for test sets.
Fig. 3
Fig. 3. Orpheus+ performance for identifying distant recurrence.
a, b Orpheus+ risk scores versus Oncotype DX ® (ODX, N = 1464) and Multiplex DX ® (MDX, N = 575) recurrence score (RS) values for cases with or without distant recurrence. p-values by Mann–Whitney-Wilcoxon test, two-sided. c Time-dependent areas under the receiver operating characteristic curve for Orpheus+ and ODX RS scores against recurrence in the MSK-BRCA test set for cases with ODX RS ≤ 25, with associated calibration plots. The mean value represents AUC over 2–8 years. d Receiver operating characteristic curves for Orpheus+ and MDX RS scores against recurrence in the MDX-BRCA test/validation set, with associated calibration plots. **** denotes p ≤ 1e−4, *** denotes p ≤ 1e-3, ** denotes p ≤ 1e-2, and * denotes – ≤ 5e-2. In box plots, boxes denote 25th–75th percentiles with lines at the median, whiskers denote the range without outliers, and individual points denote outliers. Exact p-values, top to bottom, for a are 5e-7 and 0.80, and for b, are 0.0013 and 0.0011.
Fig. 4
Fig. 4. Cellular and transcriptomic correlates of risk.
a Saliency map of contributory foreground tiles, with one tile-attention value pair denoted by the arrows and predicted score. b Histogram of tile-attention values. c The five highest- and lowest-attention tiles at greater magnification. d Association of cellular features with high- and low-risk tissue. Hypothesis testing was performed with the two-sided Mann–Whitney U-test with corrections for multiple testing. e High and f low relative abundance of inflammatory cells. g High and h low standard deviation of neoplastic nuclear area. il Quantification of stromal fraction (SF), tumor cell proliferation (Prolif), lymphocyte infiltrating signature score (LISS), and lymphocyte fraction (LF) for predicted low- and high-risk patients (50 each) depicted in blue and orange, respectively, in the MSK-BRCA cohort. p-values are generated using an independent two-sided t-test. In box plots, boxes denote 25th–75th percentiles, whiskers denote the range without outliers, and individual points denote outliers. Scale bars denote 64 µm.
Fig. 5
Fig. 5. Potential clinical use case of the Orpheus recurrence risk prediction model.
The Orpheus multimodal prediction model for recurrence risk prediction is potentially capable of guiding decision-making for adjuvant cytotoxic chemotherapy alongside adjuvant endocrine therapy for predicted low- and high-risk patients. The model is within scope for early-stage hormone receptor-positive (HR+) and HER2- breast cancer patients. Created in BioRender. Marra, A. (2025) https://BioRender.com/l96q019.

References

    1. Giaquinto, A. N. et al. Breast Cancer Statistics, 2022. CA Cancer J. Clin.72, 524–541 (2022). - PubMed
    1. Paik, S. et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N. Engl. J. Med.351, 2817–2826 (2004). - PubMed
    1. Morganti, S. et al. Refining risk stratification in HR-positive/HER2-negative early breast cancer: how to select patients for treatment escalation? Breast Cancer Res. Treat.192, 465–484 (2022). - PubMed
    1. Nitz, U. et al. Reducing chemotherapy use in clinically high-risk, genomically low-risk pN0 and pN1 early breast cancer patients: five-year data from the prospective, randomised phase 3 West German Study Group (WSG) PlanB trial. Breast Cancer Res. Treat.165, 573–583 (2017). - PMC - PubMed
    1. Dowsett, M. et al. Prediction of risk of distant recurrence using the 21-gene recurrence score in node-negative and node-positive postmenopausal patients with breast cancer treated with anastrozole or tamoxifen: a TransATAC study. J. Clin. Oncol.28, 1829–1834 (2010). - PubMed

Substances