Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Aug;7(8):100891.
doi: 10.1016/j.landig.2025.100891. Epub 2025 Aug 19.

Assessing genotype-phenotype correlations in colorectal cancer with deep learning: a multicentre cohort study

Affiliations
Free article
Multicenter Study

Assessing genotype-phenotype correlations in colorectal cancer with deep learning: a multicentre cohort study

Marco Gustav et al. Lancet Digit Health. 2025 Aug.
Free article

Abstract

Background: Deep learning-based models enable the prediction of molecular biomarkers from histopathology slides of colorectal cancer stained with haematoxylin and eosin; however, few studies have assessed prediction targets beyond microsatellite instability (MSI), BRAF, and KRAS systematically. We aimed to develop and validate a multi-target model based on deep learning for the simultaneous prediction of numerous genetic alterations and their associated phenotypes in colorectal cancer.

Methods: In this multicentre cohort study, tissue samples from patients with colorectal cancer were obtained by surgical resection and stained with haematoxylin and eosin. These samples were then digitised into whole-slide images and used to train and test a transformer-based deep learning algorithm for biomarker detection to simultaneously predict multiple genetic alterations and provide heatmap explanations. The primary dataset comprised 1376 patients from five cohorts who underwent comprehensive panel sequencing, with an additional 536 patients from two public datasets for validation. We compared the model's performance against conventional single-target models and examined the co-occurrence of alterations and shared morphology.

Findings: The multi-target model was able to predict numerous biomarkers from pathology slides, matching and partly exceeding single-target transformers. In the primary external validation cohorts, mean area under the receiver operating characteristic curve (AUROC) for the multi-target transformer was 0·78 (SD 0·01) for BRAF, 0·88 (0·01) for hypermutation, 0·93 (0·01) for MSI, and 0·86 (0·01) for RNF43; predictive performance was consistent across metrics and supported by co-occurrence analyses. However, biomarkers with high AUROCs largely correlated with MSI, with model predictions depending considerably on morphology associated with MSI at pathological examination.

Interpretation: By use of morphology associated with MSI and more subtle biomarker-specific patterns within a shared phenotype, the multi-target transformers efficiently predicted biomarker status for diverse genetic alterations in colorectal cancer from slides stained with haematoxylin and eosin. These results highlight the importance of considering mutational co-occurrence and common morphology in biomarker research based on deep learning. Our validated and scalable model could support extension to other cancers and large, diverse cohorts, potentially facilitating cost-effective pre-screening and streamlined diagnostics in precision oncology.

Funding: German Federal Ministry of Health, Max-Eder-Programme of German Cancer Aid, German Federal Ministry of Education and Research, German Academic Exchange Service, and the EU.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests JNK declares consulting services for Bioptimus, Owkin, DoMore Diagnostics, Panakeia, AstraZeneca, Mindpeak, and MultiplexDx. He also holds shares in StratifAI GmbH and Synagen GmbH; has received a research grant from GSK; and has received honoraria from AstraZeneca, Bayer, Daiichi Sankyo, Eisai, Janssen, Merck, MSD, Bristol Myers Squibb, Roche, Pfizer, and Fresenius. MG has received honoraria for lectures sponsored by Techniker Krankenkasse and AstraZeneca. NGR has received travel compensation from Bruker Spatial Biology (formerly NanoString); and holds stocks for Bayer, Pfizer, AstraZeneca, Roche, and Abivax. SF has received honoraria for lectures from Bristol Myers Squibb and MSD. UP reports consulting services for AbbVie. UP’s husband holds individual stocks for Arm Holdings, BYD Company, Crowdstrike Holdings, Stellantis, BioNTech SE ADR, Amazon, CureVac, NanoString Technologies, Google–Alphabet Class C, Nvidia, and Microsoft. All other authors declare no competing interests.

Update of

Publication types