Transformer-based biomarker prediction from colorectal cancer histology: A large-scale multicentric study
- PMID: 37652006
- PMCID: PMC10507381
- DOI: 10.1016/j.ccell.2023.08.002
Abstract
Deep learning (DL) can accelerate the prediction of prognostic biomarkers from routine pathology slides in colorectal cancer (CRC). However, current approaches rely on convolutional neural networks (CNNs) and have mostly been validated on small patient cohorts. Here, we develop a new transformer-based pipeline for end-to-end biomarker prediction from pathology slides by combining a pre-trained transformer encoder with a transformer network for patch aggregation. Our transformer-based approach substantially improves the performance, generalizability, data efficiency, and interpretability as compared with current state-of-the-art algorithms. After training and evaluating on a large multicenter cohort of over 13,000 patients from 16 colorectal cancer cohorts, we achieve a sensitivity of 0.99 with a negative predictive value of over 0.99 for prediction of microsatellite instability (MSI) on surgical resection specimens. We demonstrate that resection specimen-only training reaches clinical-grade performance on endoscopic biopsy tissue, solving a long-standing diagnostic problem.
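For readers less familiar with the two headline metrics, the following minimal sketch shows how sensitivity and negative predictive value (NPV) are computed from confusion-matrix counts. The function names and the counts are hypothetical and chosen only for illustration; they are not taken from the study's data.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Fraction of truly positive (e.g. MSI) cases that the model flags as positive."""
    if tp + fn == 0:
        raise ValueError("no positive cases: sensitivity is undefined")
    return tp / (tp + fn)


def negative_predictive_value(tn: int, fn: int) -> float:
    """Fraction of negative predictions that are truly negative (e.g. MSS)."""
    if tn + fn == 0:
        raise ValueError("no negative predictions: NPV is undefined")
    return tn / (tn + fn)


# Hypothetical counts for illustration only (NOT results from the paper).
tp, fn = 99, 1      # positive cases: correctly flagged vs. missed
tn, fp = 600, 300   # negative cases: correctly ruled out vs. false alarms

sens = sensitivity(tp, fn)
npv = negative_predictive_value(tn, fn)
print(f"sensitivity = {sens:.3f}, NPV = {npv:.4f}")
```

In a pre-screening setting, a high sensitivity paired with a high NPV means that cases the model calls negative can be ruled out with few missed positives, while positive calls would still go on to confirmatory testing.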
Keywords: artificial intelligence; biomarker; colorectal cancer; deep learning; microsatellite instability; multiple instance learning; transformer.
Copyright © 2023 The Author(s). Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests: J.N.K. reports consulting services for Owkin, France, Panakeia, UK, and DoMore Diagnostics, Norway and has received honoraria for lectures by MSD, Eisai, and Fresenius. N.W. has received fees for advisory board activities with BMS, Astellas, GSK, and Amgen, not related to this study. N.W. has received fees for advisory board activities with BMS, Astellas, and Amgen, not related to this study. P.Q. has received fees for advisory board activities with Roche and Amgen and research funding from Roche through an Innovate UK National Pathology Imaging Consortium grant. H.I.G. has received fees for advisory board activities by AstraZeneca and BMS, not related to this study. M.S.T. is a scientific advisor to Mindpeak and Sonrai Analytics, and has received honoraria recently from BMS, MSD, Roche, Sanofi, and Incyte. He has received grant support from Phillips, Roche, MSD, and Akoya. None of these disclosures are related to this work. D.N.C. has participated in advisory boards for MSD and has received research funding on behalf of the TransSCOT consortium from HalioDx for analyses independent of this study. V.H.K. has served as an invited speaker on behalf of Indica Labs and has received project-based research funding from The Image Analysis Group and Roche outside of the submitted work. No other potential disclosures are reported by any of the authors.
Comment in
- Deep learning transforms colorectal cancer biomarker prediction from histopathology images. Cancer Cell. 2023 Sep 11;41(9):1543-1545. doi: 10.1016/j.ccell.2023.08.006. Epub 2023 Aug 30. PMID: 37652005