Development and interpretation of a pathomics-based model for the prediction of microsatellite instability in Colorectal Cancer
- PMID: 33042271
- PMCID: PMC7532670
- DOI: 10.7150/thno.49864
Abstract
Microsatellite instability (MSI) has been approved as a pan-cancer biomarker for immune checkpoint blockade (ICB) therapy. However, current MSI identification methods are not available for all patients. We proposed an ensemble multiple instance deep learning model to predict microsatellite status based on histopathology images, and interpreted the pathomics-based model with multi-omics correlation.
Methods: Two cohorts of patients were collected, including 429 from The Cancer Genome Atlas (TCGA-COAD) and 785 from an Asian colorectal cancer (CRC) cohort (Asian-CRC). We established the pathomics model, named Ensembled Patch Likelihood Aggregation (EPLA), based on two consecutive stages: patch-level prediction and WSI-level prediction. The initial model was developed and validated in TCGA-COAD, and then generalized in Asian-CRC through transfer learning. The pathological signatures extracted from the model were analyzed with genomic and transcriptomic profiles for model interpretation.
Results: The EPLA model achieved an area-under-the-curve (AUC) of 0.8848 (95% CI: 0.8185-0.9512) in the TCGA-COAD test set and an AUC of 0.8504 (95% CI: 0.7591-0.9323) in the external validation set Asian-CRC after transfer learning. Notably, EPLA captured the relationship between pathological phenotype of poor differentiation and MSI (P < 0.001). Furthermore, the five pathological imaging signatures identified from the EPLA model were associated with mutation burden and DNA damage repair related genotype in the genomic profiles, and antitumor immunity activated pathway in the transcriptomic profiles.
Conclusions: Our pathomics-based deep learning model can effectively predict MSI from histopathology images and is transferable to a new patient cohort. The interpretability of our model by association with pathological, genomic and transcriptomic phenotypes lays the foundation for prospective clinical trials of the application of this artificial intelligence (AI) platform in ICB therapy.
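The two-stage idea described in the Methods (patch-level prediction followed by WSI-level prediction) can be illustrated with a minimal sketch. The abstract does not specify how patch likelihoods are aggregated or how the ensemble is formed, so everything below is an assumption made for illustration: the function names `patch_likelihood_histogram` and `slide_score` are hypothetical, the 10-bin histogram, the 0.5 voting threshold, and the equal-weight average are arbitrary choices, and the patch probabilities are made-up inputs rather than outputs of any trained network. It is not the authors' EPLA implementation.

```python
from statistics import mean


def patch_likelihood_histogram(patch_probs, n_bins=10):
    """Summarize one slide's patch-level MSI probabilities as a normalized histogram.

    Assumption: probabilities lie in [0, 1]; the bin count is an illustrative choice.
    """
    if not patch_probs:
        raise ValueError("a slide needs at least one patch probability")
    counts = [0] * n_bins
    for p in patch_probs:
        # Clamp p == 1.0 into the last bin.
        idx = min(int(p * n_bins), n_bins - 1)
        counts[idx] += 1
    total = len(patch_probs)
    return [c / total for c in counts]


def slide_score(patch_probs, n_bins=10):
    """Toy WSI-level score: equal-weight average of two simple aggregators.

    This stands in for an ensemble of slide-level predictors; the real
    aggregation and weighting are not given in the abstract.
    """
    hist = patch_likelihood_histogram(patch_probs, n_bins)
    bin_centers = [(i + 0.5) / n_bins for i in range(n_bins)]
    # Aggregator 1: histogram-weighted mean likelihood.
    hist_score = sum(h * c for h, c in zip(hist, bin_centers))
    # Aggregator 2: fraction of patches that individually vote "MSI".
    vote_score = mean(1.0 if p >= 0.5 else 0.0 for p in patch_probs)
    return 0.5 * (hist_score + vote_score)


if __name__ == "__main__":
    # Hypothetical patch probabilities for two slides.
    print(slide_score([0.85, 0.95, 0.92]))  # mostly high-likelihood patches
    print(slide_score([0.05, 0.15, 0.12]))  # mostly low-likelihood patches
```

In this sketch a slide whose patches mostly receive high MSI likelihoods gets a higher slide-level score than one whose patches receive low likelihoods; a real pipeline would replace both aggregators with trained slide-level classifiers.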
Keywords: colorectal cancer; ensembled patch likelihood aggregation (EPLA); microsatellite instability; multi-omics; pathomics.
© The author(s).
Conflict of interest statement
Competing Interests: F.Y., Y.Z., W.J.L., T.X.W., W.J.H., W.M.T. and J.H.Y. are employed by Tencent and W.J.C. is employed by Shanghai Tongshu Biotechnology Co., Ltd.