Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
- PMID: 36376239
- PMCID: PMC9732687
- DOI: 10.1002/cjp2.302
Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
Abstract
Deep learning models are increasingly being used to interpret whole-slide images (WSIs) in digital pathology and to predict genetic mutations. Currently, it is commonly assumed that tumor regions have most of the predictive power. However, it is reasonable to assume that other tissues from the tumor microenvironment may also provide important predictive information. In this paper, we propose an unsupervised clustering-based multiple-instance deep learning model for the prediction of genetic mutations using WSIs of three cancer types obtained from The Cancer Genome Atlas. Our proposed model facilitates the identification of spatial regions related to specific gene mutations and exclusion of patches that lack predictive information through the use of unsupervised clustering. This results in a more accurate prediction of gene mutations when compared with models using all image patches on WSIs and two recently published algorithms for all three different cancer types evaluated in this study. In addition, our study validates the hypothesis that the prediction of gene mutations solely based on tumor regions on WSI slides may not always provide the best performance. Other tissue types in the tumor microenvironment could provide a better prediction ability than tumor tissues alone. These results highlight the heterogeneity in the tumor microenvironment and the importance of identification of predictive image patches in digital pathology prediction tasks.
Keywords: H&E image; deep learning; gene mutation; unsupervised clustering; whole-slide images.
© 2022 The Authors. The Journal of Pathology: Clinical Research published by The Pathological Society of Great Britain and Ireland and John Wiley & Sons Ltd.
Figures






Similar articles
-
Deep learning-based tumor microenvironment segmentation is predictive of tumor mutations and patient survival in non-small-cell lung cancer.BMC Cancer. 2022 Sep 21;22(1):1001. doi: 10.1186/s12885-022-10081-w. BMC Cancer. 2022. PMID: 36131239 Free PMC article.
-
Dual-path neural network extracts tumor microenvironment information from whole slide images to predict molecular typing and prognosis of Glioma.Comput Methods Programs Biomed. 2025 Apr;261:108580. doi: 10.1016/j.cmpb.2024.108580. Epub 2025 Jan 4. Comput Methods Programs Biomed. 2025. PMID: 39809091
-
Unsupervised mutual transformer learning for multi-gigapixel Whole Slide Image classification.Med Image Anal. 2024 Aug;96:103203. doi: 10.1016/j.media.2024.103203. Epub 2024 May 21. Med Image Anal. 2024. PMID: 38810517
-
An aggregation of aggregation methods in computational pathology.Med Image Anal. 2023 Aug;88:102885. doi: 10.1016/j.media.2023.102885. Epub 2023 Jun 29. Med Image Anal. 2023. PMID: 37423055 Review.
-
Deep learning for colon cancer histopathological images analysis.Comput Biol Med. 2021 Sep;136:104730. doi: 10.1016/j.compbiomed.2021.104730. Epub 2021 Aug 4. Comput Biol Med. 2021. PMID: 34375901 Review.
Cited by
-
Deep learning in histopathology images for prediction of oncogenic driver molecular alterations in lung cancer: a systematic review and meta-analysis.Transl Lung Cancer Res. 2025 May 30;14(5):1756-1769. doi: 10.21037/tlcr-2024-1196. Epub 2025 May 21. Transl Lung Cancer Res. 2025. PMID: 40535093 Free PMC article.
-
Artificial intelligence model predicts M2 macrophage levels and HCC prognosis with only globally labeled pathological images.Front Oncol. 2024 Dec 20;14:1474155. doi: 10.3389/fonc.2024.1474155. eCollection 2024. Front Oncol. 2024. PMID: 39759153 Free PMC article.
-
Deep-GenMut: Automated genetic mutation classification in oncology: A deep learning comparative study.Heliyon. 2024 May 31;10(11):e32279. doi: 10.1016/j.heliyon.2024.e32279. eCollection 2024 Jun 15. Heliyon. 2024. PMID: 38912449 Free PMC article.
-
Self-supervised artificial intelligence predicts recurrence, metastasis and disease specific death from primary cutaneous squamous cell carcinoma at diagnosis.Res Sq [Preprint]. 2023 Dec 13:rs.3.rs-3607399. doi: 10.21203/rs.3.rs-3607399/v1. Res Sq. 2023. Update in: NPJ Digit Med. 2025 Feb 15;8(1):105. doi: 10.1038/s41746-025-01496-3. PMID: 38168253 Free PMC article. Updated. Preprint.
-
Deep learning-empowered crop breeding: intelligent, efficient and promising.Front Plant Sci. 2023 Oct 3;14:1260089. doi: 10.3389/fpls.2023.1260089. eCollection 2023. Front Plant Sci. 2023. PMID: 37860239 Free PMC article.
References
-
- Bailey P, Chang DK, Nones K, et al. Genomic analyses identify molecular subtypes of pancreatic cancer. Nature 2016; 531: 47–52. - PubMed
-
- Dienstmann R, Vermeulen L, Guinney J, et al. Consensus molecular subtypes and the evolution of precision medicine in colorectal cancer. Nat Rev Cancer 2017; 17: 79–92. - PubMed
-
- Lindeman NI, Cagle PT, Beasley MB, et al. Molecular testing guideline for selection of lung cancer patients for EGFR and ALK tyrosine kinase inhibitors: guideline from the College of American Pathologists, International Association for the Study of Lung Cancer, and Association for Molecular Pathology. J Thorac Oncol 2013; 8: 823–859. - PMC - PubMed
-
- Russnes HG, Lingjærde OC, Børresen‐Dale A‐L, et al. Breast cancer molecular stratification: from intrinsic subtypes to integrative clusters. Am J Pathol 2017; 187: 2152–2162. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources