PLoS One. 2024 Sep 10;19(9):e0309380.

doi: 10.1371/journal.pone.0309380. eCollection 2024.

Exploring the interplay between colorectal cancer subtypes genomic variants and cellular morphology: A deep-learning approach

Hadar Hezi¹, Daniel Shats², Daniel Gurevich³,⁴, Yosef E Maruvka³,⁴, Moti Freiman¹

Affiliations

¹ Faculty of Biomedical Engineering, Technion - Israel Institute of Technology, Haifa, Israel.
² Faculty of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel.
³ Faculty of Biotechnology and Food Engineering, Technion - Israel Institute of Technology, Haifa, Israel.
⁴ Lokey Center for Life Science and Engineering, Technion - Israel Institute of Technology, Haifa, Israel.

PMID: 39255280
PMCID: PMC11386451
DOI: 10.1371/journal.pone.0309380

Hadar Hezi et al. PLoS One. 2024.

Abstract

Molecular subtypes of colorectal cancer (CRC) significantly influence treatment decisions. While convolutional neural networks (CNNs) have recently been introduced for automated CRC subtype identification using H&E-stained histopathological images, the correlation between CRC subtype genomic variants and their corresponding cellular morphology, as expressed by their imaging phenotypes, is yet to be fully explored. The goal of this study was to determine such correlations by incorporating genomic variants in CNN models for CRC subtype classification from H&E images. We utilized the publicly available TCGA-CRC-DX dataset, which comprises whole slide images from 360 CRC-diagnosed patients (260 for training and 100 for testing). This dataset also provides information on CRC subtype classifications and genomic variations. We trained CNN models for CRC subtype classification that account for potential correlation between genomic variations within CRC subtypes and their corresponding cellular morphology patterns. We assessed the interplay between CRC subtypes' genomic variations and cellular morphology patterns by evaluating the CRC subtype classification accuracy of the different models in a stratified 5-fold cross-validation experimental setup using the area under the ROC curve (AUROC) and average precision (AP) as the performance metrics. The CNN models that account for potential correlation between genomic variations within CRC subtypes and their cellular morphology patterns achieved superior accuracy compared to the baseline CNN classification model that does not account for genomic variations when using either single-nucleotide-polymorphism (SNP) molecular features (AUROC: 0.824±0.02 vs. 0.761±0.04, p<0.05, AP: 0.652±0.06 vs. 0.58±0.08) or CpG-Island methylation phenotype (CIMP) molecular features (AUROC: 0.834±0.01 vs. 0.787±0.03, p<0.05, AP: 0.687±0.02 vs. 0.64±0.05). Combining the CNN models that account for variations in CIMP and SNP further improved classification accuracy (AUROC: 0.847±0.01 vs. 0.787±0.03, p = 0.01, AP: 0.68±0.02 vs. 0.64±0.05). The improved accuracy of CNN models for CRC subtype classification that account for potential correlation between genomic variations within CRC subtypes and their corresponding cellular morphology as expressed by H&E imaging phenotypes may elucidate the biological cues impacting cancer histopathological imaging phenotypes. Moreover, considering CRC subtypes' genomic variations has the potential to improve the accuracy of deep-learning models in discerning cancer subtype from histopathological imaging data.
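
The two performance metrics named in the abstract, AUROC and AP, can be illustrated with a minimal pure-Python sketch. This is not the authors' evaluation code: the function names are hypothetical, labels are assumed to be 1 for MSI and 0 for MSS, and the AP variant shown ignores tied scores for simplicity.

```python
def auroc(labels, scores):
    """Area under the ROC curve via the pairwise-ranking identity."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    # Fraction of (positive, negative) pairs ranked correctly; ties count 1/2.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values measured at the rank of each positive."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank  # precision at this positive's rank
    if hits == 0:
        raise ValueError("need at least one positive sample")
    return total / hits
```

In a 5-fold setup, each metric would be computed once per fold on the patient-level scores and then summarized as mean ± standard deviation.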

Copyright: © 2024 Hezi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Summary of the TCGA COAD and READ datasets application: The total cohort encompasses n = 632 patients.**
Some patients were excluded for technical reasons, leaving n = 430 patients. Of these, Kather et al. [7] pre-processed and published data for n = 360 patients, splitting them into a training set and a testing set. The training set was balanced at the patch (p) level. For our research, we used stratified cross-validation folds at the patient level. The partitioning into these folds was informed by the novel sub-labels based on SNP rates and CIMP classifications.
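
Patient-level stratified folding of the kind described in this caption might look like the following sketch. It is an assumption-laden illustration, not the published procedure: the function name, the sub-label strings, and the round-robin dealing scheme are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_patient_folds(sub_labels, k=5, seed=0):
    """Assign each patient to one of k folds, balancing every sub-label.

    sub_labels maps a patient ID to a (hypothetical) sub-label string,
    for example a subtype combined with an SNP-rate or CIMP category.
    """
    by_label = defaultdict(list)
    for patient, label in sorted(sub_labels.items()):
        by_label[label].append(patient)

    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    offset = 0
    for label in sorted(by_label):
        patients = by_label[label]
        rng.shuffle(patients)
        # Deal patients round-robin; the running offset keeps fold sizes even
        # when a sub-label group is not a multiple of k.
        for j, patient in enumerate(patients):
            folds[(offset + j) % k].append(patient)
        offset += len(patients)
    return folds
```

Splitting by patient rather than by patch keeps all patches of one patient in the same fold, which avoids leakage between training and validation data.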

**Fig 2. Experimental flow for our exploration of the interplay between CRC subtypes genomic variants and cellular morphology.**
The TCGA-CRC dataset, pre-processed by Kather et al. [7] (N = 360), is split into different sets for analysis. A baseline model is trained, and based on its results, a molecular feature analysis is performed. Based on the analysis, we define our data classes according to the ranges and categories of SNP, CIMP and CNV (the BP class definitions step). After the definition, we divide the classes into five stratified folds. Next, three models are trained: BP-CNN_CIMP, BP-CNN_SNP, and BP-CNN_CNV to evaluate the interplay between genomic variations and cellular morphology. The BP-CNN_CIMP, BP-CNN_SNP, and BP-CNN_CNV models classify the data based on CIMP, SNP, and CNV features, respectively. Based on their results, BP-CNN_CIMP and BP-CNN_SNP are further combined into BP-CNN_Combined to incorporate the entire set of genomic variations identified as influencing cellular morphology.

**Fig 3. Model architectures.**
(a) Baseline Model Architecture: Patches are input into the Inception-Net [24] for feature extraction, with the last two layers acting as fully connected classifier layers. Outputs are propagated to a softmax layer for determining probabilities. N represents the number of patient patches, while P_i denotes the MSI probability for each patch. The MSI score for each patient, P_w, is the average of its corresponding MSI probabilities. (b) Biologically-Primed Model Architecture: Similar to the baseline model, the softmax layer outputs class probabilities at the patch level. However, the MSI probability here is calculated as the maximum value between *MSI*₁ and *MSI*₂ outputs. The calculation of P_w remains the same as in the baseline model.
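
The patch-to-patient aggregation described in this caption can be sketched as follows. The class keys "MSS", "MSI_1", and "MSI_2" and the function names are assumptions made for illustration; the caption does not specify how the softmax outputs are ordered or named.

```python
def baseline_patch_msi(probs):
    """Baseline model: the patch MSI probability P_i is the MSI softmax output."""
    return probs["MSI"]


def bp_patch_msi(probs):
    """Biologically primed model: P_i is the larger of the two MSI sub-class outputs."""
    return max(probs["MSI_1"], probs["MSI_2"])


def patient_score(patch_probs):
    """P_w: the mean of the N per-patch MSI probabilities of one patient."""
    if not patch_probs:
        raise ValueError("patient has no patches")
    return sum(patch_probs) / len(patch_probs)


# Two hypothetical patches from one patient, scored by a biologically primed model.
patches = [
    {"MSS": 0.2, "MSI_1": 0.5, "MSI_2": 0.3},
    {"MSS": 0.7, "MSI_1": 0.1, "MSI_2": 0.2},
]
p_w = patient_score([bp_patch_msi(p) for p in patches])
```

Taking the maximum over the MSI sub-classes means a patch counts as MSI-like if it resembles either sub-class, while the patient-level averaging is unchanged from the baseline.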

**Fig 4. Our BP-CNN_Combined model.**
Models A and B represent biologically-primed models informed by two distinct genomic variations. The network outputs from trained and fixed models A and B are concatenated, fed into a linear layer, and then propagated to a softmax layer to yield probabilities. ‘N’ represents the number of patches for each patient, and P_i indicates the corresponding MSI probabilities for these patches. The MSI score for each patient, denoted as P_w, is derived from averaging its respective MSI probabilities.
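
The forward pass of the combination head described in this caption (concatenate, linear layer, softmax) can be sketched in plain Python. The output dimensions, the two-class (MSS, MSI) output, and the weight values are assumptions for illustration only; in practice the linear layer would be trained while models A and B stay fixed.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combined_forward(out_a, out_b, weights, bias):
    """Concatenate the outputs of fixed models A and B, apply a linear layer
    (one weight row per output class), then softmax."""
    x = list(out_a) + list(out_b)
    logits = [sum(w * v for w, v in zip(row, x)) + b
              for row, b in zip(weights, bias)]
    return softmax(logits)


# Hypothetical 3-class outputs from models A and B for a single patch.
out_a = [1.0, 0.0, 0.0]
out_b = [0.0, 1.0, 0.0]
weights = [[1, 0, 0, 0, 0, 0],   # row producing the assumed MSS logit
           [0, 0, 0, 0, 1, 0]]   # row producing the assumed MSI logit
probs = combined_forward(out_a, out_b, weights, bias=[0.0, 0.0])
```

Freezing A and B leaves only the small linear layer to fit, so the combined model reuses the representations learned separately for each genomic variation.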

**Fig 5. Baseline model results for per-patient classification of the test set validated over 5-folds.**
Average and 95% CI curves: (a) ROC curve, (b) PR curve.

**Fig 6. The distribution of patient-level molecular features in the test set, categorized based on the patch-level classification by the baseline model.**
The x-axis indicates the classification of patches, while the y-axis denotes the molecular level determined at the patient level. Here, **MSI** serves as the **positive** class and **MSS** as the **negative** class: (a) A boxplot illustrating SNP rates for each patch. The y-axis quantifies the cumulative count of SNPs throughout the DNA sample. (b) A bar plot depicting the methylation types for each patch. The y-axis showcases the distribution of various methylation types across classification categories. (c) A boxplot highlighting the CNV rates for patches, with the y-axis measuring the proportion of the DNA sample that manifests CNV.

**Fig 7. Average and 95% CI ROC and PR curves for per-patient classification using: (a) the BP-CNN_SNP model compared to its corresponding baseline model, (b) the BP-CNN_CIMP model compared to its corresponding baseline model, and (c) the BP-CNN_CNV model compared to its corresponding baseline model.**

**Fig 8. Box-plot visualization of (a) AUROC results, (b) AP results and (c) F1-scores for per-patient classification, comparing the biologically primed models with their corresponding baseline model on the test set over different training sessions.**
Because the stratified k-fold approach partitions the training data differently across sessions, the performance of the baseline model can vary between experiments.

**Fig 9. Confusion matrices of the patient-level predictions for the different models.**
Each matrix represents an average from the test set over various training sessions. The threshold for MSI prediction is determined by the best F1 score over the folds. (a) Baseline model corresponding to the BP-CNN_SNP folds. (b) Baseline model corresponding to the BP-CNN_CIMP folds. (c) BP-CNN_SNP model. (d) BP-CNN_CIMP model.
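
Choosing an MSI decision threshold by best F1 score could be sketched as below. The caption does not spell out the exact procedure, so this is only one plausible reading: the function names are hypothetical, every observed score is tried as a candidate threshold, and ties are broken in favor of the higher threshold.

```python
def f1_at_threshold(labels, scores, threshold):
    """F1 score when patients with score >= threshold are predicted MSI (label 1)."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def best_f1_threshold(labels, scores):
    """Return the observed score that maximizes F1 when used as the threshold."""
    candidates = sorted(set(scores))
    return max(candidates,
               key=lambda t: (f1_at_threshold(labels, scores, t), t))
```

The selected threshold then converts each patient-level MSI score into an MSI or MSS prediction, from which a confusion matrix can be tallied.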

**Fig 10. Average and 95% CI ROC and PR curves for per-patient classification using the BP-CNN_Combined model compared to the baseline model.**
(a) ROC curve. (b) PR curve. (c), (d), and (e) compare the 5-fold AUROC, AP, and F1 results, respectively.

**Fig 11. A histogram showcasing the MSI scores for patches from selected patients, misclassified by the baseline model but accurately classified by our proposed models.**
The x-axis represents the patch MSI probabilities given by the CNN, while the y-axis denotes the count of patches, normalized to the total number of patches for each patient. The comparisons are between (a) the Baseline and BP-CNN_SNP model, (b) the Baseline and BP-CNN_CIMP model, and (c) the Baseline and BP-CNN_Combined model.

**Fig 12. Patches of patients that were misclassified by our models.**
Top row: patches of patients that were misclassified by the Baseline model and correctly classified by the BP-CNN_Combined model. (a) TCGA-AA-3833, Baseline: MSS, BP-CNN_Combined: MSI, reference: MSI (SNP<1200), (b) TCGA-AY-6197, Baseline: MSS, BP-CNN_Combined: MSI, reference: MSI (CIMP-low), (c) TCGA-A6-2685, Baseline: MSI, BP-CNN_Combined: MSS, reference: MSS, (d) TCGA-NH-A6GC, Baseline: MSI, BP-CNN_Combined: MSS, reference: MSS. Bottom row: patches of patients that were misclassified by both the Baseline model and the BP-CNN_Combined model. (e) TCGA-A6-2686, Baseline: MSS, BP-CNN_Combined: MSS, reference: MSI, (f) TCGA-AG-A02N, Baseline: MSS, BP-CNN_Combined: MSS, reference: MSI, (g) TCGA-AG-3881, Baseline: MSI, BP-CNN_Combined: MSI, reference: MSS, (h) TCGA-DC-6682, Baseline: MSI, BP-CNN_Combined: MSI, reference: MSS.

Cited by

Accurate colorectal cancer detection using a random hinge exponential distribution coupled attention network on pathological images.
Bharath E, Raja RV, Kalaivanan K, Deshpande V. Abdom Radiol (NY). 2025 Jul;50(7):2828-2857. doi: 10.1007/s00261-024-04770-2. Epub 2025 Jan 8. PMID: 39779530

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a cancer journal for clinicians. 2021;71(3):209–249.
2. Hu LF, Lan HR, Huang D, Li XM, Jin KT. Personalized immunotherapy in colorectal cancers: where do we stand? Frontiers in oncology. 2021;11:769305. doi: 10.3389/fonc.2021.769305
3. Le DT, Durham JN, Smith KN, Wang H, Bartlett BR, Aulakh LK, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357(6349):409–413. doi: 10.1126/science.aan6733
4. Baudrin LG, Deleuze JF, How-Kit A. Molecular and computational methods for the detection of microsatellite instability in cancer. Frontiers in oncology. 2018;8:621. doi: 10.3389/fonc.2018.00621
5. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 770–778.
