Deep learning-based analysis of EGFR mutation prevalence in lung adenocarcinoma H&E whole slide images
- PMID: 39358807
- PMCID: PMC11446692
- DOI: 10.1002/2056-4538.70004
Deep learning-based analysis of EGFR mutation prevalence in lung adenocarcinoma H&E whole slide images
Abstract
EGFR mutations are a major prognostic factor in lung adenocarcinoma. However, current detection methods require sufficient samples and are costly. Deep learning is promising for mutation prediction in histopathological image analysis but has limitations in that it does not sufficiently reflect tumor heterogeneity and lacks interpretability. In this study, we developed a deep learning model to predict the presence of EGFR mutations by analyzing histopathological patterns in whole slide images (WSIs). We also introduced the EGFR mutation prevalence (EMP) score, which quantifies EGFR prevalence in WSIs based on patch-level predictions, and evaluated its interpretability and utility. Our model estimates the probability of EGFR prevalence in each patch by partitioning the WSI based on multiple-instance learning and predicts the presence of EGFR mutations at the slide level. We utilized a patch-masking scheduler training strategy to enable the model to learn various histopathological patterns of EGFR. This study included 868 WSI samples from lung adenocarcinoma patients collected from three medical institutions: Hallym University Medical Center, Inha University Hospital, and Chungnam National University Hospital. For the test dataset, 197 WSIs were collected from Ajou University Medical Center to evaluate the presence of EGFR mutations. Our model demonstrated prediction performance with an area under the receiver operating characteristic curve of 0.7680 (0.7607-0.7720) and an area under the precision-recall curve of 0.8391 (0.8326-0.8430). The EMP score showed Spearman correlation coefficients of 0.4705 (p = 0.0087) for p.L858R and 0.5918 (p = 0.0037) for exon 19 deletions in 64 samples subjected to next-generation sequencing analysis. Additionally, high EMP scores were associated with papillary and acinar patterns (p = 0.0038 and p = 0.0255, respectively), whereas low EMP scores were associated with solid patterns (p = 0.0001). These results validate the reliability of our model and suggest that it can provide crucial information for rapid screening and treatment plans.
Keywords: EGFR; deep learning in histopathology; multiple‐instance learning; whole‐slide image analysis.
© 2024 The Author(s). The Journal of Pathology: Clinical Research published by The Pathological Society of Great Britain and Ireland and John Wiley & Sons Ltd.
Figures





References
-
- Lazcanoiturburu N, García‐Sáez J, González‐Corralejo C, et al. Lack of EGFR catalytic activity in hepatocytes improves liver regeneration following DDC‐induced cholestatic injury by promoting a pro‐restorative inflammatory response. J Pathol 2022; 258: 312–324. - PubMed
-
- Fontugne J, Wong J, Cabel L, et al. Progression‐associated molecular changes in basal/squamous and sarcomatoid bladder carcinogenesis. J Pathol 2023; 259: 455–467. - PubMed
-
- Pastorino GA, Sheraj I, Huebner K, et al. A partial epithelial‐mesenchymal transition signature for highly aggressive colorectal cancer cells that survive under nutrient restriction. J Pathol 2024; 262: 347–361. - PubMed
-
- Gonzalez‐Sanchez E, Vaquero J, Caballero‐Diaz D, et al. The hepatocyte epidermal growth factor receptor (EGFR) pathway regulates the cellular interactome within the liver fibrotic niche. J Pathol 2024; 263: 482–495. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous