Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 4;12(7):733.
doi: 10.3390/bioengineering12070733.

OKEN: A Supervised Evolutionary Optimizable Dimensionality Reduction Framework for Whole Slide Image Classification

Affiliations

OKEN: A Supervised Evolutionary Optimizable Dimensionality Reduction Framework for Whole Slide Image Classification

Soroush Oskouei et al. Bioengineering (Basel). .

Abstract

Classification of lung cancer subtypes is a critical clinical step; however, relying solely on H&E-stained histopathology images can pose challenges, and additional immunohistochemical analysis is sometimes required for definitive subtyping. Digital pathology facilitates the use of artificial intelligence for automatic classification of digital tissue slides. Automatic classification of Whole Slide Images (WSIs) typically involves extracting features from patches obtained from them. The aim of this study was to develop a WSI classification framework utilizing an optimizable kernel to encode features from each patch from a WSI into a desirable and adjustable latent space using an evolutionary algorithm. The encoded data can then be used for classification and segmentation while being computationally more efficient. Our proposed framework is compared with a state-of-the-art model, Vim4Path, on an internal and external dataset of lung cancer WSIs. The proposed model outperforms Vim-S16 in accuracy and F1 score at both ×2.5 and ×10 magnification levels on the internal test set, with the highest accuracy (0.833) and F1 score (0.721) at ×2.5. On the external test set, Vim-S16 at ×10 achieves the highest accuracy (0.732), whereas OKEN-DenseNet121 at ×2.5 has the best F1 score (0.772). In future work, finding a dynamic way to tune the output dimensions of the evolutionary algorithm would be of value.

Keywords: deep learning; digital pathology; dimensionality reduction; evolutionary algorithm; lung cancer.

PubMed Disclaimer

Conflict of interest statement

Author André Pedersen was employed by Sopra Steria. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Visualization of data split and distribution. The diagram was created using Mermaid [37]. Abbreviations: HULC: Haukeland University Hospital Lung Cancer; WSIs: whole slide images; TCGA: The Cancer Genome Atlas.
Figure 2
Figure 2
Overview of the proposed three-stage pipeline for tumor region processing and classification. Stage 1: Patch positions are created from the whole slide image (WSI), and a tumor mask is generated using a pretrained model. Tumor regions are identified, and patch positions and tumor patches are saved for downstream processing. Stage 2: An evolutionary algorithm (EA) is used to optimize a transformation matrix by iteratively clustering feature embeddings across generations, leading to better separation between the two classes of labeled data. The final transformation matrices are saved. Stage 3: Features are extracted from the saved patches, followed by dimensionality reduction using the saved transformation matrices and graph formation based on spatial positions. A 1-Dimensional Convolutional Neural Network (1D-CNN) is then applied for classification.
Figure 3
Figure 3
Comparison of dimensionality reduction techniques applied to high-dimensional features extracted from image patches of 22 whole slide images. The (top-left) plot shows the result of the proposed EA-based transformation. The remaining plots display the outputs of UMAP (top-middle and top-right) and t-SNE (bottom-left and bottom-right) applied to both the original features and the EA-based transformed features. The two visualized colors indicate the two tumor subtype classes. Abbreviations: EA: evolutionary algorithm; UMAP: Uniform manifold approximation and projection; t-SNE: t-distributed stochastic neighbor embedding.

Similar articles

References

    1. Strauss G.M., Jemal A., McKenna M.B., Strauss J.A., Cummings K. B4-06: Lung Cancer Survival in Relation to Histologic Subtype: An Analysis based upon Surveillance Epidemiology and End Results (SEER) Data. J. Thorac. Oncol. 2007;2:S345–S346. doi: 10.1097/01.JTO.0000283165.65986.59. - DOI
    1. Yun J.K., Kwon Y., Kim J., Lee G.D., Choi S., Kim H.R., Kim Y.H., Kim D.K., Park S.I. Clinical impact of histologic type on survival and recurrence in patients with surgically resected stage II and III non-small cell lung cancer. Lung Cancer. 2023;176:24–30. doi: 10.1016/j.lungcan.2022.12.008. - DOI - PubMed
    1. Perez-Moreno P., Brambilla E., Thomas R., Soria J.C. Squamous cell carcinoma of the lung: Molecular subtypes and therapeutic opportunities. Clin. Cancer Res. 2012;18:2443–2451. doi: 10.1158/1078-0432.CCR-11-2370. - DOI - PubMed
    1. Zhang Y., Liu H., Chang C., Yin Y., Wang R. Machine learning for differentiating lung squamous cell cancer from adenocarcinoma using Clinical-Metabolic characteristics and 18F-FDG PET/CT radiomics. PLoS ONE. 2024;19:e0300170. doi: 10.1371/journal.pone.0300170. - DOI - PMC - PubMed
    1. Chen J.W., Dhahbi J. Lung adenocarcinoma and lung squamous cell carcinoma cancer classification, biomarker identification, and gene expression analysis using overlapping feature selection methods. Sci. Rep. 2021;11:13323. doi: 10.1038/s41598-021-92725-8. - DOI - PMC - PubMed

LinkOut - more resources