Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 3;15(3):2329-2346.
doi: 10.21037/qims-24-1641. Epub 2025 Feb 26.

Advanced deep learning for multi-class colorectal cancer histopathology: integrating transfer learning and ensemble methods

Affiliations

Advanced deep learning for multi-class colorectal cancer histopathology: integrating transfer learning and ensemble methods

Qi Ke et al. Quant Imaging Med Surg. .

Abstract

Background: Cancer is a major global health threat, constantly endangering people's well-being and lives. The application of deep learning in the diagnosis of colorectal cancer can improve early detection rates, thereby significantly reducing the incidence and mortality of colorectal cancer patients. Our study aims to optimize the performance of deep learning model in the classification of colorectal cancer histopathological images to assist pathologists in improving diagnostic accuracy.

Methods: In this study, we developed ensemble models based on deep convolutional neural networks (CNNs) for the classification of colorectal cancer histopathology images. The method first involved data preprocessing techniques such as patch cropping, stain normalization, data augmentation and data balancing on histopathology images with different magnifications. Subsequently, the CNN models were fine-tuned and pre-trained using transfer learning methods, and models with superior performance were then selected as the base classifiers to build the ensemble models. Finally, the ensemble models were used to predict the final classification outcomes. To evaluate the effectiveness of the proposed models, we tested their performance on a publicly available colorectal cancer dataset, Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image (EBHI) dataset.

Results: Experimental results show that the proposed ensemble model, composed of the top five classifiers, achieved the promising classification accuracy across sub-databases with four different magnification factors. Specifically, on the 40× magnification subset, the highest classification accuracy reached 99.11%; on the 100× magnification subset, it reached 99.36%; on the 200× magnification subset, it was 99.29%; and on the 400× magnification subset, it was 98.96%. Additionally, the proposed ensemble model achieved exceptional results in recall, precision, and F1 score.

Conclusions: The proposed ensemble models obtained good classification performance on the EBHI dataset of histopathological images for colorectal cancer. The findings of this study may contribute to the early detection and accurate classification of colorectal cancer, thereby aiding in more precise diagnostic analysis of colorectal cancer.

Keywords: Histopathological image; colorectal cancer; deep learning (DL); ensemble learning; transfer learning (TL).

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://qims.amegroups.com/article/view/10.21037/qims-24-1641/coif). The authors have no conflicts of interest to declare.

Figures

Figure 1
Figure 1
Representative EBHI dataset samples. EBHI, Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image; IN, intraepithelial neoplasia.
Figure 2
Figure 2
The workflow of the Patch-Cropping Algorithm processing.
Figure 3
Figure 3
Original and stain normalized image samples.
Figure 4
Figure 4
The architecture of the proposed ensemble model. CNN, convolutional neural network.
Figure 5
Figure 5
Samples and corresponding Grad-CAM heatmaps generated by eight CNN models under the four magnification factor images. Grad-CAM, Gradient Weighted Class Activation Mapping; CNN, convolutional neural network.

References

    1. Mengash HA, Alamgeer M, Maashi M, Othman M, Hamza MA, Ibrahim SS, Zamani AS, Yaseen I. Leveraging Marine Predators Algorithm with Deep Learning for Lung and Colon Cancer Diagnosis. Cancers (Basel) 2023. - PMC - PubMed
    1. Ferlay J, Colombet M, Soerjomataram I, Parkin DM, Piñeros M, Znaor A, Bray F. Cancer statistics for the year 2020: An overview. Int J Cancer 2021. [Epub ahead of print]. doi: .10.1002/ijc.33588 - DOI - PubMed
    1. Chhikara BS, Parang K. Chemical Biology LETTERS Global Cancer Statistics 2022: the trends projection analysis. Chem Biol Lett 2023;10:451.
    1. Yin Z, Yao C, Zhang L, Qi S. Application of artificial intelligence in diagnosis and treatment of colorectal cancer: A novel Prospect. Front Med (Lausanne) 2023;10:1128084. 10.3389/fmed.2023.1128084 - DOI - PMC - PubMed
    1. Li J, Liu J, Yue H, Cheng J, Kuang H, Bai H, Wang Y, Wang J. DARC: Deep adaptive regularized clustering for histopathological image classification. Med Image Anal 2022;80:102521. 10.1016/j.media.2022.102521 - DOI - PubMed

LinkOut - more resources