. 2025 Aug 18;11(8):91.

doi: 10.3390/tomography11080091.

Autoencoder-Assisted Stacked Ensemble Learning for Lymphoma Subtype Classification: A Hybrid Deep Learning and Machine Learning Approach

Roseline Oluwaseun Ogundokun¹, Pius Adewale Owolawi¹, Chunling Tu¹, Etienne van Wyk¹

Affiliations

PMID: 40863882
PMCID: PMC12389832
DOI: 10.3390/tomography11080091

Autoencoder-Assisted Stacked Ensemble Learning for Lymphoma Subtype Classification: A Hybrid Deep Learning and Machine Learning Approach

Roseline Oluwaseun Ogundokun et al. Tomography. 2025.

. 2025 Aug 18;11(8):91.

doi: 10.3390/tomography11080091.

Authors

Roseline Oluwaseun Ogundokun¹, Pius Adewale Owolawi¹, Chunling Tu¹, Etienne van Wyk¹

Affiliation

¹ Department of Computer Systems Engineering, Tshwane University of Technology (TUT), Pretoria 0001, South Africa.

PMID: 40863882
PMCID: PMC12389832
DOI: 10.3390/tomography11080091

Abstract

Background: Accurate subtype identification of lymphoma cancer is crucial for effective diagnosis and treatment planning. Although standard deep learning algorithms have demonstrated robustness, they are still prone to overfitting and limited generalization, necessitating more reliable and robust methods.

Objectives: This study presents an autoencoder-augmented stacked ensemble learning (SEL) framework integrating deep feature extraction (DFE) and ensembles of machine learning classifiers to improve lymphoma subtype identification.

Methods: Convolutional autoencoder (CAE) was utilized to obtain high-level feature representations of histopathological images, followed by dimensionality reduction via Principal Component Analysis (PCA). Various models were utilized for classifying extracted features, i.e., Random Forest (RF), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), AdaBoost, and Extra Trees classifiers. A Gradient Boosting Machine (GBM) meta-classifier was utilized in an SEL approach to further fine-tune final predictions.

Results: All the models were tested using accuracy, area under the curve (AUC), and Average Precision (AP) metrics. The stacked ensemble classifier performed better than all the individual models with a 99.04% accuracy, 0.9998 AUC, and 0.9996 AP, far exceeding what regular deep learning (DL) methods would achieve. Of standalone classifiers, MLP (97.71% accuracy, 0.9986 AUC, 0.9973 AP) and Random Forest (96.71% accuracy, 0.9977 AUC, 0.9953 AP) provided the best prediction performance, while AdaBoost was the poorest performer (68.25% accuracy, 0.8194 AUC, 0.6424 AP). PCA and t-SNE plots confirmed that DFE effectively enhances class discrimination.

Conclusion: This study demonstrates a highly accurate and reliable approach to lymphoma classification by using autoencoder-assisted ensemble learning, reducing the misclassification rate and significantly enhancing the accuracy of diagnosis. AI-based models are designed to assist pathologists by providing interpretable outputs such as class probabilities and visualizations (e.g., Grad-CAM), enabling them to understand and validate predictions in the diagnostic workflow. Future studies should enhance computational efficacy and conduct multi-centre validation studies to confirm the model's generalizability on extensive collections of histopathological datasets.

Keywords: autoencoder; deep feature extraction; digital pathology; lymphoma classification; machine learning; stacked ensemble learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
Proposed model flow diagram.

**Figure 2**
Visualization of the PCA and t-SNE feature space.

**Figure 4**
Confusion matrix for standalone models. (a) RF; (b) SVM; (c) MLP; (d) AdaBoost; (e) Extra Trees.

**Figure 5**
Confusion matrix for stacked classifier.

**Figure 8**
Models of accuracy comparison.

See this image and copyright information in PMC

References

1. Litjens G., Kooi T., Bejnordi B.E., Setio A.A.A., Ciompi F., Ghafoorian M., Van Der Laak J.A., Van Ginneken B., Sánchez C.I. A survey on deep learning in medical image analysis. Med. Image Anal. 2017;42:60–88. doi: 10.1016/j.media.2017.07.005. - DOI - PubMed
1. Zhang X., Zhang S., Zhang X., Xiong J., Han X., Wu Z., Zhao D., Li Y., Xu Y., Chen D. Fast Virtual Stenting for Thoracic Endovascular Aortic Repair of Aortic Dissection Using Graph Deep Learning. IEEE J. Biomed. Health Inform. 2025;29:4374–4387. doi: 10.1109/JBHI.2025.3540712. - DOI - PubMed
1. Cai L., Gao J., Zhao D. A review of the application of deep learning in medical image classification and segmentation. Ann. Transl. Med. 2020;8:713. doi: 10.21037/atm.2020.02.44. - DOI - PMC - PubMed
1. Luan S., Yu X., Lei S., Ma C., Wang X., Xue X., Ding Y., Ma T., Zhu B. Deep learning for fast super-resolution ultrasound microvessel imaging. Phys. Med. Biol. 2023;68:245023. doi: 10.1088/1361-6560/ad0a5a. - DOI - PubMed
1. Shen D., Wu G., Suk H.I. Deep learning in medical image analysis. Annu. Rev. Biomed. Eng. 2017;19:221–248. doi: 10.1146/annurev-bioeng-071516-044442. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- MDPI
- PubMed Central
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Autoencoder-Assisted Stacked Ensemble Learning for Lymphoma Subtype Classification: A Hybrid Deep Learning and Machine Learning Approach

Affiliation

Autoencoder-Assisted Stacked Ensemble Learning for Lymphoma Subtype Classification: A Hybrid Deep Learning and Machine Learning Approach

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources

Medical