Oral squamous cell carcinoma grading classification using deep transformer encoder assisted dilated convolution with global attention
- PMID: 41180827
- PMCID: PMC12575215
- DOI: 10.3389/frai.2025.1575427
Oral squamous cell carcinoma grading classification using deep transformer encoder assisted dilated convolution with global attention
Abstract
In recent years, Oral Squamous Cell Carcinoma (OSCC) has been a common tumor in the orofacial region, affecting areas such as the teeth, jaw, and temporomandibular joint. OSCC is classified into three grades: "well-differentiated, moderately differentiated, and poorly differentiated," with a high morbidity and mortality rate among patients. Several existing methods, such as AlexNet, CNN, U-Net, and V-Net, have been used for OSCC classification. However, these methods face limitations, including low ACC, poor comparability, insufficient data collection, and prolonged training times. To address these limitations, we introduce a novel Deep Transformer Encoder-Assisted Dilated Convolution with Global Attention (DeTr-DiGAtt) model for OSCC classification. To enhance the dataset and mitigate over-fitting, a GAN model is employed for data augmentation. Additionally, an Adaptive Bilateral Filter (Ad-BF) is used to improve image quality and remove undesirable noise. For accurate identification of the affected region, an Improved Multi-Encoder Residual Squeeze U-Net (Imp-MuRs-Unet) model is utilized for segmentation. The DeTr-DiGAtt model is then applied to classify different OSCC grading levels. Furthermore, an Adaptive Grey Lag Goose Optimization Algorithm (Ad-GreLop) is used for hyperparameter tuning. The proposed method achieves an accuracy (ACC) of 98.59%, a Dice score of 97.97%, and an Intersection over Union (IoU) of 98.08%.
Keywords: GAN model; Grey lag goose optimization algorithm and global attention; U-net model; adaptive bilateral filter; dilated convolutional.
Copyright © 2025 Ramya and Minu.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
References
-
- Abuhayi B. M., Bezabh Y. A., Ayalew A. M. (2024). Lumbar disease classification using an Involutional neural based VGG nets (INVGG). IEEE Access 12, 27518–27529. doi: 10.1109/ACCESS.2024.3367774 - DOI
-
- Ahmad M., Irfan M. A., Sadique U., Haq I. U., Jan A., Khattak M. I., et al. (2023). Multi-method analysis of histopathological image for early diagnosis of oral squamous cell carcinoma using deep learning and hybrid techniques. Cancer 15:5247. doi: 10.3390/cancers15215247, PMID: - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
