A hybrid CNN-ViT framework with cross-attention fusion and data augmentation for robust brain tumor classification
- PMID: 41318692
- PMCID: PMC12756294
- DOI: 10.1038/s41598-025-28636-9
A hybrid CNN-ViT framework with cross-attention fusion and data augmentation for robust brain tumor classification
Abstract
Brain tumor classification from MRI scans is a challenging task that requires accurate and timely detection increase patient survival rates. Conventional machine learning methods with hand-crafted features often fail to handle different sizes, forms, and textures of tumors. In this study evaluates, standard transfer learning models (AlexNet, MobileNetV2, InceptionV3, ResNet50, VGG16, VGG19) and conventional classifier models such as Decision Tree, Naïve Bayes, LDA were evaluated for multiclass brain tumor classification. The Vision Transformer (ViT) which leverages global context modeling achieved accuracy of 87.34%. To further improve performance, a hybrid CNN–ViT framework named CAFNet with data augmentation and a Cross-Attention Fusion mechanism achieving a test accuracy of 96.41% on a multiclass MRI dataset. The results show that CAFNet significantly outperforms conventional machine learning, deep learning and transfer learning models for robust brain tumor classification.
Keywords: Brain tumor classification; Convolutional neural networks (CNN); Cross-attention fusion; Transfer learning; Vision transformer model.
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests.
Figures
References
-
- Kumar, R., Verma, A. & Verma, O. P. Brain tumor segmentation using MRI: A review. IEEE Access.7, 170–187 (2019).
-
- Alqudah, A. M., Alquraan, H., Qasmieh, I. A., Alqudah, A. & Al-Sharu, A. Brain tumor classification using deep learning Technique—A comparison between Cropped, Uncropped, and segmented lesion images with different sizes. Int. J. Adv. Trends Comput. Sci. Eng.9 (1), 111–118 (2020).
-
- Mahmud, T., Sarker, S. & Hossain, M. Deep Learning-Based brain tumor detection: A comprehensive review. IEEE Access.11, 43305–43323 (2023).
-
- Dhakshnamurthy, S., Kumaravel, N. & Manickavasagam, R. Brain Tumor Classification Using Deep Learning Models—A Transfer Learning Approach, Mater. Today Proc., 61, 317–323, (2022).
LinkOut - more resources
Full Text Sources
