IEViT: An enhanced vision transformer architecture for chest X-ray image classification
- PMID: 36162246
- DOI: 10.1016/j.cmpb.2022.107141
Abstract
Background and objective: Chest X-ray imaging is a relatively cheap and accessible diagnostic tool that can assist in the diagnosis of various conditions, including pneumonia, tuberculosis, and COVID-19. However, the need for expert radiologists to view and interpret chest X-ray images can be a bottleneck, especially in remote and deprived areas. Recent advances in machine learning have made automated diagnosis from chest X-ray scans possible. In this work, we examine the use of a novel Transformer-based deep learning model for the task of chest X-ray image classification.
Methods: We first examine the performance of the Vision Transformer (ViT), a state-of-the-art image classification model, on the task of chest X-ray image classification. We then propose and evaluate the Input Enhanced Vision Transformer (IEViT), a novel enhanced Vision Transformer model that can achieve improved performance on chest X-ray images associated with various pathologies.
Results: Experiments on four chest X-ray image data sets containing various pathologies (tuberculosis, pneumonia, COVID-19) demonstrated that the proposed IEViT model outperformed ViT for all the data sets and variants examined, achieving an F1-score between 96.39% and 100%, and an improvement over ViT of up to +5.82% in terms of F1-score across the four examined data sets. IEViT's maximum sensitivity (recall) ranged between 93.50% and 100% across the four data sets, with an improvement over ViT of up to +3%, whereas IEViT's maximum precision ranged between 97.96% and 100% across the four data sets, with an improvement over ViT of up to +6.41%.
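For readers less familiar with the metrics reported above, the following is a minimal sketch of how precision, sensitivity (recall), and F1-score are computed for a binary classifier. The function name `precision_recall_f1` and the toy labels are illustrative assumptions and are not taken from the paper; the abstract does not state how the metrics were averaged for data sets with more than two classes.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Return (precision, recall, F1) for the class labelled `positive`.

    Hypothetical helper for illustration only; not the authors' evaluation code.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives

    # Precision: fraction of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall (sensitivity): fraction of true positives that were detected.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1-score: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy labels (made up): 1 = pathology present, 0 = normal.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.2%} recall={recall:.2%} f1={f1:.2%}")
```

Because F1 is the harmonic mean of precision and recall, it is high only when both are high, which is why it is often reported alongside the two individual metrics.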
Conclusions: The proposed IEViT model outperformed all ViT variants on all the examined chest X-ray image data sets, demonstrating its superiority and generalisation ability. Given the relatively low cost and widespread accessibility of chest X-ray imaging, the proposed IEViT model can potentially offer a powerful, yet relatively cheap and accessible, method for assisting diagnosis using chest X-ray images.
Keywords: Chest radiography; Deep learning; Image classification; Vision transformer; X-Rays.
Copyright © 2022 The Author(s). Published by Elsevier B.V. All rights reserved.