Optimization of vision transformer-based detection of lung diseases from chest X-ray images

BMC Med Inform Decis Mak. 2024 Jul 8;24(1):191. doi: 10.1186/s12911-024-02591-3.

Jinsol Ko^# ¹ ², Soyeon Park^# ³, Hyun Goo Woo ⁴ ⁵

Affiliations

¹ Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea.
² Department of Biomedical Science, Graduate School, Ajou University, Suwon, Republic of Korea.
³ Ajou University School of Medicine, Suwon, Republic of Korea.
⁴ Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea. hg@ajou.ac.kr.
⁵ Department of Biomedical Science, Graduate School, Ajou University, Suwon, Republic of Korea. hg@ajou.ac.kr.

^# Contributed equally.

PMID: 38978027
PMCID: PMC11232177
DOI: 10.1186/s12911-024-02591-3

Jinsol Ko et al. BMC Med Inform Decis Mak. 2024.

Abstract

Background: Recent advances in Vision Transformer (ViT)-based deep learning have significantly improved the accuracy of lung disease prediction from chest X-ray images. However, limited research exists on comparing the effectiveness of different optimizers for lung disease prediction within ViT models. This study aims to systematically evaluate and compare the performance of various optimization methods for ViT-based models in predicting lung diseases from chest X-ray images.

Methods: This study utilized a chest X-ray image dataset comprising 19,003 images containing both normal cases and six lung diseases: COVID-19, Viral Pneumonia, Bacterial Pneumonia, Middle East Respiratory Syndrome (MERS), Severe Acute Respiratory Syndrome (SARS), and Tuberculosis. Each ViT model (ViT, FastViT, and CrossViT) was individually trained with each optimization method (Adam, AdamW, NAdam, RAdam, SGDW, and Momentum) to assess their performance in lung disease prediction.
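The design described above amounts to a grid of independent training runs. A minimal sketch of how such a grid could be enumerated follows. The model/dataset pairings and the three learning rates are taken from the figure captions of this record; `RunConfig` and `build_grid` are hypothetical names introduced for illustration and are not drawn from the authors' code.

```python
from dataclasses import dataclass
from itertools import product

# Optimizers named in the Methods.
OPTIMIZERS = ["Adam", "AdamW", "NAdam", "RAdam", "SGDW", "Momentum"]

# Learning rates reported in the figure captions.
LEARNING_RATES = [1e-4, 1e-5, 1e-6]

# (model, number of classes) pairings as reported in the figure captions:
# ViT on the 4-class and 7-class datasets, FastViT and CrossViT on the 7-class dataset.
MODEL_DATASETS = [("ViT", 4), ("ViT", 7), ("FastViT", 7), ("CrossViT", 7)]


@dataclass(frozen=True)
class RunConfig:
    """One training run (hypothetical container, for illustration only)."""
    model: str
    num_classes: int
    optimizer: str
    learning_rate: float


def build_grid():
    """Enumerate every model/dataset x optimizer x learning-rate combination."""
    return [
        RunConfig(model, num_classes, optimizer, lr)
        for (model, num_classes), optimizer, lr in product(
            MODEL_DATASETS, OPTIMIZERS, LEARNING_RATES
        )
    ]


grid = build_grid()
print(len(grid))  # 4 pairings * 6 optimizers * 3 learning rates = 72
```

Each `RunConfig` would then be handed to whatever training routine is in use; keeping the grid as plain data makes it easy to log or resume individual runs.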

Results: On the dataset with balanced class sizes, ViT trained with RAdam achieved the highest accuracy among the optimizers tested (95.87%). On the dataset with imbalanced class sizes, FastViT trained with NAdam performed best, with an accuracy of 97.63%.

Conclusions: We provide comprehensive optimization strategies for developing ViT-based model architectures, which can enhance the performance of these models for lung disease prediction from chest X-ray images.

Keywords: Lung disease; Optimizer; Vision transformer; X-ray.

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1**
Schematic overview of the analysis workflow

**Fig. 2**
Classification of the overall classes with various models and optimizers. A, B The 4-class (A) and 7-class (B) datasets were classified using the ViT model with each optimizer (Adam, AdamW, NAdam, RAdam, SGDW, and Momentum). C, D The 7-class dataset was classified using the FastViT (C) or CrossViT (D) model with each optimizer. The evaluation metrics were accuracy, F1-score, precision, and recall, calculated at learning rates of 10^-4, 10^-5, and 10^-6.

**Fig. 3**
Classification of each disease class with various models and optimizers. A, B Each class in the 4-class (A) and 7-class (B) datasets was classified using the ViT model with each optimizer (Adam, AdamW, NAdam, RAdam, SGDW, and Momentum). C, D Each class in the 7-class dataset was classified using the FastViT (C) or CrossViT (D) model with each optimizer. The evaluation metrics were accuracy, F1-score, precision, and recall, calculated at learning rates of 10^-4, 10^-5, and 10^-6.
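The four metrics named in the captions can all be derived from a confusion matrix. The sketch below shows one common way to compute overall accuracy and per-class precision, recall, and F1-score; the helper name `per_class_metrics` and the 3-class toy matrix are invented for illustration, and the exact averaging scheme used by the authors is not stated in this record.

```python
def per_class_metrics(confusion):
    """Return (overall accuracy, list of per-class metric dicts).

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(n))
    metrics = []
    for k in range(n):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                       # true k, predicted other
        fp = sum(confusion[i][k] for i in range(n)) - tp  # predicted k, true other
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics.append({"precision": precision, "recall": recall, "f1": f1})
    accuracy = correct / total if total else 0.0
    return accuracy, metrics


# Toy 3-class confusion matrix (made-up numbers, not from the study).
toy = [
    [8, 2, 0],
    [1, 7, 2],
    [0, 1, 9],
]
accuracy, metrics = per_class_metrics(toy)
print(f"accuracy = {accuracy:.2f}")
for k, m in enumerate(metrics):
    print(k, {name: round(value, 3) for name, value in m.items()})
```

Reporting per-class values alongside overall accuracy matters most when class sizes differ, because a high overall accuracy can coexist with weak recall on a small class.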

Cited by

Automated classification of chest X-rays: a deep learning approach with attention mechanisms.
Oltu B, Güney S, Yuksel SE, Dengiz B. BMC Med Imaging. 2025 Mar 4;25(1):71. doi: 10.1186/s12880-025-01604-5. PMID: 40038588.
Metaheuristic optimizers integrated with vision transformer model for severity detection and classification via multimodal COVID-19 images.
Padmavathi V, Ganesan K. Sci Rep. 2025 Apr 22;15(1):13941. doi: 10.1038/s41598-025-98593-w. PMID: 40263404.
A Deep Convolutional Neural Network Model for Lung Disease Detection Using Chest X-Ray Imaging.
Dardouri S. Pulm Med. 2025 Jun 24;2025:6614016. doi: 10.1155/pm/6614016. eCollection 2025. PMID: 40599379.
A ubiquitous and interoperable deep learning model for automatic detection of pleomorphic gastroesophageal lesions.
Martins M, Mascarenhas MJ, Almeida MJ, Afonso J, Ribeiro T, Cardoso P, Mendes F, Mota J, Andrade P, Cardoso H, Mascarenhas-Saraiva M, Ferreira J, Macedo G. Sci Rep. 2025 Jul 2;15(1):22889. doi: 10.1038/s41598-025-03397-7. PMID: 40594126.
