Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun;61(6):1395-1408.
doi: 10.1007/s11517-022-02746-2. Epub 2023 Jan 31.

PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer

Affiliations

PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer

Tianmu Wang et al. Med Biol Eng Comput. 2023 Jun.

Abstract

A long-standing challenge in pneumonia diagnosis is recognizing the pathological lung texture, especially the ground-glass appearance pathological texture. One main difficulty lies in precisely extracting and recognizing the pathological features. The patients, especially those with mild symptoms, show very little difference in lung texture, neither conventional computer vision methods nor convolutional neural networks perform well on pneumonia diagnosis based on chest X-ray (CXR) images. In the meanwhile, the Coronavirus Disease 2019 (COVID-19) pandemic continues wreaking havoc around the world, where quick and accurate diagnosis backed by CXR images is in high demand. Rather than simply recognizing the patterns, extracting feature maps from the original CXR image is what we need in the classification process. Thus, we propose a Vision Transformer (VIT)-based model called PneuNet to make an accurate diagnosis backed by channel-based attention through X-ray images of the lung, where multi-head attention is applied on channel patches rather than feature patches. The techniques presented in this paper are oriented toward the medical application of deep neural networks and VIT. Extensive experiment results show that our method can reach 94.96% accuracy in the three-categories classification problem on the test set, which outperforms previous deep learning models.

Keywords: COVID-19; Deep learning; Multi-head attention; Pneumonia diagnosis; Vision Transformer.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Architecture of PneuNet (a) and the details of Transformer Encoder (b)
Fig. 2
Fig. 2
Architecture of ResNet18, without flatten layer nor logit layer
Fig. 3
Fig. 3
Image partition in conventional VIT process: raw image (a) is divided into several patches (b) and each patch will be encoded and embedded with its position
Fig. 4
Fig. 4
Illustration of patches employed in Transformer modular which obtained from ResNet18
Fig. 5
Fig. 5
Typical chest X-ray images from the combined dataset: CXR image of None Pneumonia (a), CXR images of COVID-19 (b), CXR images of Bacterial Pneumonia (c), and CXR images of Viral Pneumonia (d)
Fig. 6
Fig. 6
History during training: history of cross-entropy loss (a) and history of categorical accuracy (b)
Fig. 7
Fig. 7
The confusion matrix for three-category classification: None Pneumonia, Normal Pneumonia, and COVID-19
Fig. 8
Fig. 8
Confusion matrix generated from binary classification model (a) and four-category classification model (b)

Similar articles

Cited by

References

    1. Rudan I, Boschi-Pinto C, Biloglav Z, Mulholland K, Campbell H. Epidemiology and etiology of childhood pneumonia. Bull World Health Organ. 2008;86:408–416B. doi: 10.2471/BLT.07.048769. - DOI - PMC - PubMed
    1. Loo WK, Hasikin K, Suhaimi A, Yee PL, Teo K, Xia K, Qian P, Jiang Y, Zhang Y, Dhanalakshmi S et al (2022) Systematic review on COVID-19 readmission and risk factors: future of machine learning in COVID-19 readmission studies. Front Public Health, 1311 - PMC - PubMed
    1. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis. 2020;20(5):533–534. doi: 10.1016/S1473-3099(20)30120-1. - DOI - PMC - PubMed
    1. Fang Y, Zhang H, Xie J, Lin M, Ying L, Pang P, Ji W. Sensitivity of chest CT for COVID-19: comparison to RT-PCR. Radiology. 2020;296(2):E115–E117. doi: 10.1148/radiol.2020200432. - DOI - PMC - PubMed
    1. Ng M-Y, Lee EYP, Yang J, Yang F, Li X, Wang H, Lui MM-S, Lo CS-Y, Leung B, Khong P-L, et al. Imaging profile of the COVID-19 infection: radiologic findings and literature review. Radiol Cardiothorac Imaging. 2020;2(1):e200034. doi: 10.1148/ryct.2020200034. - DOI - PMC - PubMed