Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 12;19(3):e0296352.
doi: 10.1371/journal.pone.0296352. eCollection 2024.

Multi-modal deep learning methods for classification of chest diseases using different medical imaging and cough sounds

Affiliations

Multi-modal deep learning methods for classification of chest diseases using different medical imaging and cough sounds

Hassaan Malik et al. PLoS One. .

Abstract

Chest disease refers to a wide range of conditions affecting the lungs, such as COVID-19, lung cancer (LC), consolidation lung (COL), and many more. When diagnosing chest disorders medical professionals may be thrown off by the overlapping symptoms (such as fever, cough, sore throat, etc.). Additionally, researchers and medical professionals make use of chest X-rays (CXR), cough sounds, and computed tomography (CT) scans to diagnose chest disorders. The present study aims to classify the nine different conditions of chest disorders, including COVID-19, LC, COL, atelectasis (ATE), tuberculosis (TB), pneumothorax (PNEUTH), edema (EDE), pneumonia (PNEU). Thus, we suggested four novel convolutional neural network (CNN) models that train distinct image-level representations for nine different chest disease classifications by extracting features from images. Furthermore, the proposed CNN employed several new approaches such as a max-pooling layer, batch normalization layers (BANL), dropout, rank-based average pooling (RBAP), and multiple-way data generation (MWDG). The scalogram method is utilized to transform the sounds of coughing into a visual representation. Before beginning to train the model that has been developed, the SMOTE approach is used to calibrate the CXR and CT scans as well as the cough sound images (CSI) of nine different chest disorders. The CXR, CT scan, and CSI used for training and evaluating the proposed model come from 24 publicly available benchmark chest illness datasets. The classification performance of the proposed model is compared with that of seven baseline models, namely Vgg-19, ResNet-101, ResNet-50, DenseNet-121, EfficientNetB0, DenseNet-201, and Inception-V3, in addition to state-of-the-art (SOTA) classifiers. The effectiveness of the proposed model is further demonstrated by the results of the ablation experiments. The proposed model was successful in achieving an accuracy of 99.01%, making it superior to both the baseline models and the SOTA classifiers. As a result, the proposed approach is capable of offering significant support to radiologists and other medical professionals.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. CSI, CXR, and CT scans are the three diagnostic tools that are indicated for use in the process of identifying a variety of chest disorders.
Fig 2
Fig 2. Sample CXR and CT scan images of multiple chest diseases.
Fig 3
Fig 3. Scalogram image of multiple chest diseases coughs sound.
Fig 4
Fig 4. Steps of conducting pre-processing.
Fig 5
Fig 5. Summary of the proposed models.
Fig 6
Fig 6. Generating CXR, CT scan, and CSI using MWDG.
(a) Rotation, (b) HST, (c) VST, (d) NI, (e) GCOR, (f) RTS, and (g) Scaling.
Fig 7
Fig 7. Training-Validation accuracy and loss.
(a) Vgg-19, (b) ResNet-101, (c) ResNet-50, (d) DenseNet-121, (e) Inception-V3, (f) EfficientNetB0, (g) DenseNet-201, and (h) Proposed P(4) model.
Fig 8
Fig 8. Confusion matrix.
(a) Vgg-19, (b) ResNet-101, (c) ResNet-50, (d) DenseNet-121, (e) Inception-V3, (f) Proposed P(4) model, (g) EfficientNetB0, and (h) DenseNet-201.
Fig 9
Fig 9. AU(ROC) for class wise evaluation of chest diseases.
(a) Vgg-19, (b) ResNet-101, (c) ResNet-50, (d) DenseNet-121, (e) Inception-V3, (f) Proposed P(4) model, (g) EfficientNetB0, and (h) DenseNet-201.
Fig 10
Fig 10. GRAD-CAM visualization of the proposed model for highlighting the infected region of nine chest diseases.

Similar articles

Cited by

References

    1. WHO Coronavirus (COVID-19) Dashboard|WHO Coronavirus (COVID-19) Dashboard with Vaccination Data. Available online: https://covid19.who.int/ (accessed on 10th February 2023).
    1. Zhu Na, Zhang Dingyu, Wang Wenling, Li Xingwang, Yang Bo, Song Jingdong, Zhao Xiang et al.. "A novel coronavirus from patients with pneumonia in China, 2019." New England journal of medicine (2020). doi: 10.1056/NEJMoa2001017 - DOI - PMC - PubMed
    1. Arevalo-Rodriguez Ingrid, Diana Buitrago-Garcia Daniel Simancas-Racines, Paula Zambrano-Achig Rosa Del Campo, Ciapponi Agustin, Sued Omar et al.. "False-negative results of initial RT-PCR assays for COVID-19: a systematic review." PloS one 15, no. 12 (2020): e0242958. doi: 10.1371/journal.pone.0242958 - DOI - PMC - PubMed
    1. Woloshin Steven, Patel Neeraj, and Kesselheim Aaron S. "False negative tests for SARS-CoV-2 infection—challenges and implications." New England Journal of Medicine 383, no. 6 (2020): e38. doi: 10.1056/NEJMp2015897 - DOI - PubMed
    1. Pontone Gianluca, Scafuri Stefano, Maria Elisabetta Mancini Cecilia Agalbato, Guglielmo Marco, Baggiano Andrea, et al.. "Role of computed tomography in COVID-19." Journal of cardiovascular computed tomography 15, no. 1 (2021): 27–36. doi: 10.1016/j.jcct.2020.08.013 - DOI - PMC - PubMed