Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 20;24(8):2641.
doi: 10.3390/s24082641.

COVID-19 Hierarchical Classification Using a Deep Learning Multi-Modal

Affiliations

COVID-19 Hierarchical Classification Using a Deep Learning Multi-Modal

Albatoul S Althenayan et al. Sensors (Basel). .

Abstract

Coronavirus disease 2019 (COVID-19), originating in China, has rapidly spread worldwide. Physicians must examine infected patients and make timely decisions to isolate them. However, completing these processes is difficult due to limited time and availability of expert radiologists, as well as limitations of the reverse-transcription polymerase chain reaction (RT-PCR) method. Deep learning, a sophisticated machine learning technique, leverages radiological imaging modalities for disease diagnosis and image classification tasks. Previous research on COVID-19 classification has encountered several limitations, including binary classification methods, single-feature modalities, small public datasets, and reliance on CT diagnostic processes. Additionally, studies have often utilized a flat structure, disregarding the hierarchical structure of pneumonia classification. This study aims to overcome these limitations by identifying pneumonia caused by COVID-19, distinguishing it from other types of pneumonia and healthy lungs using chest X-ray (CXR) images and related tabular medical data, and demonstrate the value of incorporating tabular medical data in achieving more accurate diagnoses. Resnet-based and VGG-based pre-trained convolutional neural network (CNN) models were employed to extract features, which were then combined using early fusion for the classification of eight distinct classes. We leveraged the hierarchal structure of pneumonia classification within our approach to achieve improved classification outcomes. Since an imbalanced dataset is common in this field, a variety of versions of generative adversarial networks (GANs) were used to generate synthetic data. The proposed approach tested in our private datasets of 4523 patients achieved a macro-avg F1-score of 95.9% and an F1-score of 87.5% for COVID-19 identification using a Resnet-based structure. In conclusion, in this study, we were able to create an accurate deep learning multi-modal to diagnose COVID-19 and differentiate it from other kinds of pneumonia and normal lungs, which will enhance the radiological diagnostic process.

Keywords: COVID-19; CXR; artificial intelligence; deep learning; diagnosis; hierarchical; image classification; multi-classes; multi-modal; pneumonia.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Proposed hierarchal class structure of pneumonia.
Figure 2
Figure 2
Proposed framework for multi-modal classification of CXR images and tabular data.
Figure 3
Figure 3
CXR samples of the dataset.
Figure 4
Figure 4
Exploratory analysis of the numeric features.
Figure 5
Figure 5
Correlation of the continuous features in the dataset.
Figure 6
Figure 6
An example of the rib shadow elimination process and result.
Figure 7
Figure 7
Samples of synthetic CXR images.
Figure 8
Figure 8
VGG-like multi-modal architecture.
Figure 9
Figure 9
VGG-backbone multi-modal architecture.
Figure 10
Figure 10
ResNet-like multi-modal architecture.
Figure 11
Figure 11
ResNet-backbone multi-modal architecture.
Figure 12
Figure 12
Comparison of macro-average accuracy for all models with and without a second dataset.
Figure 13
Figure 13
Testing accuracy (one-fold) against the number of epochs for the Resnet-backbone multi-modal in the last experiments.
Figure 14
Figure 14
Training and testing loss against the number of epochs for each multi-modal in the last experiments.
Figure 15
Figure 15
Confusion matrix for each multi-modal in the last experiments.
Figure 16
Figure 16
Macro-average ROC curve across all decisions for each multi-modal in the last experiments.

Similar articles

Cited by

References

    1. Pal M., Berhanu G., Desalegn C., Kandi V. Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2): An Update. Cureus. 2020;12:e7423. doi: 10.7759/cureus.7423. - DOI - PMC - PubMed
    1. COVID-19 Cases|WHO COVID-19 Dashboard. Datadot. [(accessed on 20 January 2024)]. Available online: https://data.who.int/dashboards/covid19/cases.
    1. Chowdhury M.E.H., Rahman T., Khandakar A., Mazhar R., Kadir M.A., Bin Mahbub Z., Islam K.R., Khan M.S., Iqbal A., Al Emadi N., et al. Can AI Help in Screening Viral and COVID-19 Pneumonia? IEEE Access. 2020;8:132665–132676. doi: 10.1109/ACCESS.2020.3010287. - DOI
    1. Maharjan N., Thapa N., Magar B.P., Maharjan M., Tu J. COVID-19 Diagnosed by Real-Time Reverse Transcriptase-Polymerase Chain Reaction in Nasopharyngeal Specimens of Suspected Cases in a Tertiary Care Center: A Descriptive Cross-sectional Study. J. Nepal Med. Assoc. 2021;59:464–467. doi: 10.31729/jnma.5383. - DOI - PMC - PubMed
    1. Swapnarekha H., Behera H.S., Nayak J., Naik B. Role of intelligent computing in COVID-19 prognosis: A state-of-the-art review. Chaos Solitons Fractals. 2020;138:109947. doi: 10.1016/j.chaos.2020.109947. - DOI - PMC - PubMed