Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 17:3:662343.
doi: 10.3389/fdgth.2021.662343. eCollection 2021.

Explainable Machine Learning for COVID-19 Pneumonia Classification With Texture-Based Features Extraction in Chest Radiography

Affiliations

Explainable Machine Learning for COVID-19 Pneumonia Classification With Texture-Based Features Extraction in Chest Radiography

Luís Vinícius de Moura et al. Front Digit Health. .

Abstract

Both reverse transcription-PCR (RT-PCR) and chest X-rays are used for the diagnosis of the coronavirus disease-2019 (COVID-19). However, COVID-19 pneumonia does not have a defined set of radiological findings. Our work aims to investigate radiomic features and classification models to differentiate chest X-ray images of COVID-19-based pneumonia and other types of lung patterns. The goal is to provide grounds for understanding the distinctive COVID-19 radiographic texture features using supervised ensemble machine learning methods based on trees through the interpretable Shapley Additive Explanations (SHAP) approach. We use 2,611 COVID-19 chest X-ray images and 2,611 non-COVID-19 chest X-rays. After segmenting the lung in three zones and laterally, a histogram normalization is applied, and radiomic features are extracted. SHAP recursive feature elimination with cross-validation is used to select features. Hyperparameter optimization of XGBoost and Random Forest ensemble tree models is applied using random search. The best classification model was XGBoost, with an accuracy of 0.82 and a sensitivity of 0.82. The explainable model showed the importance of the middle left and superior right lung zones in classifying COVID-19 pneumonia from other lung patterns.

Keywords: SHAP; X-rays; coronavirus; explainable models; machine learning; radiological findings; radiomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Examples of CXR images from COVID-19 and non-COVID-19 datasets.
Figure 2
Figure 2
Workflow of the segmentation process.
Figure 3
Figure 3
Feature importance for each radiomic feature with the XGBoost model.
Figure 4
Figure 4
Impact of important features in the XGBoost model output.
Figure 5
Figure 5
Example of SHAP values affecting XGBoost model output for a single COVID-19 CXR image.
Figure 6
Figure 6
Example of SHAP values affecting XGBoost model output for a single non-COVID-19 CXR image.

References

    1. Zou L, Dai L, Zhang Y, Fu W, Gao Y, Zhang Z, et al. . Clinical characteristics and risk factors for disease severity and death in patients with coronavirus disease 2019 in Wuhan, China. Front Med. (2020) 7:532. 10.3389/fmed.2020.00532 - DOI - PMC - PubMed
    1. Sohrabi C, Alsafi Z, O'Neill N, Khan M, Kerwan A, Al-Jabir A, et al. . World Health Organization declares global emergency: a review of the 2019 novel coronavirus (COVID-19). Int J Surgery. (2020) 76:71–6. 10.1016/j.ijsu.2020.02.034 - DOI - PMC - PubMed
    1. Aljondi R, Alghamdi S. Diagnostic value of imaging modalities for COVID-19: scoping review. J Med Internet Res. (2020) 22:e19673. 10.2196/19673 - DOI - PMC - PubMed
    1. WHO . Coronavirus Disease (COVID-19) Dashboard. (2020). Available online at: https://covid19.who.int/ (accessed January 31, 2021).
    1. Zuckerman N, Pando R, Bucris E, Drori Y, Lustig Y, Erster O, et al. . Comprehensive analyses of SARS-CoV-2 transmission in a public health virology laboratory. Viruses. (2020) 12:854. 10.3390/v12080854 - DOI - PMC - PubMed

LinkOut - more resources