Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 15;18(2):e0281666.
doi: 10.1371/journal.pone.0281666. eCollection 2023.

Using machine learning to improve our understanding of COVID-19 infection in children

Affiliations

Using machine learning to improve our understanding of COVID-19 infection in children

Shraddha Piparia et al. PLoS One. .

Abstract

Purpose: Children are at elevated risk for COVID-19 (SARS-CoV-2) infection due to their social behaviors. The purpose of this study was to determine if usage of radiological chest X-rays impressions can help predict whether a young adult has COVID-19 infection or not.

Methods: A total of 2572 chest impressions from 721 individuals under the age of 18 years were considered for this study. An ensemble learning method, Random Forest Classifier (RFC), was used for classification of patients suffering from infection.

Results: Five RFC models were implemented with incremental features and the best model achieved an F1-score of 0.79 with Area Under the ROC curve as 0.85 using all input features. Hyper parameter tuning and cross validation was performed using grid search cross validation and SHAP model was used to determine feature importance. The radiological features such as pneumonia, small airways disease, and atelectasis (confounded with catheter) were found to be highly associated with predicting the status of COVID-19 infection.

Conclusions: In this sample, radiological X-ray films can predict the status of COVID-19 infection with good accuracy. The multivariate model including symptoms presented around the time of COVID-19 test yielded good prediction score.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Flow chart used to divide Chest X-Ray impressions into three categories.
The scans were preprocessed and then given as an input for categorization. These categories include Normal, Stable, and New findings.
Fig 2
Fig 2. Frequencies of each feature in COVID-19 positive and negative patients.
Fig 3
Fig 3. Random forest classifier performance for identifying COVID-19 infection.
The ROC curves shown above identifies the ability of all ML models to classify pediatric patients with COVID-19 infection.
Fig 4
Fig 4. SHAP values for most important model features in model 5 shown in decreasing order of their importance along the Y-axis.
Features in the upper case indicate RoS data, the lower case without an underscore represents features obtained from CXRi, and some additional demographic features. The top 20 features for the Random Forest classifier using a total of 76 features are shown using model 4. Each dot on the X-axis represents the importance value of the corresponding feature for each patient. The location of each dot indicates whether the feature is positively or negatively associated with the output. The color of each dot indicates whether the value is high (shown in red) or the value is low (indicated in blue).
Fig 5
Fig 5
SHAP values for two patients with absence (a) and presence (b) of COVID-19 infection. The SHAP values above indicate the impact of a particular feature with a certain value in comparison to the prediction made if the feature took some baseline value. As observed in (a), the absence of pneumonia, atelectasis, and small airways disease indicates the absence of COVID-19 infection and the presence of these features in (b) indicates the presence of COVID-19 infection.

References

    1. Kenneth McIntosh MD. COVID-19: Epidemiology, virology, and prevention. In: UpToDate, Post TW (Ed), UpToDate, Waltham, MA. (Accessed on August 14, 2021)
    1. Safiabadi Tali SH, LeBlanc JJ, Sadiq Z, Oyewunmi OD, Camargo C, Nikpour B, et al.. Tools and techniques for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)/COVID-19 detection. Clinical microbiology reviews. 2021. May 12;34(3):e00228–20. doi: 10.1128/CMR.00228-20 - DOI - PMC - PubMed
    1. Böger B, Fachi MM, Vilhena RO, Cobre AF, Tonin FS, Pontarolo R. Systematic review with meta-analysis of the accuracy of diagnostic tests for COVID-19. American journal of infection control. 2021. Jan 1;49(1):21–9. doi: 10.1016/j.ajic.2020.07.011 - DOI - PMC - PubMed
    1. Ochani R, Asad A, Yasmin F, Shaikh S, Khalid H, Batra S, et al.. COVID-19 pandemic: from origins to outcomes. A comprehensive review of viral pathogenesis, clinical manifestations, diagnostic evaluation, and management. Infez Med. 2021. Mar 1;29(1):20–36. - PubMed
    1. Ravert RD, Fu LY, Zimet GD. Young Adults’ COVID-19 Testing Intentions: The Role of Health Beliefs and Anticipated Regret. Journal of Adolescent Health. 2021. Mar 1;68(3):460–3. doi: 10.1016/j.jadohealth.2020.12.001 - DOI - PMC - PubMed

Publication types