Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2022 Apr;50(2):359-370.
doi: 10.1007/s15010-021-01656-z. Epub 2021 Jul 19.

Prediction of COVID-19 deterioration in high-risk patients at diagnosis: an early warning score for advanced COVID-19 developed by machine learning

Collaborators, Affiliations
Multicenter Study

Prediction of COVID-19 deterioration in high-risk patients at diagnosis: an early warning score for advanced COVID-19 developed by machine learning

Carolin E M Jakob et al. Infection. 2022 Apr.

Abstract

Purpose: While more advanced COVID-19 necessitates medical interventions and hospitalization, patients with mild COVID-19 do not require this. Identifying patients at risk of progressing to advanced COVID-19 might guide treatment decisions, particularly for better prioritizing patients in need for hospitalization.

Methods: We developed a machine learning-based predictor for deriving a clinical score identifying patients with asymptomatic/mild COVID-19 at risk of progressing to advanced COVID-19. Clinical data from SARS-CoV-2 positive patients from the multicenter Lean European Open Survey on SARS-CoV-2 Infected Patients (LEOSS) were used for discovery (2020-03-16 to 2020-07-14) and validation (data from 2020-07-15 to 2021-02-16).

Results: The LEOSS dataset contains 473 baseline patient parameters measured at the first patient contact. After training the predictor model on a training dataset comprising 1233 patients, 20 of the 473 parameters were selected for the predictor model. From the predictor model, we delineated a composite predictive score (SACOV-19, Score for the prediction of an Advanced stage of COVID-19) with eleven variables. In the validation cohort (n = 2264 patients), we observed good prediction performance with an area under the curve (AUC) of 0.73 ± 0.01. Besides temperature, age, body mass index and smoking habit, variables indicating pulmonary involvement (respiration rate, oxygen saturation, dyspnea), inflammation (CRP, LDH, lymphocyte counts), and acute kidney injury at diagnosis were identified. For better interpretability, the predictor was translated into a web interface.

Conclusion: We present a machine learning-based predictor model and a clinical score for identifying patients at risk of developing advanced COVID-19.

Keywords: Advanced stage; COVID-19; Complicated stage; LEOSS; Machine learning; Predictive model.

PubMed Disclaimer

Conflict of interest statement

Authors declare no conflict of interest.

Figures

Fig. 1
Fig. 1
Machine-learning scheme and results. A Schematic workflow illustrating the iterative reduction of variables. First, the best performing predictor was selected based on all baseline variables (“base predictor”). Next, variables were removed following an iterative optimization procedure leading to the slim predictor and the minimalistic predictor. B Ranking of the variables for the “slim predictor” by their scaled importance. Values in parentheses depict the relative importance. C Performance (area under the curve [AUC] and accuracy) of predictors during the iterative optimization in a top down procedure. From right to left, the procedure started with n = 61 variables removing one variable at a time (displayed on the x-axis). The performance declined considerably at n = 21 variables which led to the selection of the minimalistic predictor just before this decline. D Receiver operating characteristics (ROC) curve of the minimalistic predictor on the test and validation set
Fig. 2
Fig. 2
Stability and performance of the minimalistic predictor. A Performance of the minimalistic predictor when leaving out one variable at a time (displayed on the x-axis). Leaving out a single variable from the identified binary variables did not markedly influence the performance. B Receiver operating characteristics (ROC) curve of the minimalistic predictor on the validation set considering only patients without any missing value (in the list of n = 20 variables for the minimalistic predictor). C ROC curve of SACOV-19 based on patients of the validation set without missing values (in the list of the 11 patient variables of SACOV-19)

References

    1. Okada P, Buathong R, Phuygun S, Thanadachakul T, Parnmen S, Wongboot W, et al. Early transmission patterns of coronavirus disease 2019 (COVID-19) in travellers from Wuhan to Thailand, January 2020. Eurosurveillance 2020;25:2000097. - PMC - PubMed
    1. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses The species severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat Microbiol. 2020;5:536–544. doi: 10.1038/s41564-020-0695-z. - DOI - PMC - PubMed
    1. WHO. Weekly operational update on COVID-19. 2020. http://www.who.int.
    1. Guan W, Ni Z, Hu Y, Liang W, Ou C, He J, et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382:1708–1720. doi: 10.1056/NEJMoa2002032. - DOI - PMC - PubMed
    1. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–1062. doi: 10.1016/S0140-6736(20)30566-3. - DOI - PMC - PubMed

Publication types

LinkOut - more resources