Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec;53(1):257-266.
doi: 10.1080/07853890.2020.1868564.

Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study

Affiliations

Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study

Xin Guan et al. Ann Med. 2021 Dec.

Abstract

Objectives: To appraise effective predictors for COVID-19 mortality in a retrospective cohort study.

Methods: A total of 1270 COVID-19 patients, including 984 admitted in Sino French New City Branch (training and internal validation sets randomly split at 7:3 ratio) and 286 admitted in Optical Valley Branch (external validation set) of Wuhan Tongji hospital, were included in this study. Forty-eight clinical and laboratory features were screened with LASSO method. Further multi-tree extreme gradient boosting (XGBoost) machine learning-based model was used to rank importance of features selected from LASSO and subsequently constructed death risk prediction model with simple-tree XGBoost model. Performances of models were evaluated by AUC, prediction accuracy, precision, and F1 scores.

Results: Six features, including disease severity, age, levels of high-sensitivity C-reactive protein (hs-CRP), lactate dehydrogenase (LDH), ferritin, and interleukin-10 (IL-10), were selected as predictors for COVID-19 mortality. Simple-tree XGBoost model conducted by these features can predict death risk accurately with >90% precision and >85% sensitivity, as well as F1 scores >0.90 in training and validation sets.

Conclusion: We proposed the disease severity, age, serum levels of hs-CRP, LDH, ferritin, and IL-10 as significant predictors for death risk of COVID-19, which may help to identify the high-risk COVID-19 cases. KEY MESSAGES A machine learning method is used to build death risk model for COVID-19 patients. Disease severity, age, hs-CRP, LDH, ferritin, and IL-10 are death risk factors. These findings may help to identify the high-risk COVID-19 cases.

Keywords: COVID-19; extreme gradient boosting; fatal risk; machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they do not have any conflicts of interest.

Figures

Figure 1.
Figure 1.
A decision rule using 6 key clinical features and their thresholds in absolute values among COVID-19 patients admitted in the Sino French New City Branch of Wuhan Tongji Hospital (n = 787). Num: the number of patients in a class; T: the number of correctly classified; F: the number of misclassified patients.
Figure 2.
Figure 2.
ROC curves for COVID-19 patients in the training (A) and internal validation (B) sets from Sino French New City Branch of Wuhan Tongji hospital (n = 787) and in the external validation set (C) from Optical Valley Branch of Wuhan Tongji Hospital (n = 286). Since 197 subjects with missing detection for at least one of the 6 features, the remaining 787 cases were randomly split into training (n = 554) and internal validation sets (n = 233) in the ratio 7:3.

Similar articles

Cited by

References

    1. World Health Organization . Coronavirus Disease 2019 (COVID-19) Situation Reports: Weekly Epidemiological Update, 17 November 2020. https://www.who.int/publications/m/item/weekly-epidemiological-update–-1....
    1. Wu Z, McGoogan JM.. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72–314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323(13):1239–1242. - PubMed
    1. Azoulay E, Fartoukh M, Darmon M, et al. . Increased mortality in patients with severe SARS-CoV-2 infection admitted within seven days of disease onset. Intensive Care Med. 2020;46(9):1714–1719. - PMC - PubMed
    1. Liang W, Liang H, Ou L, et al. ; China Medical Treatment Expert Group for COVID-19 . Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med. 2020;180(8):1081–1089. - PMC - PubMed
    1. Albahri AS, Hamid RA, Alwan JK, Al-Qays ZT, et al. . Role of biological data mining and machine learning techniques in detecting and diagnosing the novel coronavirus (COVID-19): a systematic review. J Med Syst. 2020;44(7):122. - PMC - PubMed

MeSH terms