Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study
- PMID: 33410720
- PMCID: PMC7799376
- DOI: 10.1080/07853890.2020.1868564
Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study
Abstract
Objectives: To appraise effective predictors for COVID-19 mortality in a retrospective cohort study.
Methods: A total of 1270 COVID-19 patients, including 984 admitted in Sino French New City Branch (training and internal validation sets randomly split at 7:3 ratio) and 286 admitted in Optical Valley Branch (external validation set) of Wuhan Tongji hospital, were included in this study. Forty-eight clinical and laboratory features were screened with LASSO method. Further multi-tree extreme gradient boosting (XGBoost) machine learning-based model was used to rank importance of features selected from LASSO and subsequently constructed death risk prediction model with simple-tree XGBoost model. Performances of models were evaluated by AUC, prediction accuracy, precision, and F1 scores.
Results: Six features, including disease severity, age, levels of high-sensitivity C-reactive protein (hs-CRP), lactate dehydrogenase (LDH), ferritin, and interleukin-10 (IL-10), were selected as predictors for COVID-19 mortality. Simple-tree XGBoost model conducted by these features can predict death risk accurately with >90% precision and >85% sensitivity, as well as F1 scores >0.90 in training and validation sets.
Conclusion: We proposed the disease severity, age, serum levels of hs-CRP, LDH, ferritin, and IL-10 as significant predictors for death risk of COVID-19, which may help to identify the high-risk COVID-19 cases. KEY MESSAGES A machine learning method is used to build death risk model for COVID-19 patients. Disease severity, age, hs-CRP, LDH, ferritin, and IL-10 are death risk factors. These findings may help to identify the high-risk COVID-19 cases.
Keywords: COVID-19; extreme gradient boosting; fatal risk; machine learning.
Conflict of interest statement
The authors declare that they do not have any conflicts of interest.
Figures


Similar articles
-
Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation.J Med Internet Res. 2020 Nov 6;22(11):e24018. doi: 10.2196/24018. J Med Internet Res. 2020. PMID: 33027032 Free PMC article.
-
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0. BMC Public Health. 2024. PMID: 38943093 Free PMC article.
-
Uncovering Clinical Risk Factors and Predicting Severe COVID-19 Cases Using UK Biobank Data: Machine Learning Approach.JMIR Public Health Surveill. 2021 Sep 30;7(9):e29544. doi: 10.2196/29544. JMIR Public Health Surveill. 2021. PMID: 34591027 Free PMC article.
-
Predictors of adverse prognosis in COVID-19: A systematic review and meta-analysis.Eur J Clin Invest. 2020 Oct;50(10):e13362. doi: 10.1111/eci.13362. Epub 2020 Aug 27. Eur J Clin Invest. 2020. PMID: 32726868
-
Clinical Features and Laboratory Examination to Identify Severe Patients with COVID-19: A Systematic Review and Meta-Analysis.Biomed Res Int. 2021 Nov 15;2021:6671291. doi: 10.1155/2021/6671291. eCollection 2021. Biomed Res Int. 2021. PMID: 34796234 Free PMC article.
Cited by
-
Machine-learning-derived predictive score for early estimation of COVID-19 mortality risk in hospitalized patients.PLoS One. 2022 Sep 22;17(9):e0274171. doi: 10.1371/journal.pone.0274171. eCollection 2022. PLoS One. 2022. PMID: 36137106 Free PMC article.
-
Prediction of in-hospital hypokalemia using machine learning and first hospitalization day records in patients with traumatic brain injury.CNS Neurosci Ther. 2023 Jan;29(1):181-191. doi: 10.1111/cns.13993. Epub 2022 Oct 18. CNS Neurosci Ther. 2023. PMID: 36258296 Free PMC article.
-
Prediction of bone metastasis in non-small cell lung cancer based on machine learning.Front Oncol. 2023 Jan 9;12:1054300. doi: 10.3389/fonc.2022.1054300. eCollection 2022. Front Oncol. 2023. PMID: 36698411 Free PMC article.
-
Clinical Characteristics Associated with Bacterial Bloodstream Coinfection in COVID-19.Infect Dis Ther. 2022 Jun;11(3):1281-1296. doi: 10.1007/s40121-022-00636-6. Epub 2022 May 11. Infect Dis Ther. 2022. PMID: 35538335 Free PMC article.
-
Reliability of predictive models to support early decision making in the emergency department for patients with confirmed diagnosis of COVID-19: the Pescara Covid Hospital score.BMC Health Serv Res. 2022 Aug 19;22(1):1062. doi: 10.1186/s12913-022-08421-4. BMC Health Serv Res. 2022. PMID: 35986291 Free PMC article.
References
-
- World Health Organization . Coronavirus Disease 2019 (COVID-19) Situation Reports: Weekly Epidemiological Update, 17 November 2020. https://www.who.int/publications/m/item/weekly-epidemiological-update–-1....
-
- Wu Z, McGoogan JM.. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72–314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323(13):1239–1242. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials
Miscellaneous