Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 30:15:1430899.
doi: 10.3389/fimmu.2024.1430899. eCollection 2024.

Development of a COVID-19 early risk assessment system based on multiple machine learning algorithms and routine blood tests: a real-world study

Affiliations

Development of a COVID-19 early risk assessment system based on multiple machine learning algorithms and routine blood tests: a real-world study

Qiangqiang Qin et al. Front Immunol. .

Abstract

Backgrounds: During the Coronavirus Disease 2019 (COVID-19) epidemic, the massive spread of the disease has placed an enormous burden on the world's healthcare and economy. The early risk assessment system based on a variety of machine learning (ML) algorithms may be able to provide more accurate advice on the classification of COVID-19 patients, offering predictive, preventive, and personalized medicine (PPPM) solutions in the future.

Methods: In this retrospective study, we divided a portion of the data into training and validation cohorts in a 7:3 ratio and established a model based on a combination of two ML algorithms first. Then, we used another portion of the data as an independent testing cohort to determine the most accurate and stable model and compared it with other scoring systems. Finally, patients were categorized according to risk scores and then the correlation between their clinical data and risk scores was studied.

Results: The elderly accounted for the majority of hospitalized patients with COVID-19. The C-index of the model constructed by combining the stepcox[both] and survivalSVM algorithms was 0.840 in the training cohort and 0.815 in the validation cohort, which was calculated to have the highest C-index in the testing cohort compared to the other 119 ML model combinations. Compared with current scoring systems, including the CURB-65 and several reported prognosis models previously, our model had the highest AUC value of 0.778, representing an even higher predictive performance. In addition, the model's AUC values for specific time intervals, including days 7,14 and 28, demonstrate excellent predictive performance. Most importantly, we stratified patients according to the model's risk score and demonstrated a difference in survival status between the high-risk, median-risk, and low-risk groups, which means a new and stable risk assessment system was built. Finally, we found that COVID-19 patients with a history of cerebral infarction had a significantly higher risk of death.

Conclusion: This novel risk assessment system is highly accurate in predicting the prognosis of patients with COVID-19, especially elderly patients with COVID-19, and can be well applied within the PPPM framework. Our ML model facilitates stratified patient management, meanwhile promoting the optimal use of healthcare resources.

Keywords: COVID-19; categorized treatment; machine learning; predictive model; predictive preventive personalized medicine.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Flowchart of this research.
Figure 2
Figure 2
A comprehensive set of 119 prediction models and their predictive performance was systematically evaluated in the training, validation and testing cohorts using the C-index as the primary performance metric.
Figure 3
Figure 3
Prediction performance of ML-based model in training, validation and testing cohort. (A–C) Receiver operating characteristic (ROC) curve of ML-based model and other evaluation methods. (D–F) Time-dependent receiver operating characteristic (ROC) curve of ML-based model in 7-days, 14-days and 28-days. (G) Comparisons of ML-based model and other published models. *** p < 0.001.
Figure 4
Figure 4
Stratification survival analysis in training (A), validation (B), and testing cohort (C).
Figure 5
Figure 5
Correlation analysis between ML-based model and the results of clinical parameters and laboratory testing. (A) Butterfly plot demonstrates the correlation between the ML-based model and patients’ baseline characteristics, clinical vital signs, and laboratory results. (B) Raincloud plot illustrated the correlation between ML-model risk score and cerebral infarction.

Similar articles

References

    1. Che Y, Jiang D, Zhang Y, Zhang J, Xu T, Sun Y, et al. . Elevated ubiquitination contributes to protective immunity against severe SARS-CoV-2 infection. Clin Transl Med. (2022) 12:e1103. doi: 10.1002/ctm2.1103 - DOI - PMC - PubMed
    1. Jeong YJ, Wi YM, Park H, Lee JE, Kim SH, Lee KS. Current and emerging knowledge in COVID-19. Radiology. (2023) 306:e222462. doi: 10.1148/radiol.222462 - DOI - PMC - PubMed
    1. Bartsch SM, Ferguson MC, McKinnell JA, O'Shea KJ, Wedlock PT, Siegmund SS, et al. . The potential health care costs and resource use associated with COVID-19 in the United States. Health Aff (Millwood). (2020) 39:927–35. doi: 10.1377/hlthaff.2020.00426 - DOI - PMC - PubMed
    1. Sjodin H, Johansson AF, Brannstrom A, Farooq Z, Kriit HK, Wilder-Smith A, et al. . COVID-19 healthcare demand and mortality in Sweden in response to non-pharmaceutical mitigation and suppression scenarios. Int J Epidemiol. (2020) 49:1443–53. doi: 10.1093/ije/dyaa121 - DOI - PMC - PubMed
    1. Alenzi KA, Al-Malky HS, Altebainawi AF, Abushomi HQ, Alatawi FO, Atwadi MH, et al. . Health economic burden of COVID-19 in Saudi Arabia. Front Public Health. (2022) 10:927494. doi: 10.3389/fpubh.2022.927494 - DOI - PMC - PubMed

LinkOut - more resources