Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 3:8:357.
doi: 10.3389/fpubh.2020.00357. eCollection 2020.

COVID-19 Patient Health Prediction Using Boosted Random Forest Algorithm

Affiliations

COVID-19 Patient Health Prediction Using Boosted Random Forest Algorithm

Celestine Iwendi et al. Front Public Health. .

Abstract

Integration of artificial intelligence (AI) techniques in wireless infrastructure, real-time collection, and processing of end-user devices is now in high demand. It is now superlative to use AI to detect and predict pandemics of a colossal nature. The Coronavirus disease 2019 (COVID-19) pandemic, which originated in Wuhan China, has had disastrous effects on the global community and has overburdened advanced healthcare systems throughout the world. Globally; over 4,063,525 confirmed cases and 282,244 deaths have been recorded as of 11th May 2020, according to the European Centre for Disease Prevention and Control agency. However, the current rapid and exponential rise in the number of patients has necessitated efficient and quick prediction of the possible outcome of an infected patient for appropriate treatment using AI techniques. This paper proposes a fine-tuned Random Forest model boosted by the AdaBoost algorithm. The model uses the COVID-19 patient's geographical, travel, health, and demographic data to predict the severity of the case and the possible outcome, recovery, or death. The model has an accuracy of 94% and a F1 Score of 0.86 on the dataset used. The data analysis reveals a positive correlation between patients' gender and deaths, and also indicates that the majority of patients are aged between 20 and 70 years.

Keywords: COVID-19; boosting; healthcare analytics; infection; patient data; random forest classification.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Symptoms in patients.
Figure 2
Figure 2
Correlation between data features.
Figure 3
Figure 3
Evaluation metrics for decision tree.
Figure 4
Figure 4
Evaluation metrics for SVM classifier.
Figure 5
Figure 5
Evaluation metrics for Gaussian NB.
Figure 6
Figure 6
Evaluation metrics for Boosted Random Forest.
Figure 7
Figure 7
Decision tree.
Figure 8
Figure 8
Decision tree 1.
Figure 9
Figure 9
Decision tree 10.
Figure 10
Figure 10
Decision tree 25.
Figure 11
Figure 11
Decision tree 100.
Figure 12
Figure 12
Comparison of Models' performance.

References

    1. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. . Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. (2020) 395:497–506. 10.1016/S0140-6736(20)30183-5 - DOI - PMC - PubMed
    1. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. . Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. N Engl J Med. (2020) 382:1199–207. 10.1056/NEJMoa2001316 - DOI - PMC - PubMed
    1. Chen N, Zhou M, Dong X, Qu J, Gong F, Han Y, et al. . Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. (2020) 395:507–13. 10.1016/S0140-6736(20)30211-7 - DOI - PMC - PubMed
    1. Clinical Management of Severe Acute Respiratory Infection When Novel Coronavirus (2019-nCoV) Infection Is Suspected: Interim Guidance. (2020). Available online at: https://apps.who.int/iris/handle/10665/330893 (accessed April 31, 2020).
    1. Zumla A, Hui DS, Perlman S. Middle East respiratory syndrome. Lancet. (2015) 386:995–1007. 10.1016/S0140-6736(15)60454-8 - DOI - PMC - PubMed

Publication types