Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 16:9:120422-120441.
doi: 10.1109/ACCESS.2021.3105321. eCollection 2021.

Development and Validation of an Early Scoring System for Prediction of Disease Severity in COVID-19 Using Complete Blood Count Parameters

Affiliations

Development and Validation of an Early Scoring System for Prediction of Disease Severity in COVID-19 Using Complete Blood Count Parameters

Tawsifur Rahman et al. IEEE Access. .

Abstract

The coronavirus disease 2019 (COVID-19) after outbreaking in Wuhan increasingly spread throughout the world. Fast, reliable, and easily accessible clinical assessment of the severity of the disease can help in allocating and prioritizing resources to reduce mortality. The objective of the study was to develop and validate an early scoring tool to stratify the risk of death using readily available complete blood count (CBC) biomarkers. A retrospective study was conducted on twenty-three CBC blood biomarkers for predicting disease mortality for 375 COVID-19 patients admitted to Tongji Hospital, China from January 10 to February 18, 2020. Machine learning based key biomarkers among the CBC parameters as the mortality predictors were identified. A multivariate logistic regression-based nomogram and a scoring system was developed to categorize the patients in three risk groups (low, moderate, and high) for predicting the mortality risk among COVID-19 patients. Lymphocyte count, neutrophils count, age, white blood cell count, monocytes (%), platelet count, red blood cell distribution width parameters collected at hospital admission were selected as important biomarkers for death prediction using random forest feature selection technique. A CBC score was devised for calculating the death probability of the patients and was used to categorize the patients into three sub-risk groups: low (<=5%), moderate (>5% and <=50%), and high (>50%), respectively. The area under the curve (AUC) of the model for the development and internal validation cohort were 0.961 and 0.88, respectively. The proposed model was further validated with an external cohort of 103 patients of Dhaka Medical College, Bangladesh, which exhibits in an AUC of 0.963. The proposed CBC parameter-based prognostic model and the associated web-application, can help the medical doctors to improve the management by early prediction of mortality risk of the COVID-19 patients in the low-resource countries.

Keywords: COVID-19; Complete blood count; early prediction of mortality risk; machine learning; prognostic model.

PubMed Disclaimer

Figures

FIGURE 1.
FIGURE 1.
Schematic diagram of the experimental frame work.
FIGURE 2.
FIGURE 2.
Top-ranked-10 features using random forest feature selection technique.
FIGURE 3.
FIGURE 3.
ROC curves for top-10 features using logistic regression classifier (Imputation-KNN, Feature selection–Random forest).
FIGURE 4.
FIGURE 4.
Comparison of the top-ranked 8 features identified using random forest algorithm from data imputed using KNN algorithm A) without neutrophils and lymphocyte counts and B) without neutrophils and lymphocytes percentage.
FIGURE 5.
FIGURE 5.
Confusion Matrix of the best performing combination of features using logistic regression classifier: A) without neutrophils and lymphocyte counts and B) without neutrophils and lymphocytes percentage.
FIGURE 6.
FIGURE 6.
Calibration plot comparing predicted and actual death probability of patients with COVID-19: (A) represents the internal validation using Chinese test data, and (B) represents the external validation using Bangladeshi data.
FIGURE 7.
FIGURE 7.
Decision curves analysis comparing different models to predict the death probability of patients with COVID-19. The net benefit balances the mortality risk and potential harm from unnecessary over-intervention for patients with COVID-19.
FIGURE 8.
FIGURE 8.
Multivariate logistic regression-based Nomogram to predict the probability of death. A Nomogram for prediction of death was created using the following seven predictors: Neutrophils count, Age, Platelet count, Monocytes, WBC, Lymphocyte count, Red blood cell distribution width.
FIGURE 9.
FIGURE 9.
Example of the developed nomogram helping early severity classification of mortality of a patient from the dataset collected at Dhaka Medical College.
FIGURE 10.
FIGURE 10.
Mortality risk prediction Web-Application .

Similar articles

Cited by

References

    1. (2021). Coronavirus Cases. [Online]. Available: https://www.worldometers.info/coronavirus/
    1. Krit B. L., Kuvshinov V. V., Kukushkin D. Y., Morozova N. B., Omel’chuk Y. A., Revenok T. V., and Sleptsov V. V., “The application of nanoclaster coatngs for modification of image receiving surface of thermophotoelectric energy converters,” Surf. Eng. Appl. Electrochem., vol. 56, no. 1, pp. 100–104, Jan. 2020.
    1. Zhu N., Zhang D., Wang W., Li X., Yang B., Song J., and Zhao X., “A novel coronavirus from patients with pneumonia in China, 2019,” New England J. Med., to be published. - PMC - PubMed
    1. Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., Zhang L., Fan G., Xu J., Gu X., Cheng Z., Yu T., Xia J., Wei Y., Wu W., Xie X., Yin W., and Li H., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China,” Lancet, vol. 395, no. 10223, pp. 497–506, May 2020. - PMC - PubMed
    1. Guan W., Ni Z.-Y., Hu Y., Liang W.-H., Ou C.-Q., He J.-X., Liu L., Shan H., Lei C.-L., Hui D. S. C., Du B., and Li L.-J., “Clinical characteristics of coronavirus disease 2019 in China,” New England J. Med., vol. 382, no. 18, pp. 1708–1720, 2020. - PMC - PubMed

LinkOut - more resources