Artif Intell Med. 2021 Feb:112:102018.
doi: 10.1016/j.artmed.2021.102018. Epub 2021 Jan 15.

A novel computational method for assigning weights of importance to symptoms of COVID-19 patients

Mohammad A Alzubaidi et al. Artif Intell Med. 2021 Feb.

Abstract

Background and objective: The novel coronavirus disease 2019 (COVID-19) is considered a pandemic by the World Health Organization (WHO). As of April 3, 2020, 1,009,625 confirmed cases and 51,737 deaths had been reported. Doctors have faced a myriad of patients presenting with many different symptoms. This raises two important questions: what are the common symptoms, and what is their relative importance?

Methods: A non-structured and incomplete COVID-19 dataset of 14,251 confirmed cases was preprocessed. This produced a complete and organized COVID-19 dataset of 738 confirmed cases. Six different feature selection algorithms were then applied to this new dataset. Five of these algorithms have been proposed earlier in the literature. The sixth is a novel algorithm being proposed by the authors, called Variance Based Feature Weighting (VBFW), which not only ranks the symptoms (based on their importance) but also assigns a quantitative importance measure to each symptom.

Results: For our COVID-19 dataset, the five existing feature selection algorithms produced different rankings of the five most important symptoms, and even selected different symptoms for inclusion in the top five. This is because each of the five algorithms ranks the symptoms based on different data characteristics, and each has advantages and disadvantages. However, when the five rankings were aggregated (using two different aggregation methods), the two aggregations produced identical rankings of the five most important COVID-19 symptoms. From most to least important, they were: Fever/Cough, Fatigue, Sore Throat, and Shortness of Breath (Fever and Cough were ranked equally in both aggregations). Meanwhile, the novel Variance Based Feature Weighting algorithm chose the same top five symptoms but ranked Fever much higher than Cough, based on its quantitative importance measure for each symptom (Fever - 75 %, Cough - 39.8 %, Fatigue - 16.5 %, Sore Throat - 10.8 %, and Shortness of Breath - 6.6 %). Moreover, the proposed VBFW method achieved an accuracy of 92.1 % when used to build a one-class SVM model, and an NDCG@5 of 100 %.
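
As an illustration only, the following sketch shows how several rankings can be aggregated and how a rank-aware score such as NDCG@k is computed. The abstract does not name the two aggregation methods or the relevance grades used, so the Borda count, the function names (borda_aggregate, ndcg_at_k), the example rankings, and the graded relevance values below are all assumptions chosen for clarity, not the authors' implementation or data.

```python
import math
from collections import defaultdict


def borda_aggregate(rankings):
    """Sum Borda points over rankings (each listed best to worst).

    In a ranking of n items, the item at 0-based position i earns n - i points.
    Borda count is an assumed aggregator; the abstract does not name its methods.
    """
    scores = defaultdict(int)
    for ranking in rankings:
        n = len(ranking)
        for position, item in enumerate(ranking):
            scores[item] += n - position
    return dict(scores)


def ndcg_at_k(predicted, relevance, k):
    """NDCG@k of a predicted ordering against graded relevance scores."""
    def dcg(gains):
        # Gain at 0-based position i is discounted by log2(i + 2).
        return sum(g / math.log2(i + 2) for i, g in enumerate(gains))

    gains = [relevance.get(item, 0) for item in predicted[:k]]
    ideal = sorted(relevance.values(), reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(gains) / ideal_dcg if ideal_dcg > 0 else 0.0


# Hypothetical rankings from two selectors that disagree on the top symptom.
hypothetical_rankings = [
    ["fever", "cough", "fatigue"],
    ["cough", "fever", "fatigue"],
]
borda_scores = borda_aggregate(hypothetical_rankings)

# Hypothetical graded relevance (higher means more important).
hypothetical_relevance = {
    "fever": 5,
    "cough": 4,
    "fatigue": 3,
    "sore throat": 2,
    "shortness of breath": 1,
}
matching_order = ["fever", "cough", "fatigue", "sore throat", "shortness of breath"]
swapped_order = ["cough", "fever", "fatigue", "sore throat", "shortness of breath"]

ndcg_matching = ndcg_at_k(matching_order, hypothetical_relevance, 5)
ndcg_swapped = ndcg_at_k(swapped_order, hypothetical_relevance, 5)
```

In this toy setup the two disagreeing rankings give fever and cough the same Borda total, which is one way a tie such as "Fever/Cough" can arise in an aggregated ranking. An NDCG@5 of 1 (100 %) means the predicted top five appear in the same order as the ideal ordering; swapping the top two items lowers the score.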

Conclusions: Based on the dataset and the feature selection algorithms employed here, Fever, Cough, Fatigue, Sore Throat and Shortness of Breath are important symptoms of COVID-19. The VBFW algorithm also indicates that Fever and Cough were especially indicative of COVID-19 for the confirmed cases documented in our database.

Keywords: COVID-19; Feature selection; Importance weights; Important symptoms; Novel coronavirus.

Conflict of interest statement

The authors report no declarations of interest.

Figures

Fig. 1. VBFW Weights and Ranking.
Fig. 2. One-Class SVM Prediction Accuracy.
Fig. 3. Rank-Aware Evaluation using Normalized Discounted Cumulative Gain.
Fig. 4. VBFW Weights and Ranking for (a) Males and (b) Females.
Fig. 5. VBFW Weights and Ranking for People of Age (a) 0-14, (b) 15-49, (c) 50-64 and (d) ≥ 65.
Fig. 6. VBFW Weights and Ranking for People in (a) China, (b) Hong Kong, (c) Japan, (d) Malaysia, (e) South Korea and (f) Taiwan.
