Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 27;9(7):281.
doi: 10.3390/bioengineering9070281.

Development of a Smartphone-Based Expert System for COVID-19 Risk Prediction at Early Stage

Affiliations

Development of a Smartphone-Based Expert System for COVID-19 Risk Prediction at Early Stage

M Raihan et al. Bioengineering (Basel). .

Abstract

COVID-19 has imposed many challenges and barriers on traditional healthcare systems due to the high risk of being infected by the coronavirus. Modern electronic devices like smartphones with information technology can play an essential role in handling the current pandemic by contributing to different telemedical services. This study has focused on determining the presence of this virus by employing smartphone technology, as it is available to a large number of people. A publicly available COVID-19 dataset consisting of 33 features has been utilized to develop the aimed model, which can be collected from an in-house facility. The chosen dataset has 2.82% positive and 97.18% negative samples, demonstrating a high imbalance of class populations. The Adaptive Synthetic (ADASYN) has been applied to overcome the class imbalance problem with imbalanced data. Ten optimal features are chosen from the given 33 features, employing two different feature selection algorithms, such as K Best and recursive feature elimination methods. Mainly, three classification schemes, Random Forest (RF), eXtreme Gradient Boosting (XGB), and Support Vector Machine (SVM), have been applied for the ablation studies, where the accuracy from the XGB, RF, and SVM classifiers achieved 97.91%, 97.81%, and 73.37%, respectively. As the XGB algorithm confers the best results, it has been implemented in designing the Android operating system base and web applications. By analyzing 10 users' questionnaires, the developed expert system can predict the presence of COVID-19 in the human body of the primary suspect. The preprocessed data and codes are available on the GitHub repository.

Keywords: Android or web-based user applications; COVID-19 prediction; adaptive synthetic sampling; feature selection methods; machine learning classifiers.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1
The recommended workflow for the identification of COVID-19 in conjunction with the proposed user application.
Figure 2
Figure 2
Heat–map representation of the features, dispensing the correlations between them.
Figure 3
Figure 3
Flow chart of the working mechanism of the XGB algorithm in the XGB classifier. The highest accuracy was achieved by applying the XGB algorithm to the dataset.
Figure 4
Figure 4
Confusion matrices of (a) XGB, (b) RF, (c) SVM, (d) Twin-SVM with linear kernel, and (e) Twin-SVM with RBF kernel for evaluating their class-wise performance.
Figure 5
Figure 5
(a) ROC of RF, XGB, and SVM; and (b) bootstrap ROC analysis of XGB. The XGB classifier outperformed all the other algorithms.
Figure 6
Figure 6
(a) Box and Whisker plot of 10-fold CV of the main three ML classifiers and (b) a multi-comparison test of the three trained models in this article.
Figure 7
Figure 7
Feature importance and cumulative importance using XGB. The most important feature is loss_of_test, which is a noteworthy discovery from the dataset investigation. Then, (a) showed the selected 10 critical features, and (b) visualizes the cumulative feature importance curve, which also indicates that it grows to 95% above.
Figure 8
Figure 8
Violin plot of data distribution in terms of fever and cough.
Figure 9
Figure 9
Association rules (a) scatter plot, (b) support and lift, and (c) items in the left hand side (LHS) and the right hand side (RHS) group.
Figure 10
Figure 10
Screenshot of the designed web application for COVID-19 prognostication, deploying the proposed framework.
Figure 11
Figure 11
Several screenshots, such as (a) queries from users, (b) queries from users, and (c) predicted results of the implemented mobile application for COVID-19 prognostication, deploying the proposed framework.

Similar articles

Cited by

References

    1. Alshukry A., Ali H., Ali Y., Al-Taweel T., Abu-Farha M., AbuBaker J., Devarajan S., Dashti A.A., Bandar A., Taleb H., et al. Clinical characteristics of coronavirus disease 2019 (COVID-19) patients in Kuwait. PLoS ONE. 2020;15:e0242768. doi: 10.1371/journal.pone.0242768. - DOI - PMC - PubMed
    1. 210,214,749 Cases and 4,407,236 Deaths from the Coronavirus. Worldometer. 2021. [(accessed on 1 May 2022)]. Available online: https://www.worldometers.info/coronavirus.
    1. Raihan M., Bairagi A.K., Rahman S. A Machine Learning Based Study to Predict Depression with Monitoring Actigraph Watch Data; Proceedings of the 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT); Kharagpur, India. 6–8 July 2021; Piscataway, NJ, USA: IEEE; 2021. pp. 1–5.
    1. Raihan M., Islam M.T., Ghosh P., Hassan M.M., Angon J.H., Kabiraj S. Human Behavior Analysis using Association Rule Mining Techniques; Proceedings of the 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT); Kharagpur, India. 1–3 July 2020; Piscataway, NJ, USA: IEEE; 2020. pp. 1–5.
    1. Hasan M.K., Alam M.A., Dahal L., Elahi M.T.E., Roy S., Wahid S.R., Marti R., Khanal B. Challenges of deep learning methods for COVID-19 detection using public datasets. Inform. Med. Unlocked. 2020;30:100945. doi: 10.1016/j.imu.2022.100945. - DOI - PMC - PubMed