Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 1:213:118935.
doi: 10.1016/j.eswa.2022.118935. Epub 2022 Oct 3.

Equilibrium-based COVID-19 diagnosis from routine blood tests: A sparse deep convolutional model

Affiliations

Equilibrium-based COVID-19 diagnosis from routine blood tests: A sparse deep convolutional model

Doaa A Altantawy et al. Expert Syst Appl. .

Abstract

SARS-CoV2 (COVID-19) is the virus that causes the pandemic that has severely impacted human society with a massive death toll worldwide. Hence, there is a persistent need for fast and reliable automatic tools to help health teams in making clinical decisions. Predictive models could potentially ease the strain on healthcare systems by early and reliable screening of COVID-19 patients which helps to combat the spread of the disease. Recent studies have reported some key advantages of employing routine blood tests for initial screening of COVID-19 patients. Thus, in this paper, we propose a novel COVID-19 prediction model based on routine blood tests. In this model, we depend on exploiting the real dependency among the employed feature pool by a sparsification procedure. In this sparse domain, a hybrid feature selection mechanism is proposed. This mechanism fuses the selected features from two perspectives, the first is Pearson correlation and the second is a new Minkowski-based equilibrium optimizer (MEO). Then, the selected features are fed into a new 1D Convolutional Neural Network (1DCNN) for a final diagnosis decision. The proposed prediction model is tested with a new public dataset from San Raphael Hospital, Milan, Italy, i.e., OSR dataset which has two sub-datasets. According to the experimental results, the proposed model outperforms the state-of-the-art techniques with an average testing accuracy of 98.5% while we employ only less than half the size of the feature pool, i.e., we need only less than half the given blood tests in the employed dataset to get a final diagnosis decision.

Keywords: 1DCNN; Blood tests; COVID-19; Equilibrium optimization; Feature pool sparsification; Feature selection; Pearson correlation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
A logarithmic scale for COVID-19 Monthly total deaths (Worldometer, 2020).
Fig. 2
Fig. 2
An illustration of the proposed COVID-19 prediction model.
Fig. 3
Fig. 3
COVID-19 examination results for COVID-specific dataset in (a) and for CBC dataset in (b).
Fig. 4
Fig. 4
COVID-19 swab result distribution according to age and gender for COVID-specific dataset in (a) and for CBC dataset in (b).
Fig. 5
Fig. 5
3D Visualization of the predicted outliers/inliers in COVID-specific dataset via three PCA components.
Fig. 6
Fig. 6
An illustration of the proposed feature selection technique that is based on a fusion process between Pearson dropping (PCC) and the introduced Minkowski-based equilibrium optimizer (MEO) in a serial and parallel manner in the original features domain F once and in the proposed sparse domain FS another. "+" represents combining decisions by OR operations while "×" represents seeking the intersections of decisions by AND operations.
Fig. 7
Fig. 7
Pairwise Pearson correlation of features: (a), (c) for the original feature pool F while (b), (d) for the sparsified feature pool FS. The first row for COVID-specific dataset and the second one for CBC dataset.
Fig. 8
Fig. 8
2D illustration of Equilibrium candidates’ collaboration in updating particles’ concentration.
Fig. 9
Fig. 9
Flow chart of the proposed MEO algorithm.
Fig. 10
Fig. 10
Comparison of the results of average fitness over iterations for the traditional EO, in the first row, and the proposed MEO, in the second one, for COVID-specific dataset. The first column is the results of the original feature pool F while the second one for the sparsified feature pool FS.
Fig. 11
Fig. 11
An example of 1DCNN model for a binary classification problem. In this example, the network consists of two convolutional layers (Conv_1with 32 filters and Conv_2 with 64 filters), Max pooling layer, flattening layer and finally some fully connected layers with soft-max layer.
Fig. 12
Fig. 12
Summary of the proposed 1DCNN for COVID-19 prediction considering 9 selected features.
Fig. 13
Fig. 13
The employed evaluation metrics.
Fig. 14
Fig. 14
Confusion matrices of testing the proposed COVID-19 prediction algorithm adopting the four cases indicated in Table 4 showing the effect of SMOTE and iForest on the performance.
Fig. 15
Fig. 15
AdaBoost feature importance employing all features for COVID-specific dataset in the first row and CBC dataset in the second row. (a) and (c) in the features original domain while (b) and (d) in the sparse domain.
Fig. 16
Fig. 16
AdaBoost feature importance, for COVID-specific dataset, adopting the followings: 1. PCC-based feature selection in the features original domain (22 feature selected) while applying training and testing once for the original samples in F (a), and another for the sparse samples in Fs (b). 2. PCC-based feature selection in sparse domain (14 feature selected) while applying training and testing once for the original samples in F (c), and another for the sparse samples in Fs (d). 3. MEO-based feature selection in features original domain (12 feature selected) while applying training and testing once for the original samples in F (e), and another for the sparse samples in Fs (f). 4. PCC-based feature selection in sparse domain (12 feature selected) while applying training and testing once for the original samples in F (g), and another for the sparse samples in Fs (h).
Fig. 17
Fig. 17
Classification reports of testing the proposed COVID diagnosis model based on the proposed fused selection method and 1DCNN in both original domain (a), (c) and sparse domain (b), (d). The first two rows belong to COVID-specific dataset while the other rows belong to CBC dataset.
Fig. 18
Fig. 18
Training-validation performance in terms of accuracy for the proposed COVID prediction algorithm. The first row for COVID-specific dataset and the other one for CBC dataset. The training in (a), (c) is performed in features original domain and the others (b), and (d) in sparse domain. The training is performed over the selected features by the proposed fused-based feature selection mechanism which results 13 features for COVID-specific dataset and 6 features for CBC-dataset.
Fig. 19
Fig. 19
Classification reports of testing the following studies: (Alakus & Turkoglu, 2020) {18/33–13/13} as (a), (AlJame et al., 2020) {18/33–13/13} as (b), (Cabitza et al., 2021) {33/33–13/13} as (c), (Brinati et al., 2020) {33/33–13/13} as (d), (Shaban et al., 2021) {33/33–13/13} as (e), and Ours {13/33–6/13} as (f). {} denotes {selected features/total no. of features for COVID-specific dataset – selected features/total no. of features for CBC dataset. (?.1) for COVID-specific dataset and (?.2) for CBC dataset.

Similar articles

Cited by

References

    1. Ai T., Yang Z., Hou H., Zhan C., Chen C., Lv W.…Xia L. Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: A report of 1014 cases. Radiology. 2020 doi: 10.3410/f.737441336.793572890. - DOI - PMC - PubMed
    1. Alafif T., Tehame A.M., Bajaba S., Barnawi A., Zia S. Machine and deep learning towards COVID-19 diagnosis and treatment: Survey, challenges, and future directions. International Journal of Environmental Research and Public Health. 2021;18(3):1117. doi: 10.31224/osf.io/w3zxy. - DOI - PMC - PubMed
    1. Alakus T.B., Turkoglu I. Comparison of deep learning approaches to predict COVID-19 infection. Chaos, Solitons & Fractals. 2020;140 doi: 10.1016/j.chaos.2020.110120. - DOI - PMC - PubMed
    1. Albahri, O. S., Zaidan, A. A., Albahri, A. S., Zaidan, B. B., Abdulkareem, K. H., Al-Qaysi, Z. T., ... & Rashid, N. A. (2020). Systematic review of artificial intelligence techniques in the detection and classification of COVID-19 medical images in terms of evaluation and benchmarking: Taxonomy analysis, challenges, future solutions and methodological aspects. Journal of Infection and Public Health, 13(10), 1381-1396.https://doi.org/10.1016/j.jiph.2020.06.028. - PMC - PubMed
    1. Alballa N., Al-Turaiki I. Machine learning approaches in COVID-19 diagnosis, mortality, and severity risk prediction: A review. Informatics in Medicine Unlocked. 2021;24 doi: 10.1016/j.imu.2021.100564. - DOI - PMC - PubMed

LinkOut - more resources