Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct 12:205:106270.
doi: 10.1016/j.knosys.2020.106270. Epub 2020 Jul 18.

A new COVID-19 Patients Detection Strategy (CPDS) based on hybrid feature selection and enhanced KNN classifier

Affiliations

A new COVID-19 Patients Detection Strategy (CPDS) based on hybrid feature selection and enhanced KNN classifier

Warda M Shaban et al. Knowl Based Syst. .

Abstract

COVID-19 infection is growing in a rapid rate. Due to unavailability of specific drugs, early detection of (COVID-19) patients is essential for disease cure and control. There is a vital need to detect the disease at early stage and instantly quarantine the infected people. Many research have been going on, however, none of them introduces satisfactory results yet. In spite of its simplicity, K-Nearest Neighbor (KNN) classifier has proven high flexibility in complex classification problems. However, it can be easily trapped. In this paper, a new COVID-19 diagnose strategy is introduced, which is called COVID-19 Patients Detection Strategy (CPDS). The novelty of CPDS is concentrated in two contributions. The first is a new hybrid feature selection Methodology (HFSM), which elects the most informative features from those extracted from chest Computed Tomography (CT) images for COVID-19 patients and non COVID-19 peoples. HFSM is a hybrid methodology as it combines evidence from both wrapper and filter feature selection methods. It consists of two stages, namely; Fast Selection Stage (FS2) and Accurate Selection Stage (AS2). FS2relies on filter, while AS2 uses Genetic Algorithm (GA) as a wrapper method. As a hybrid methodology, HFSM elects the significant features for the next detection phase. The second contribution is an enhanced K-Nearest Neighbor (EKNN) classifier, which avoids the trapping problem of the traditional KNN by adding solid heuristics in choosing the neighbors of the tested item. EKNN depends on measuring the degree of both closeness and strength of each neighbor of the tested item, then elects only the qualified neighbors for classification. Accordingly, EKNN can accurately detect infected patients with the minimum time penalty based on those significant features selected by HFSM technique. Extensive experiments have been done considering the proposed detection strategy as well as recent competitive techniques on the chest CT images. Experimental results have shown that the proposed detection strategy outperforms recent techniques as it introduces the maximum accuracy rate.

Keywords: COVID-19; Classification; Feature selection; KNN.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
A graphic representation of the rapid spike in infections.
Fig. 2
Fig. 2
Different types of KNN trapping.
Fig. 3
Fig. 3
Corona patients detection strategy.
Fig. 4
Fig. 4
The sequential steps of HFSM method.
Fig. 5
Fig. 5
EKNN training phase.
Fig. 6
Fig. 6
Illustrative example showing the followed steps during EKNN testing phase.
Fig. 7
Fig. 7
Accuracy of features selection methods using NB.
Fig. 8
Fig. 8
Error of features selection methods using NB.
Fig. 9
Fig. 9
Precision of features selection methods using NB.
Fig. 10
Fig. 10
Sensitivity of features selection methods using NB.
Fig. 11
Fig. 11
Macro-average of precision for features selection methods using NB.
Fig. 12
Fig. 12
Macro-average of recall for features selection methods using NB.
Fig. 13
Fig. 13
Micro-average of precision for features selection methods using NB.
Fig. 14
Fig. 14
Micro-average of recall for features selection methods using NB.
Fig. 15
Fig. 15
F-Measure of the different features selection methods using NB.
Fig. 16
Fig. 16
Run time of the different features selection methods using NB.
Fig. 17
Fig. 17
Accuracy of the different classification techniques.
Fig. 18
Fig. 18
Error of the different classification techniques.
Fig. 19
Fig. 19
Precision of the different classification techniques.
Fig. 20
Fig. 20
Sensitivity of the different classification techniques.
Fig. 21
Fig. 21
Macro-average precision of the different classification techniques.
Fig. 22
Fig. 22
Macro-average recall of the different classification techniques.
Fig. 23
Fig. 23
Micro-average precision of the different classification techniques.
Fig. 24
Fig. 24
Micro-average recall of the different classification techniques.
Fig. 25
Fig. 25
F-Measure of the different classification techniques.
Fig. 26
Fig. 26
Run time of the different classification techniques.
None
None

References

    1. Chung M., Bernheim A., Mei X., Zhang N. CT Imaging features of 2019 Novel Coronavirus (2019-nCoV) Radiology. 2019;275(1):202–207. - PMC - PubMed
    1. Narin A., Kaya C., Pamuk Z. 2020. Automatic detection of Coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks.arXiv:2003.10849 - PMC - PubMed
    1. Kong W., Agarwal P. Chest imaging appearance of COVID-19 infection. Radiology. 2020;2(1):1–22. - PMC - PubMed
    1. Conghy T., Pon B., Anderson E. 2020. When does hospital capacity get overwhelmed in USA? Germany? A model of beds needed and available for Coronavirus patients. trent.st.
    1. Thamilselvan P., Sathiaseelan J. A comparative study of data mining algorithms for image classification. Int. J. Educ. Manag. Eng. (IJEME) 2015;5(3):1–9.

LinkOut - more resources