Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr:98:104778.
doi: 10.1016/j.micpro.2023.104778. Epub 2023 Feb 6.

A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset

Affiliations

A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset

Nebojsa Bacanin et al. Microprocess Microsyst. 2023 Apr.

Abstract

Feature selection is one of the most important challenges in machine learning and data science. This process is usually performed in the data preprocessing phase, where the data is transformed to a proper format for further operations by machine learning algorithm. Many real-world datasets are highly dimensional with many irrelevant, even redundant features. These kinds of features do not improve classification accuracy and can even shrink down performance of a classifier. The goal of feature selection is to find optimal (or sub-optimal) subset of features that contain relevant information about the dataset from which machine learning algorithms can derive useful conclusions. In this manuscript, a novel version of firefly algorithm (FA) is proposed and adapted for feature selection challenge. Proposed method significantly improves performance of the basic FA, and also outperforms other state-of-the-art metaheuristics for both, benchmark bound-constrained and practical feature selection tasks. Method was first validated on standard unconstrained benchmarks and later it was applied for feature selection by using 21 standard University of California, Irvine (UCL) datasets. Moreover, presented approach was also tested for relatively novel COVID-19 dataset for predicting patients health, and one microcontroller microarray dataset. Results obtained in all practical simulations attest robustness and efficiency of proposed algorithm in terms of convergence, solutions' quality and classification accuracy. More precisely, the proposed approach obtained the best classification accuracy on 13 out of 21 total datasets, significantly outperforming other competitor methods.

Keywords: COVID-19 dataset; Feature selection; Firefly algorithm; Genetic operators; Quasi-reflection-based learning; Swarm intelligence.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Swarm plot diagrams — GOQRFA (green) vs. FA (red). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2
Fig. 2
Convergence speed graphs for 30-dimensional benchmarks — GOQRFA vs. FA.
Fig. 3
Fig. 3
Convergence speed graphs for 100-dimensional benchmarks — GOQRFA vs. FA.
Fig. 4
Fig. 4
Convergence graphs for benchmark datasets.
Fig. 5
Fig. 5
Box plot comparison: bGOQRFA vs. bFA.
Fig. 6
Fig. 6
Box plot comparison: bGOQRFA vs. bFA.
None

Similar articles

Cited by

References

    1. Luo S., Cheng L., Ren B. Practical swarm optimization based fault-tolerance algorithm for the Internet of Things. KSII Trans. Internet Inf. Syst. (TIIS) 2014;8(3):735–748.
    1. Wu Q., Ding G., Xu Y., Feng S., Du Z., Wang J., Long K. Cognitive Internet of Things: A new paradigm beyond connection. IEEE Internet Things J. 2014;1(2):129–143. doi: 10.1109/JIOT.2014.2311513. - DOI
    1. Messaoud S., Bradai A., Bukhari S.H.R., Quang P.T.A., Ahmed O.B., Atri M. A survey on machine learning in Internet of Things: Algorithms, strategies, and applications. Internet Things. 2020;12 doi: 10.1016/j.iot.2020.100314. URL https://www.sciencedirect.com/science/article/pii/S2542660520301451. - DOI
    1. Zenggang X., Mingyang Z., Xuemin Z., Sanyuan Z., Fang X., Xiaochao Z., Yunyun W., Xiang L. Social similarity routing algorithm based on socially aware networks in the big data environment. J. Signal Process. Syst. 2022;94(11):1253–1267.
    1. Chandrashekar G., Sahin F. A survey on feature selection methods. Comput. Electr. Eng. 2014;40(1):16–28. doi: 10.1016/j.compeleceng.2013.11.024. URL https://www.sciencedirect.com/science/article/pii/S0045790613003066, 40th-year commemorative issue. - DOI

LinkOut - more resources