Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 18;11(7):819.
doi: 10.3390/genes11070819.

Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm

Affiliations

Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm

Javed Zahoor et al. Genes (Basel). .

Abstract

A number of different feature selection and classification techniques have been proposed in literature including parameter-free and parameter-based algorithms. The former are quick but may result in local maxima while the latter use dataset-specific parameter-tuning for higher accuracy. However, higher accuracy may not necessarily mean higher reliability of the model. Thus, generalized optimization is still a challenge open for further research. This paper presents a warzone inspired "infiltration tactics" based optimization algorithm (ITO)-not to be confused with the ITO algorithm based on the Itõ Process in the field of Stochastic calculus. The proposed ITO algorithm combines parameter-free and parameter-based classifiers to produce a high-accuracy-high-reliability (HAHR) binary classifier. The algorithm produces results in two phases: (i) Lightweight Infantry Group (LIG) converges quickly to find non-local maxima and produces comparable results (i.e., 70 to 88% accuracy) (ii) Followup Team (FT) uses advanced tuning to enhance the baseline performance (i.e., 75 to 99%). Every soldier of the ITO army is a base model with its own independently chosen Subset selection method, pre-processing, and validation methods and classifier. The successful soldiers are combined through heterogeneous ensembles for optimal results. The proposed approach addresses a data scarcity problem, is flexible to the choice of heterogeneous base classifiers, and is able to produce HAHR models comparable to the established MAQC-II results.

Keywords: cancer; classification; clustering; computational intelligence; ensembles; infiltration; infiltration tactics optimization algorithm; machine learning; microarray.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
LIG(i) filtered-in accuracy, MCC, and ρ.
Figure 2
Figure 2
LIGEnsemble vs LIG(i) filtered-in accuracy, MCC, and ρ.
Figure 3
Figure 3
FT(i) filtered-in accuracy, MCC, and ρ.
Figure 4
Figure 4
FTEnsemble vs. FT(i) filtered-in accuracy, MCC, and ρ.
Figure 5
Figure 5
Overall improved results produced by ITO Algorithm on all datasets.
Figure 6
Figure 6
Dataset A—LIG(i) filtered-in training times.
Figure 7
Figure 7
Dataset B—LIG(i) filtered-in training times.
Figure 8
Figure 8
Dataset C—LIG(i) filtered-in training times.
Figure 9
Figure 9
Dataset A—FT(i) filtered-in training times.
Figure 10
Figure 10
Dataset B—FT(i) filtered-in training times.
Figure 11
Figure 11
Dataset C—FT(i) filtered-in training times.

References

    1. Alanni R., Hou J., Azzawi H., Xiang Y. Deep gene selection method to select genes from microarray datasets for cancer classification. BMC Bioinform. 2019;20:608. doi: 10.1186/s12859-019-3161-2. - DOI - PMC - PubMed
    1. Zhao Z., Morstatter F., Sharma S., Alelyani S., Anand A., Liu H. Advancing feature selection research. ASU Feature Sel. Repos. 2010:1–28.
    1. Elloumi M., Zomaya A.Y. Algorithms in Computational Molecular Biology: Techniques, Approaches and Applications. Volume 21 John Wiley & Sons; Hoboken, NJ, USA: 2011.
    1. Bolón-Canedo V., Sánchez-Marono N., Alonso-Betanzos A., Benítez J.M., Herrera F. A review of microarray datasets and applied feature selection methods. Inf. Sci. 2014;282:111–135. doi: 10.1016/j.ins.2014.05.042. - DOI
    1. Almugren N., Alshamlan H. A survey on hybrid feature selection methods in microarray gene expression data for cancer classification. IEEE Access. 2019;7:78533–78548. doi: 10.1109/ACCESS.2019.2922987. - DOI

MeSH terms

LinkOut - more resources