Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug;12(4):162-169.
doi: 10.1049/iet-syb.2017.0044.

Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy

Affiliations

Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy

Saleh Shahbeig et al. IET Syst Biol. 2018 Aug.

Abstract

Here, a two-phase search strategy is proposed to identify the biomarkers in gene expression data set for the prostate cancer diagnosis. A statistical filtering method is initially employed to remove the noisiest data. In the first phase of the search strategy, a multi-objective optimisation based on the binary particle swarm optimisation algorithm tuned by a chaotic method is proposed to select the optimal subset of genes with the minimum number of genes and the maximum classification accuracy. Finally, in the second phase of the search strategy, the cache-based modification of the sequential forward floating selection algorithm is used to find the most discriminant genes from the optimal subset of genes selected in the first phase. The results of applying the proposed algorithm on the available challenging prostate cancer data set demonstrate that the proposed algorithm can perfectly identify the informative genes such that the classification accuracy, sensitivity, and specificity of 100% are achieved with only nine biomarkers.

Keywords: available challenging prostate cancer data; biological organs; biomarkers; cancer; chaotic method; discriminant genes; feature extraction; gene expression data; gene expression feature selection; genetics; heuristic-deterministic search strategy; informative genes; multiobjective optimisation; noisiest data; optimisation; particle swarm optimisation; particle swarm optimisation algorithm; pattern classification; prostate cancer diagnosis; search problems; selection algorithm; statistical filtering method; two-phase search strategy.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Flowchart of the proposed cache‐based modification of the SFFS algorithm
Fig. 2
Fig. 2
Flowchart of the proposed gene selection algorithm
Fig. 3
Fig. 3
Result of applying the statistical gene selection method for gene ranking of the prostate cancer data set
Fig. 4
Fig. 4
Best result of applying first phase of the gene selection process (multi‐objective optimisation‐based phase) on the prostate cancer microarray data set
Fig. 5
Fig. 5
Final result of applying the proposed CMSFFS‐based phase on the optimal subset of genes selected in the first phase (the mean results achieved in the 100 trials)

Similar articles

Cited by

References

    1. Lv J., Peng Q., and Chen X. et al.: ‘A multi‐objective heuristic algorithm for gene expression microarray data classification’, Expert Syst. Appl., 2016, 59, pp. 13–19
    1. Sharma A., Imoto S., and Miyano S.: ‘A top‐R feature selection algorithm for microarray gene expression data’, IEEE/ACM Trans. Comput. Biol. Bioinform. (TCBB), 2012, 9, (3), pp. 754–764 - PubMed
    1. Nguyen T., Khosravi A., and Creighton D. et al.: ‘Hidden Markov models for cancer classification using gene expression profiles’, Inf. Sci., 2015, 316, pp. 293–307
    1. Wang L., and Han B.: ‘Hybrid feature selection method for gene expression analysis’, Electron. Lett., 2014, 50, (18), pp. 1269–1271
    1. Cui Y., Zheng C.‐H., and Yang J. et al.: ‘Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data’, Comput. Biol. Med., 2013, 43, (7), pp. 933–941 - PubMed

LinkOut - more resources