Robust and stable feature selection by integrating ranking methods and wrapper technique in genetic data classification
- PMID: 24657268
- DOI: 10.1016/j.bbrc.2014.02.146
Robust and stable feature selection by integrating ranking methods and wrapper technique in genetic data classification
Abstract
High dimensional data increase the dimension of space and consequently the computational complexity and result in lower generalization. From these types of classification problems microarray data classification can be mentioned. Microarrays contain genetic and biological data which can be used to diagnose diseases including various types of cancers and tumors. Having intractable dimensions, dimension reduction process is necessary on these data. The main goal of this paper is to provide a method for dimension reduction and classification of genetic data sets. The proposed approach includes different stages. In the first stage, several feature ranking methods are fused for enhancing the robustness and stability of feature selection process. Wrapper method is combined with the proposed hybrid ranking method to embed the interaction between genes. Afterwards, the classification process is applied using support vector machine. Before feeding the data to the SVM classifier the problem of imbalance classes of data in the training phase should be overcame. The experimental results of the proposed approach on five microarray databases show that the robustness metric of the feature selection process is in the interval of [0.70, 0.88]. Also the classification accuracy is in the range of [91%, 96%].
Keywords: Dimension reduction; Filter method; Imbalance classes; Microarray classification; Support vector machine; Wrapper method.
Copyright © 2014 Elsevier Inc. All rights reserved.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
