Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Apr 18;446(4):850-6.
doi: 10.1016/j.bbrc.2014.02.146. Epub 2014 Mar 19.

Robust and stable feature selection by integrating ranking methods and wrapper technique in genetic data classification

Affiliations

Robust and stable feature selection by integrating ranking methods and wrapper technique in genetic data classification

Maryam Yassi et al. Biochem Biophys Res Commun. .

Abstract

High dimensional data increase the dimension of space and consequently the computational complexity and result in lower generalization. From these types of classification problems microarray data classification can be mentioned. Microarrays contain genetic and biological data which can be used to diagnose diseases including various types of cancers and tumors. Having intractable dimensions, dimension reduction process is necessary on these data. The main goal of this paper is to provide a method for dimension reduction and classification of genetic data sets. The proposed approach includes different stages. In the first stage, several feature ranking methods are fused for enhancing the robustness and stability of feature selection process. Wrapper method is combined with the proposed hybrid ranking method to embed the interaction between genes. Afterwards, the classification process is applied using support vector machine. Before feeding the data to the SVM classifier the problem of imbalance classes of data in the training phase should be overcame. The experimental results of the proposed approach on five microarray databases show that the robustness metric of the feature selection process is in the interval of [0.70, 0.88]. Also the classification accuracy is in the range of [91%, 96%].

Keywords: Dimension reduction; Filter method; Imbalance classes; Microarray classification; Support vector machine; Wrapper method.

PubMed Disclaimer

Similar articles

Cited by

LinkOut - more resources