ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data
- PMID: 17280775
- DOI: 10.1016/j.biosystems.2006.12.003
ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data
Abstract
An optimal design of support vector machine (SVM)-based classifiers for prediction aims to optimize the combination of feature selection, parameter setting of SVM, and cross-validation methods. However, SVMs do not offer the mechanism of automatic internal relevant feature detection. The appropriate setting of their control parameters is often treated as another independent problem. This paper proposes an evolutionary approach to designing an SVM-based classifier (named ESVM) by simultaneous optimization of automatic feature selection and parameter tuning using an intelligent genetic algorithm, combined with k-fold cross-validation regarded as an estimator of generalization ability. To illustrate and evaluate the efficiency of ESVM, a typical application to microarray classification using 11 multi-class datasets is adopted. By considering model uncertainty, a frequency-based technique by voting on multiple sets of potentially informative features is used to identify the most effective subset of genes. It is shown that ESVM can obtain a high accuracy of 96.88% with a small number 10.0 of selected genes using 10-fold cross-validation for the 11 datasets averagely. The merits of ESVM are three-fold: (1) automatic feature selection and parameter setting embedded into ESVM can advance prediction abilities, compared to traditional SVMs; (2) ESVM can serve not only as an accurate classifier but also as an adaptive feature extractor; (3) ESVM is developed as an efficient tool so that various SVMs can be used conveniently as the core of ESVM for bioinformatics problems.
Similar articles
-
ProLoc: prediction of protein subnuclear localization using SVM with automatic selection from physicochemical composition features.Biosystems. 2007 Sep-Oct;90(2):573-81. doi: 10.1016/j.biosystems.2007.01.001. Epub 2007 Jan 4. Biosystems. 2007. PMID: 17291684
-
Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers.Biosystems. 2007 Jul-Aug;90(1):78-86. doi: 10.1016/j.biosystems.2006.07.002. Epub 2006 Jul 10. Biosystems. 2007. PMID: 17291683
-
An evolutionary approach for gene selection and classification of microarray data based on SVM error-bound theories.Biosystems. 2010 Apr;100(1):39-46. doi: 10.1016/j.biosystems.2009.12.006. Epub 2010 Jan 4. Biosystems. 2010. PMID: 20045444
-
Advances in metaheuristics for gene selection and classification of microarray data.Brief Bioinform. 2010 Jan;11(1):127-41. doi: 10.1093/bib/bbp035. Epub 2009 Sep 29. Brief Bioinform. 2010. PMID: 19789265 Review.
-
Support vector machine applications in bioinformatics.Appl Bioinformatics. 2003;2(2):67-77. Appl Bioinformatics. 2003. PMID: 15130823 Review.
Cited by
-
A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data.Genom Data. 2016 Feb 23;8:4-15. doi: 10.1016/j.gdata.2016.02.012. eCollection 2016 Jun. Genom Data. 2016. PMID: 27081632 Free PMC article.
-
A framework model using multifilter feature selection to enhance colon cancer classification.PLoS One. 2021 Apr 16;16(4):e0249094. doi: 10.1371/journal.pone.0249094. eCollection 2021. PLoS One. 2021. PMID: 33861766 Free PMC article.
-
Automated classification of fMRI data employing trial-based imagery tasks.Med Image Anal. 2009 Jun;13(3):392-404. doi: 10.1016/j.media.2009.01.001. Epub 2009 Jan 16. Med Image Anal. 2009. PMID: 19233711 Free PMC article.
-
A hybrid BPSO-CGA approach for gene selection and classification of microarray data.J Comput Biol. 2012 Jan;19(1):68-82. doi: 10.1089/cmb.2010.0064. Epub 2011 Jan 6. J Comput Biol. 2012. PMID: 21210743 Free PMC article.
-
Co-ABC: Correlation artificial bee colony algorithm for biomarker gene discovery using gene expression profile.Saudi J Biol Sci. 2018 Jul;25(5):895-903. doi: 10.1016/j.sjbs.2017.12.012. Epub 2018 Jan 3. Saudi J Biol Sci. 2018. PMID: 30108438 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources