Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers
- PMID: 17291683
- DOI: 10.1016/j.biosystems.2006.07.002
Abstract
For the development of inexpensive diagnostic tests, it is essential to select a minimal number of relevant genes from microarray data while maximizing classification accuracy. However, simultaneously optimizing gene selection and classification accuracy is intractable because it is a large parameter optimization problem. We propose an efficient evolutionary approach to gene selection from microarray data that can be combined with the optimal design of various multiclass classifiers. The proposed method (named GeneSelect) consists of three closely cooperating parts: an efficient encoding scheme for candidate solutions, a generalized fitness function, and an intelligent genetic algorithm (IGA). An existing hybrid approach based on a genetic algorithm and maximum likelihood classification (GA/MLHD) has previously been proposed to select a small number of relevant genes for accurate classification of samples. To evaluate the performance of GeneSelect, its gene selection is combined with the same maximum likelihood classification (named IGA/MLHD) for convenient comparison. IGA/MLHD is applied to 11 cancer-related human gene expression datasets. The simulation results show that IGA/MLHD is superior to GA/MLHD in terms of the number of selected genes, classification accuracy, and the robustness of both the selected genes and the accuracy.
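The three parts named in the abstract (an encoding of candidate solutions, a fitness function, and a genetic search) can be illustrated with a minimal sketch. This is not the authors' GeneSelect, IGA, or MLHD: it assumes a plain genetic algorithm with a binary gene mask as the encoding, swaps in a leave-one-out nearest-centroid classifier for maximum likelihood classification, and uses a hypothetical fitness of accuracy minus a small per-gene penalty. All function names, parameters, and the toy data are illustrative assumptions.

```python
import random

def nearest_centroid_accuracy(X, y, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier on the chosen genes."""
    if not genes:
        return 0.0
    correct = 0
    for i in range(len(X)):
        # Build per-class sums from every sample except the held-out one.
        sums, counts = {}, {}
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:
                continue
            acc = sums.setdefault(label, [0.0] * len(genes))
            for k, g in enumerate(genes):
                acc[k] += row[g]
            counts[label] = counts.get(label, 0) + 1

        def dist(label):
            # Squared Euclidean distance from sample i to the class centroid.
            return sum((X[i][g] - sums[label][k] / counts[label]) ** 2
                       for k, g in enumerate(genes))

        correct += min(sums, key=dist) == y[i]
    return correct / len(X)

def fitness(mask, X, y, penalty=0.01):
    """Hypothetical fitness: accuracy minus a penalty per selected gene."""
    genes = [g for g, bit in enumerate(mask) if bit]
    return nearest_centroid_accuracy(X, y, genes) - penalty * len(genes)

def select_genes(X, y, pop_size=20, generations=30, mutation_rate=0.05, seed=0):
    """Plain GA over binary masks (one bit per gene); returns selected gene indices."""
    rng = random.Random(seed)
    n_genes = len(X[0])  # assumes at least two genes
    pop = [[rng.random() < 0.5 for _ in range(n_genes)] for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the better half unchanged (elitism), refill with offspring.
        parents = sorted(pop, key=lambda m: fitness(m, X, y), reverse=True)[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_genes)          # one-point crossover
            child = a[:cut] + b[cut:]
            # Flip each bit with probability mutation_rate.
            children.append([bit != (rng.random() < mutation_rate) for bit in child])
        pop = parents + children
    best = max(pop, key=lambda m: fitness(m, X, y))
    return [g for g, bit in enumerate(best) if bit]

# Toy data (made up): gene 0 separates the two classes, genes 1-5 are constant.
X = [[v, 1.0, 1.0, 1.0, 1.0, 1.0] for v in (0.0, 0.1, 0.2, 0.1, 5.0, 5.1, 4.9, 5.2)]
y = ["A"] * 4 + ["B"] * 4
selected = select_genes(X, y)
print(selected)
```

The per-gene penalty is one simple way to express the trade-off between a small gene set and high accuracy in a single score; the abstract's generalized fitness function may combine these objectives differently.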
Similar articles
- Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis. Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21. PMID: 16490299
- ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data. Biosystems. 2007 Sep-Oct;90(2):516-28. doi: 10.1016/j.biosystems.2006.12.003. Epub 2006 Dec 16. PMID: 17280775
- Ensemble gene selection by grouping for microarray data classification. J Biomed Inform. 2010 Feb;43(1):81-7. doi: 10.1016/j.jbi.2009.08.010. Epub 2009 Aug 20. PMID: 19699316
- Filter versus wrapper gene selection approaches in DNA microarray domains. Artif Intell Med. 2004 Jun;31(2):91-103. doi: 10.1016/j.artmed.2004.01.007. PMID: 15219288 Review.
- Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet. 2006 Jan;7(1):55-65. doi: 10.1038/nrg1749. PMID: 16369572 Review.
Cited by
- DQB: A novel dynamic quantitive classification model using artificial bee colony algorithm with application on gene expression profiles. Saudi J Biol Sci. 2018 Jul;25(5):932-946. doi: 10.1016/j.sjbs.2018.01.017. Epub 2018 Feb 9. PMID: 30108444 Free PMC article.
- Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data. BMC Bioinformatics. 2015 Feb 21;16:54. doi: 10.1186/s12859-015-0463-x. PMID: 25881029 Free PMC article.
- mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling. Biomed Res Int. 2015;2015:604910. doi: 10.1155/2015/604910. Epub 2015 Apr 15. PMID: 25961028 Free PMC article.
- Finding minimum gene subsets with heuristic breadth-first search algorithm for robust tumor classification. BMC Bioinformatics. 2012 Jul 25;13:178. doi: 10.1186/1471-2105-13-178. PMID: 22830977 Free PMC article.
- Combining MLC and SVM Classifiers for Learning Based Decision Making: Analysis and Evaluations. Comput Intell Neurosci. 2015;2015:423581. doi: 10.1155/2015/423581. Epub 2015 May 21. PMID: 26089862 Free PMC article.