Biomarker identification by feature wrappers
- PMID: 11691853
- PMCID: PMC311150
- DOI: 10.1101/gr.190001
Biomarker identification by feature wrappers
Abstract
Gene expression studies bridge the gap between DNA information and trait information by dissecting biochemical pathways into intermediate components between genotype and phenotype. These studies open new avenues for identifying complex disease genes and biomarkers for disease diagnosis and for assessing drug efficacy and toxicity. However, the majority of analytical methods applied to gene expression data are not efficient for biomarker identification and disease diagnosis. In this paper, we propose a general framework to incorporate feature (gene) selection into pattern recognition in the process to identify biomarkers. Using this framework, we develop three feature wrappers that search through the space of feature subsets using the classification error as measure of goodness for a particular feature subset being "wrapped around": linear discriminant analysis, logistic regression, and support vector machines. To effectively carry out this computationally intensive search process, we employ sequential forward search and sequential forward floating search algorithms. To evaluate the performance of feature selection for biomarker identification we have applied the proposed methods to three data sets. The preliminary results demonstrate that very high classification accuracy can be attained by identified composite classifiers with several biomarkers.
Figures




Comment in
-
Bringing out the best features of expression data.Genome Res. 2001 Nov;11(11):1801-2. doi: 10.1101/gr.215501. Genome Res. 2001. PMID: 11691842 No abstract available.
Similar articles
-
Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases.Health Inf Sci Syst. 2013 May 30;1:10. doi: 10.1186/2047-2501-1-10. eCollection 2013. Health Inf Sci Syst. 2013. PMID: 26042184 Free PMC article.
-
Incremental forward feature selection with application to microarray gene expression data.J Biopharm Stat. 2008;18(5):827-40. doi: 10.1080/10543400802277868. J Biopharm Stat. 2008. PMID: 18781519
-
Assessment of feature selection and classification approaches to enhance information from overnight oximetry in the context of apnea diagnosis.Int J Neural Syst. 2013 Oct;23(5):1350020. doi: 10.1142/S0129065713500202. Epub 2013 Jul 3. Int J Neural Syst. 2013. PMID: 23924411
-
What should be expected from feature selection in small-sample settings.Bioinformatics. 2006 Oct 1;22(19):2430-6. doi: 10.1093/bioinformatics/btl407. Epub 2006 Jul 26. Bioinformatics. 2006. PMID: 16870934
-
A novel feature selection approach for biomedical data classification.J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30. J Biomed Inform. 2010. PMID: 19647098
Cited by
-
Gene selection for classification of microarray data based on the Bayes error.BMC Bioinformatics. 2007 Oct 3;8(1):370. doi: 10.1186/1471-2105-8-370. BMC Bioinformatics. 2007. PMID: 17915022 Free PMC article.
-
Ensemble Classification of Cancer Types and Biomarker Identification.Drug Dev Res. 2012 Nov;73(7):414-419. doi: 10.1002/ddr.21032. Drug Dev Res. 2012. PMID: 25221378 Free PMC article.
-
Machine learning-based biomarkers identification from toxicogenomics - Bridging to regulatory relevant phenotypic endpoints.J Hazard Mater. 2022 Feb 5;423(Pt B):127141. doi: 10.1016/j.jhazmat.2021.127141. Epub 2021 Sep 11. J Hazard Mater. 2022. PMID: 34560480 Free PMC article.
-
Three methods for optimization of cross-laboratory and cross-platform microarray expression data.Nucleic Acids Res. 2007;35(10):e72. doi: 10.1093/nar/gkl1133. Epub 2007 May 3. Nucleic Acids Res. 2007. PMID: 17478523 Free PMC article.
-
Functional dissociation between anterior and posterior temporal cortical regions during retrieval of remote memory.J Neurosci. 2012 Jul 11;32(28):9659-70. doi: 10.1523/JNEUROSCI.5553-11.2012. J Neurosci. 2012. PMID: 22787051 Free PMC article.
References
-
- Allgayer H, Heiss MM, Schildberg FW. Prognostic factors in gastric cancer. Br J Surg. 1997;84:1651–1664. - PubMed
-
- Brazma A, Vilo J. Gene expression data analysis. FEBS Lett. 2000;480:17–24. - PubMed
-
- Brien TP, Depowski PL, Sheeehan CE, Ross JS, McKenna BJ. Prognostic factors in gastric cancer. Mol Pathol. 1998;11:870–877. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous