Building interpretable fuzzy models for high dimensional data analysis in cancer diagnosis
- PMID: 21989191
- PMCID: PMC3194236
- DOI: 10.1186/1471-2164-12-S2-S5
Building interpretable fuzzy models for high dimensional data analysis in cancer diagnosis
Abstract
Background: Analysing gene expression data from microarray technologies is a very important task in biology and medicine, and particularly in cancer diagnosis. Different from most other popular methods in high dimensional bio-medical data analysis, such as microarray gene expression or proteomics mass spectroscopy data analysis, fuzzy rule-based models can not only provide good classification results, but also easily be explained and interpreted in human understandable terms, by using fuzzy rules. However, the advantages offered by fuzzy-based techniques in microarray data analysis have not yet been fully explored in the literature. Although some recently developed fuzzy-based modeling approaches can provide satisfactory classification results, the rule bases generated by most of the reported fuzzy models for gene expression data are still too large to be easily comprehensible.
Results: In this paper, we develop some Multi-Objective Evolutionary Algorithms based Interpretable Fuzzy (MOEAIF) methods for analysing high dimensional bio-medical data sets, such as microarray gene expression data and proteomics mass spectroscopy data. We mainly focus on evaluating our proposed models on microarray gene expression cancer data sets, i.e., the lung cancer data set and the colon cancer data set, but we extend our investigations to other type of cancer data set, such as the ovarian cancer data set. The experimental studies have shown that relatively simple and small fuzzy rule bases, with satisfactory classification performance, can be successfully obtained for challenging microarray gene expression datasets.
Conclusions: We believe that fuzzy-based techniques, and in particular the methods proposed in this paper, can be very useful tools in dealing with high dimensional cancer data. We also argue that the potential of applying fuzzy-based techniques to microarray data analysis need to be further explored.
Figures



Similar articles
-
Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis.Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21. Biosystems. 2006. PMID: 16490299
-
Data mining of gene expression data by fuzzy and hybrid fuzzy methods.IEEE Trans Inf Technol Biomed. 2010 Jan;14(1):23-9. doi: 10.1109/TITB.2009.2033590. Epub 2009 Oct 20. IEEE Trans Inf Technol Biomed. 2010. PMID: 19846381
-
Hybrid Ant Bee Algorithm for Fuzzy Expert System Based Sample Classification.IEEE/ACM Trans Comput Biol Bioinform. 2014 Mar-Apr;11(2):347-60. doi: 10.1109/TCBB.2014.2307325. IEEE/ACM Trans Comput Biol Bioinform. 2014. PMID: 26355782
-
Techniques for clustering gene expression data.Comput Biol Med. 2008 Mar;38(3):283-93. doi: 10.1016/j.compbiomed.2007.11.001. Epub 2007 Dec 3. Comput Biol Med. 2008. PMID: 18061589 Review.
-
Increasing the efficiency of fuzzy logic-based gene expression data analysis.Physiol Genomics. 2003 Apr 16;13(2):107-17. doi: 10.1152/physiolgenomics.00097.2002. Physiol Genomics. 2003. PMID: 12595578 Review.
Cited by
-
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data.Sci Rep. 2023 Jan 9;13(1):456. doi: 10.1038/s41598-022-27132-8. Sci Rep. 2023. PMID: 36624117 Free PMC article.
-
Data-derived modeling characterizes plasticity of MAPK signaling in melanoma.PLoS Comput Biol. 2014 Sep 4;10(9):e1003795. doi: 10.1371/journal.pcbi.1003795. eCollection 2014 Sep. PLoS Comput Biol. 2014. PMID: 25188314 Free PMC article.
References
-
- Alon U, Barkai N, Notterman D, Gish K, Ybarra S, Mack D, Levine A. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natnl. Acad. Sci. USA. 1999;96(12):6745– 6750. doi: 10.1073/pnas.96.12.6745. - DOI - PMC - PubMed
-
- Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. 1999;286:531–537. doi: 10.1126/science.286.5439.531. - DOI - PubMed
-
- Hong JH, Cho SB. Gene boosting for cancer classification based on gene expression profiles. Pattern Recogn. 2009;42(9):1761–1767. doi: 10.1016/j.patcog.2009.01.006. - DOI
-
- Kohonen T. Self-organizing maps. Secaucus, NJ, USA: Springer-Verlag New York, Inc; 1997.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources