Analyzing microarray data using quantitative association rules
- PMID: 16204090
- DOI: 10.1093/bioinformatics/bti1121
Abstract
Motivation: We tackle the problem of finding regularities in microarray data. Various data mining tools, such as clustering, classification, Bayesian networks and association rules, have been applied so far to gain insight into gene-expression data. The association rule mining techniques used so far work on discretizations of the data and cannot account for cumulative effects. In this paper, we investigate the use of quantitative association rules that can operate directly on numeric data and represent cumulative effects of variables. Technically speaking, this type of quantitative association rule, based on half-spaces, can find non-axis-parallel regularities.
Results: We performed a variety of experiments testing the utility of quantitative association rules for microarray data. First of all, the results should be statistically significant and robust against fluctuations in the data. Next, the approach should be scalable in the number of variables, which is important for such high-dimensional data. Finally, the rules should make sense biologically and be sufficiently different from rules found in regular association rule mining working with discretizations. In all of these dimensions, the proposed approach performed satisfactorily. Therefore, quantitative association rules based on half-spaces should be considered as a tool for the analysis of microarray gene-expression data.
Availability: The code is available from the authors on request.
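Illustration: the abstract describes rules built from half-spaces, that is, weighted sums of expression values compared against a threshold. The following is a minimal sketch of how the support and confidence of one such rule could be evaluated on numeric data. It is not the authors' code or mining algorithm: the function name `half_space_rule_stats`, the toy matrix, the gene labels g0, g1, g2, and all weights and thresholds are hypothetical and chosen only for illustration.

```python
import numpy as np


def half_space_rule_stats(X, w_left, b_left, w_right, b_right):
    """Support and confidence of the rule
    (X @ w_left >= b_left)  ->  (X @ w_right >= b_right).

    Hypothetical helper: it only scores a given rule, it does not mine rules.
    """
    X = np.asarray(X, dtype=float)
    left = X @ np.asarray(w_left, dtype=float) >= b_left      # antecedent half-space
    right = X @ np.asarray(w_right, dtype=float) >= b_right   # consequent half-space
    both = left & right
    support = both.mean()  # fraction of samples satisfying both sides
    # Confidence is undefined when no sample satisfies the antecedent.
    confidence = both.sum() / left.sum() if left.any() else float("nan")
    return support, confidence


# Toy expression matrix (assumed data): rows = samples, columns = genes g0, g1, g2.
X = np.array([
    [2.0, 1.5, 3.0],
    [0.5, 0.2, 0.1],
    [1.8, 0.1, 0.4],
    [0.3, 1.9, 2.5],
    [1.0, 1.2, 2.2],
])

# Rule: g0 + g1 >= 2.0  ->  g2 >= 2.0
support, confidence = half_space_rule_stats(X, [1, 1, 0], 2.0, [0, 0, 1], 2.0)
print(support, confidence)
```

In this toy data, the last two samples satisfy the antecedent only through the sum g0 + g1; neither gene reaches 2.0 on its own. A rule over discretized single-gene intervals would not capture those samples with one condition, which is the kind of cumulative, non-axis-parallel effect the abstract refers to.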