KMeans greedy search hybrid algorithm for biclustering gene expression data
- PMID: 20865500
- DOI: 10.1007/978-1-4419-5913-3_21
KMeans greedy search hybrid algorithm for biclustering gene expression data
Abstract
Microarray technology demands the development of algorithms capable of extracting novel and useful patterns like biclusters. A bicluster is a submatrix of the gene expression datamatrix such that the genes show highly correlated activities across all conditions in the submatrix. A measure called Mean Squared Residue (MSR) is used to evaluate the coherence of rows and columns within the submatrix. In this paper, the KMeans greedy search hybrid algorithm is developed for finding biclusters from the gene expression data. This algorithm has two steps. In the first step, high quality bicluster seeds are generated using KMeans clustering algorithm. In the second step, these seeds are enlarged by adding more genes and conditions using the greedy strategy. Here, the objective is to find the biclusters with maximum size and the MSR value lower than a given threshold. The biclusters obtained from this algorithm on both the bench mark datasets are of high quality. The statistical significance and biological relevance of the biclusters are verified using gene ontology database.
Similar articles
-
Comparative advantages of novel algorithms using MSR threshold and MSR difference threshold for biclustering gene expression data.Adv Exp Med Biol. 2011;696:123-34. doi: 10.1007/978-1-4419-7046-6_13. Adv Exp Med Biol. 2011. PMID: 21431553
-
Biclustering of gene expression data using reactive greedy randomized adaptive search procedure.BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S27. doi: 10.1186/1471-2105-10-S1-S27. BMC Bioinformatics. 2009. PMID: 19208127 Free PMC article.
-
Parallelized evolutionary learning for detection of biclusters in gene expression data.IEEE/ACM Trans Comput Biol Bioinform. 2012;9(2):560-70. doi: 10.1109/TCBB.2011.53. Epub 2011 Mar 3. IEEE/ACM Trans Comput Biol Bioinform. 2012. PMID: 21383419
-
Biclustering on expression data: A review.J Biomed Inform. 2015 Oct;57:163-80. doi: 10.1016/j.jbi.2015.06.028. Epub 2015 Jul 6. J Biomed Inform. 2015. PMID: 26160444 Review.
-
Biclustering data analysis: a comprehensive survey.Brief Bioinform. 2024 May 23;25(4):bbae342. doi: 10.1093/bib/bbae342. Brief Bioinform. 2024. PMID: 39007596 Free PMC article. Review.
Cited by
-
Configurable pattern-based evolutionary biclustering of gene expression data.Algorithms Mol Biol. 2013 Feb 23;8(1):4. doi: 10.1186/1748-7188-8-4. Algorithms Mol Biol. 2013. PMID: 23433178 Free PMC article.
-
Running in the wheel: Defining individual severity levels in mice.PLoS Biol. 2018 Oct 18;16(10):e2006159. doi: 10.1371/journal.pbio.2006159. eCollection 2018 Oct. PLoS Biol. 2018. PMID: 30335759 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Molecular Biology Databases