Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
- PMID: 22373334
- PMCID: PMC3278848
- DOI: 10.1186/1471-2105-12-S13-S8
Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
Abstract
Background: Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patterns from DNA microarray experiments, as it efficiently reduces the dimension of gene expression data. Although several MF methods have been proposed for clustering gene expression patterns, a systematic evaluation has not been reported yet.
Results: Here we evaluated the clustering performance of orthogonal and non-orthogonal MFs by a total of nine measurements for performance in four gene expression datasets and one well-known dataset for clustering. Specifically, we employed a non-orthogonal MF algorithm, BSNMF (Bi-directional Sparse Non-negative Matrix Factorization), that applies bi-directional sparseness constraints superimposed on non-negative constraints, comprising a few dominantly co-expressed genes and samples together. Non-orthogonal MFs tended to show better clustering-quality and prediction-accuracy indices than orthogonal MFs as well as a traditional method, K-means. Moreover, BSNMF showed improved performance in these measurements. Non-orthogonal MFs including BSNMF showed also good performance in the functional enrichment test using Gene Ontology terms and biological pathways.
Conclusions: In conclusion, the clustering performance of orthogonal and non-orthogonal MFs was appropriately evaluated for clustering microarray data by comprehensive measurements. This study showed that non-orthogonal MFs have better performance than orthogonal MFs and K-means for clustering microarray data.
Figures





Similar articles
-
Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis.J Biomed Inform. 2008 Aug;41(4):602-6. doi: 10.1016/j.jbi.2007.12.003. Epub 2007 Dec 23. J Biomed Inform. 2008. PMID: 18234564
-
Exploring matrix factorization techniques for significant genes identification of Alzheimer's disease microarray gene expression data.BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2105-12-S5-S7. Epub 2011 Jul 27. BMC Bioinformatics. 2011. PMID: 21989140 Free PMC article.
-
Improving molecular cancer class discovery through sparse non-negative matrix factorization.Bioinformatics. 2005 Nov 1;21(21):3970-5. doi: 10.1093/bioinformatics/bti653. Bioinformatics. 2005. PMID: 16244221
-
Techniques for clustering gene expression data.Comput Biol Med. 2008 Mar;38(3):283-93. doi: 10.1016/j.compbiomed.2007.11.001. Epub 2007 Dec 3. Comput Biol Med. 2008. PMID: 18061589 Review.
-
Clustering approaches to identifying gene expression patterns from DNA microarray data.Mol Cells. 2008 Apr 30;25(2):279-88. Epub 2008 Mar 31. Mol Cells. 2008. PMID: 18414008 Review.
Cited by
-
Impact of the Choice of Normalization Method on Molecular Cancer Class Discovery Using Nonnegative Matrix Factorization.PLoS One. 2016 Oct 14;11(10):e0164880. doi: 10.1371/journal.pone.0164880. eCollection 2016. PLoS One. 2016. PMID: 27741311 Free PMC article.
-
Integrative analysis reveals disrupted pathways regulated by microRNAs in cancer.Nucleic Acids Res. 2018 Feb 16;46(3):1089-1101. doi: 10.1093/nar/gkx1250. Nucleic Acids Res. 2018. PMID: 29294105 Free PMC article.
-
Archetypal analysis of diverse Pseudomonas aeruginosa transcriptomes reveals adaptation in cystic fibrosis airways.BMC Bioinformatics. 2013 Sep 23;14:279. doi: 10.1186/1471-2105-14-279. BMC Bioinformatics. 2013. PMID: 24059747 Free PMC article.
-
Non-negative matrix factorization by maximizing correntropy for cancer clustering.BMC Bioinformatics. 2013 Mar 24;14:107. doi: 10.1186/1471-2105-14-107. BMC Bioinformatics. 2013. PMID: 23522344 Free PMC article.
-
Classification of breast cancer patients using somatic mutation profiles and machine learning approaches.BMC Syst Biol. 2016 Aug 26;10 Suppl 3(Suppl 3):62. doi: 10.1186/s12918-016-0306-z. BMC Syst Biol. 2016. PMID: 27587275 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Molecular Biology Databases