CloudNMF: a MapReduce implementation of nonnegative matrix factorization for large-scale biological datasets
- PMID: 23933456
- PMCID: PMC4411332
- DOI: 10.1016/j.gpb.2013.06.001
CloudNMF: a MapReduce implementation of nonnegative matrix factorization for large-scale biological datasets
Abstract
In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of a high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html.
Keywords: Bioinformatics; MapReduce; Nonnegative matrix factorization.
Copyright © 2013. Production and hosting by Elsevier Ltd.
Figures
References
-
- Lee D.D., Seung H.S. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401:788–791. - PubMed
-
- Gao Y., Church G. Improving molecular cancer class discovery through sparse non-negative matrix factorization. Bioinformatics. 2005;21:3970–3975. - PubMed
-
- Qi Q., Zhao Y., Li M., Simon R. Non-negative matrix factorization of gene expression profiles: a plug-in for BRB-ArrayTools. Bioinformatics. 2009;25:545–547. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
