Big data bioinformatics
- PMID: 24799088
- PMCID: PMC5604462
- DOI: 10.1002/jcp.24662
Big data bioinformatics
Erratum in
-
Erratum: Big Data Bioinformatics by C. S. Greene, J. Tan, M. Ung, J. H. Moore, and C. Cheng.J Cell Physiol. 2016 Jan;231(1):257. doi: 10.1002/jcp.25077. J Cell Physiol. 2016. PMID: 26414214 No abstract available.
Abstract
Recent technological advances allow for high throughput profiling of biological systems in a cost-efficient manner. The low cost of data generation is leading us to the "big data" era. The availability of big data provides unprecedented opportunities but also raises new challenges for data mining and analysis. In this review, we introduce key concepts in the analysis of big data, including both "machine learning" algorithms as well as "unsupervised" and "supervised" examples of each. We note packages for the R programming language that are available to perform machine learning analyses. In addition to programming based solutions, we review webservers that allow users with limited or no programming background to perform these analyses on large data compendia.
© 2014 Wiley Periodicals, Inc.
Figures
References
-
- The Cancer Genome Atlas. http://cancergenome.nih.gov/
-
- The R package “Cluster”. http://cran.r-project.org/web/packages/cluster/citation.html.
-
- Cheng C, Alexander R, Min R, Leng J, Yip KY, Rozowsky J, Yan KK, Dong X, Djebali S, Ruan Y, Davis CA, Carninci P, Lassman T, Gingeras TR, Guigo R, Birney E, Weng Z, Snyder M, Gerstein M. Understanding transcriptional regulation by integrative analysis of transcription factor binding data. Genome Res. 2012;22:1658–1667. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
