Big data to knowledge: common pitfalls in transcriptomics data analysis and representation
- PMID: 31385553
- PMCID: PMC6779380
- DOI: 10.1080/15476286.2019.1652525
Big data to knowledge: common pitfalls in transcriptomics data analysis and representation
Abstract
The omics technologies provide an invaluable opportunity to employ a global view towards human diseases. However, the appropriate translation of big data to knowledge remains a major challenge. In this study, we have performed quality control assessments for 91 transcriptomics datasets deposited in gene expression omnibus database and also have evaluated the publications derived from these datasets. This survey shows that drawbacks in the analyses and reports of transcriptomics studies are more common than one may assume. This report is concluded with some suggestions for researchers and reviewers to enhance the minimal requirements for gene expression data generation, analysis and report.
Keywords: Big data; data analysis; differentially expressed gene; quality control; transcriptomics.
Figures
References
-
- Wickham H. Ggplot2: elegant graphics for data analysis. Springer-Verlag New York 2009.
-
- R Core Team (2014). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
-
- Mootha VK, Lindgren CM, Eriksson KF, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003. July;34(3):267–273. - PubMed
-
- Rabieian R, Moein S, Khanahmad H, et al. Transcriptional noise in intact and TGF-beta treated human kidney cells; the importance of time-series designs. Cell Biol Int. 2018. Sep;42(9):1265-1269. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources