Jupyter and Galaxy: Easing entry barriers into complex data analyses for biomedical researchers
- PMID: 28542180
- PMCID: PMC5444614
- DOI: 10.1371/journal.pcbi.1005425
Jupyter and Galaxy: Easing entry barriers into complex data analyses for biomedical researchers
Abstract
What does it take to convert a heap of sequencing data into a publishable result? First, common tools are employed to reduce primary data (sequencing reads) to a form suitable for further analyses (i.e., the list of variable sites). The subsequent exploratory stage is much more ad hoc and requires the development of custom scripts and pipelines, making it problematic for biomedical researchers. Here, we describe a hybrid platform combining common analysis pathways with the ability to explore data interactively. It aims to fully encompass and simplify the "raw data-to-publication" pathway and make it reproducible.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
References
-
- Fleury V, Gouyet JF, Leonetti M. Branching in Nature. Dynamics and Morphogenesis of Branching Structures, from Cell to River Networks. Springer Science & Business Media; 2013. Available from: http://books.google.com/books?id=WKXyCAAAQBAJ&pg=PR6&dq=intitle:branchin....
-
- van der Walt S, Colbert SC, Varoquaux G. The NumPy Array: A Structure for Efficient Numerical Computation. Comput Sci Eng. 2011;13(2):22–30.
-
- Jones E, Oliphant T, Peterson P. SciPy: Open source scientific tools for Python, 2001-2008b;. Available from: https://www.scipy.org/
-
- Hunter JD. Matplotlib: A 2D Graphics Environment. Comput Sci Eng. 2007;9(3):90–95.
-
- Sloggett C, Goonasekera N, Afgan E. BioBlend: automating pipeline analyses within Galaxy and CloudMan. Bioinformatics. 2013;29(13):1685–1686. doi: 10.1093/bioinformatics/btt199 - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
