Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 27;9(5):508-514.e3.
doi: 10.1016/j.cels.2019.08.007. Epub 2019 Sep 11.

Building Containerized Workflows Using the BioDepot-Workflow-Builder

Affiliations

Building Containerized Workflows Using the BioDepot-Workflow-Builder

Ling-Hong Hung et al. Cell Syst. .

Abstract

We present the BioDepot-workflow-builder (Bwb), a software tool that allows users to create and execute reproducible bioinformatics workflows using a drag-and-drop interface. Graphical widgets represent Docker containers executing a modular task. Widgets are linked graphically to build bioinformatics workflows that can be reproducibly deployed across different local and cloud platforms. Each widget contains a form-based user interface to facilitate parameter entry and a console to display intermediate results. Bwb provides tools for rapid customization of widgets, containers, and workflows. Saved workflows can be shared using Bwb's native format or exported as shell scripts.

Keywords: Docker; RNA sequencing; bioinformatics workflows; reproducibility of research; software development.

PubMed Disclaimer

Conflict of interest statement

Declaration of Interests

Ka Yee Yeung is also affiliated with the Department of Microbiology, University of Washington, Seattle, WA.

Figures

Figure 1.
Figure 1.
Bwb interface: The basic elements of the Bwb interface are shown. When the Bwb application is launched and the user connects to the application using a browser, a window consisting of a canvas and a Tool Dock appears. Modules (widgets) are dragged from the Tool Dock and dropped onto the canvas and linked together to form a workflow, in this case, the kallisto-sleuth (Pimentel et al., 2017) workflow. Right-clicking on a module brings up a set of forms that allow the user to edit the properties of the widget such as the parameters to be queried and the command(s) to be executed. Double clicking allows the user to enter parameter values and execute the workflow.
Figure 2.
Figure 2.
Kallisto-sleuth workflow: This workflow illustrates how Bwb is used to execute the kallisto-sleuth workflow. The leftmost widgets in the workflow download support files and set up the indices, whereas the later downstream widgets are responsible for pseudo-alignment and determination of differentially expressed genes (DEGs). When a widget completes its task, it sends signals to downstream connected widgets to trigger their execution. Execution can be started at any widget. For example, after downloading support files and generating indices once, the user can start at the pipeline at the kallisto pseudo alignment if he or she wishes to analyze a new set of reads. In this workflow, the final widget displays a Firefox window inside Bwb to allow interaction with the Jupyter notebook containing the R script that calls Sleuth. The user can re-run the code cells, adjust parameters and visualize results without leaving the Bwb sandbox. When the user is satisfied, the notebook can be saved to the host computer.

Similar articles

Cited by

References

    1. Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Eberhard C, et al. (2016). The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic acids research 44, W3–W10. - PMC - PubMed
    1. Almugbel R, Hung LH, Hu J, Almutairy A, Ortogero N, Tamta Y, and Yeung KY (2018). Reproducible Bioconductor workflows using browser-based interactive notebooks and containers. Journal of the American Medical Informatics Association : JAMIA 25, 4–12. - PMC - PubMed
    1. Anders S, and Huber W (2010). Differential expression analysis for sequence count data. Genome biology 11, R106. - PMC - PubMed
    1. Belmann P, Droge J, Bremges A, McHardy AC, Sczyrba A, and Barton MD (2015). Bioboxes: standardised containers for interchangeable bioinformatics software. Gigascience 4, 47. - PMC - PubMed
    1. Bray NL, Pimentel H, Melsted P, and Pachter L (2016). Near-optimal probabilistic RNA-seq quantification. Nature biotechnology 34, 525–527. - PubMed

Publication types

LinkOut - more resources