F1000Res. 2015 Dec 14;4:1443.
doi: 10.12688/f1000research.7536.1. eCollection 2015.

BioShaDock: a community driven bioinformatics shared Docker-based tools registry

François Moreews et al. F1000Res.

Abstract

Linux container technologies, as represented by Docker, provide an alternative to the complex and time-consuming installation processes needed for scientific software. The ease of deployment and the process isolation they enable, as well as the reproducibility they permit across environments and versions, are among the qualities that make them interesting candidates for the construction of bioinformatic infrastructures at any scale, from single workstations to high-throughput computing architectures. The Docker Hub is a public registry that can be used to distribute bioinformatic software as Docker images. However, its lack of curation and its generic scope make it difficult for a bioinformatics user to find the most appropriate images. BioShaDock is a bioinformatics-focused Docker registry that provides a local and fully controlled environment to build and publish bioinformatic software as portable Docker images. It improves on the base Docker registry in authentication and permission management, which enables its integration into existing bioinformatic infrastructures such as computing platforms. The metadata associated with the registered images are domain-centric and include, for instance, concepts defined in the EDAM ontology, a shared and structured vocabulary of commonly used terms in bioinformatics. Each registry entry also carries user-defined tags to make the image easier to discover, as well as a link to the tool description in the ELIXIR registry if one already exists. If no such description exists, BioShaDock synchronizes with the ELIXIR registry to create one, based on the BioShaDock entry metadata. This link helps users get more information on the tool, such as its EDAM operations and its input and output types. It also allows integration with the ELIXIR Tools and Data Services Registry, thus giving such images appropriate visibility in the bioinformatics community.
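The metadata and synchronization behaviour described in the abstract can be illustrated with a minimal sketch. It is not BioShaDock's actual data model or API: the ImageEntry class, its field names, the in-memory elixir_registry dictionary, and the sync_with_elixir and search functions are all hypothetical names introduced here, and the example tool and EDAM label are purely illustrative.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ImageEntry:
    """Hypothetical metadata record for one registered Docker image."""
    name: str
    description: str
    edam_terms: List[str] = field(default_factory=list)  # EDAM concept labels (illustrative)
    tags: List[str] = field(default_factory=list)        # user-defined tags
    elixir_id: Optional[str] = None                      # link to an ELIXIR description, if any


def sync_with_elixir(entry: ImageEntry, elixir_registry: Dict[str, dict]) -> str:
    """Return the ELIXIR identifier for an entry.

    If the entry already links to an existing description, reuse it.
    Otherwise create a new description from the entry metadata
    (assumption: the identifier is simply the lower-cased tool name).
    """
    if entry.elixir_id is not None and entry.elixir_id in elixir_registry:
        return entry.elixir_id
    new_id = entry.name.lower()
    elixir_registry[new_id] = {
        "name": entry.name,
        "description": entry.description,
        "edam_terms": list(entry.edam_terms),
    }
    entry.elixir_id = new_id
    return new_id


def search(entries: List[ImageEntry], query: str) -> List[ImageEntry]:
    """Naive case-insensitive full-text search over name, description, terms and tags."""
    q = query.lower()
    return [
        e for e in entries
        if q in " ".join([e.name, e.description, *e.edam_terms, *e.tags]).lower()
    ]


if __name__ == "__main__":
    registry: Dict[str, dict] = {}  # stand-in for the remote ELIXIR registry
    entry = ImageEntry(
        name="ExampleAligner",  # hypothetical tool
        description="Aligns sequences",
        edam_terms=["Sequence alignment"],
        tags=["alignment"],
    )
    print(sync_with_elixir(entry, registry))
    print([e.name for e in search([entry], "alignment")])
```

A real registry would call the ELIXIR service over the network and resolve EDAM concepts by URI; the dictionary here only stands in for that remote service so that the link-or-create logic is visible.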

Keywords: bioinformatics; community driven registry; container; deployment; docker; interoperability; maintainability.

Conflict of interest statement

Competing interests: No competing interests were disclosed.

Figures

Figure 1. The BioShaDock web interface.
The interface enables the creation of Dockerfiles and allows users to search the repository using full-text queries.
Figure 2. The BioShaDock Docker container processing steps.
Figure 3. The BioShaDock use cases.
The Docker repository acts as a platform that facilitates the dissemination of bioinformatics tools by providing ready-to-use Docker images.
