Practical Computational Reproducibility in the Life Sciences
- PMID: 29953862
- PMCID: PMC6263957
- DOI: 10.1016/j.cels.2018.03.014
Practical Computational Reproducibility in the Life Sciences
Abstract
Many areas of research suffer from poor reproducibility, particularly in computationally intensive domains where results rely on a series of complex methodological decisions that are not well captured by traditional publication approaches. Various guidelines have emerged for achieving reproducibility, but implementation of these practices remains difficult due to the challenge of assembling software tools plus associated libraries, connecting tools together into pipelines, and specifying parameters. Here, we discuss a suite of cutting-edge technologies that make computational reproducibility not just possible, but practical in both time and effort. This suite combines three well-tested components-a system for building highly portable packages of bioinformatics software, containerization and virtualization technologies for isolating reusable execution environments for these packages, and workflow systems that automatically orchestrate the composition of these packages for entire pipelines-to achieve an unprecedented level of computational reproducibility. We also provide a practical implementation and five recommendations to help set a typical researcher on the path to performing data analyses reproducibly.
Copyright © 2018 Elsevier Inc. All rights reserved.
Conflict of interest statement
DECLARATION OF INTERESTS
Authors declare no competing financial interests.
Figures

References
-
- Baker M (2016). 1,500 scientists lift the lid on reproducibility. Nature 533, 452–454. - PubMed
-
- Baumer B, Cetinkaya-Rundel M, Bray A, Loi L, and Horton NJ (2014). R Markdown: Inte grating A Reproducible Analysis Tool into Introductory Statistics. arXiv, arXiv:1402.1894.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources