The role of metadata in reproducible computational research
- PMID: 34553169
- PMCID: PMC8441584
- DOI: 10.1016/j.patter.2021.100322
The role of metadata in reproducible computational research
Abstract
Reproducible computational research (RCR) is the keystone of the scientific method for in silico analyses, packaging the transformation of raw data to published results. In addition to its role in research integrity, improving the reproducibility of scientific studies can accelerate evaluation and reuse. This potential and wide support for the FAIR principles have motivated interest in metadata standards supporting reproducibility. Metadata provide context and provenance to raw data and methods and are essential to both discovery and validation. Despite this shared connection with scientific data, few studies have explicitly described how metadata enable reproducible computational research. This review employs a functional content analysis to identify metadata standards that support reproducibility across an analytic stack consisting of input data, tools, notebooks, pipelines, and publications. Our review provides background context, explores gaps, and discovers component trends of embeddedness and methodology weight from which we derive recommendations for future work.
Keywords: FAIR; RCR; containers; metadata; notebooks; ontologies; pipelines; provenance; replicability; reproducibility; reproducible computational research; reproducible research; semantic; software dependencies; workflows.
© 2021 The Authors.
Conflict of interest statement
The authors declare no competing interests.
Figures
References
-
- National Academies of Sciences, Engineering, and Medicine; Policy and Global Affairs; Committee on Science, Engineering, Medicine, and Public Policy; Board on Research Data and Information; Division on Engineering and Physical Sciences; Committee on Applied and Theoretical Statistics; Board on Mathematical Sciences and Analytics; Division on Earth and Life Studies; Nuclear and Radiation Studies Board; Division of Behavioral and Social Sciences and Education; Committee on National Statistics; Board on Behavioral, Cognitive, and Sensory Sciences . National Academies Press; 2019. Committee on Reproducibility and Replicability in Science. Reproducibility and Replicability in Science.
-
- Leipzig J. 2019. Awesome Reproducible Research.
Publication types
LinkOut - more resources
Full Text Sources
