Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Dec 10;2(12):100368.
doi: 10.1016/j.patter.2021.100368.

Common-sense approaches to sharing tabular data alongside publication

Affiliations
Review

Common-sense approaches to sharing tabular data alongside publication

Nicholas J Tierney et al. Patterns (N Y). .

Abstract

Numerous arguments strongly support the practice of open science, which offers several societal and individual benefits. For individual researchers, sharing research artifacts such as data can increase trust and transparency, improve the reproducibility of one's own work, and catalyze new collaborations. Despite a general appreciation of the benefits of data sharing, research data are often only available to the original investigators. For data that are shared, lack of useful metadata and documentation make them challenging to reuse. In this paper, we argue that a lack of incentives and infrastructure for making data useful is the biggest barrier to creating a culture of widespread data sharing. We compare data with code, examine computational environments in the context of their ability to facilitate the reproducibility of research, provide some practical guidance on how one can improve the chances of their data being reusable, and partially bridge the incentive gap. While previous papers have focused on describing ideal best practices for data and code, we focus on common-sense ideas for sharing tabular data for a target audience of academics working in data science adjacent fields who are about to submit for publication.

Keywords: DSML 4: Production: Data science output is validated, understood, and regularly used for multiple domains/platforms.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The mechanisms for behavior change, the incentives, and our assessment of where the elements of data, code, and computational environment rank in terms of completing these aspects. We note that data are often required, but the preceding steps are not, in contrast to code, which has no policy.

References

    1. Nielsen M. Princeton University Press; 2020. Reinventing Discovery: The New Era of Networked Science.
    1. Peng R.D. Reproducible research in computational science. Science. 2011;334:1226–1227. - PMC - PubMed
    1. McKiernan E.C., Bourne P.E., Brown C.T., Buck S., Kenall A., Lin J., et al. How open science helps researchers succeed. eLife. 2016;5:e16800. - PMC - PubMed
    1. Barnes N. Publish your computer code: it is good enough. Nature. 2010;467:753. - PubMed
    1. Ram K. Git can facilitate greater reproducibility and increased transparency in science. Source Code Biol. Med. 2013;8:7. - PMC - PubMed

LinkOut - more resources