Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 27;8(1):192.
doi: 10.1038/s41597-021-00981-0.

Data sharing practices and data availability upon request differ across scientific disciplines

Affiliations

Data sharing practices and data availability upon request differ across scientific disciplines

Leho Tedersoo et al. Sci Data. .

Abstract

Data sharing is one of the cornerstones of modern science that enables large-scale analyses and reproducibility. We evaluated data availability in research articles across nine disciplines in Nature and Science magazines and recorded corresponding authors' concerns, requests and reasons for declining data sharing. Although data sharing has improved in the last decade and particularly in recent years, data availability and willingness to share data still differ greatly among disciplines. We observed that statements of data availability upon (reasonable) request are inefficient and should not be allowed by journals. To improve data sharing at the time of manuscript acceptance, researchers should be better motivated to release their data with real benefits such as recognition, or bonus points in grant and job applications. We recommend that data management costs should be covered by funding agencies; publicly available research data ought to be included in the evaluation of applications; and surveillance of data sharing should be enforced by both academic publishers and funders. These cross-discipline survey data are available from the plutoF repository.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Schematic rationale of the study.
Fig. 2
Fig. 2
Differences in partial (grey) and full (black) data availability among disciplines depending on journal and publishing period (P1, 2000–2009; P2, 2010–2019) before contacting the authors (n = 875). Letters above bars indicate statistically significant difference groups among disciplines in full data availability compared to no data availability. Asterisks show significant differences in full data availability between journals and publishing periods.
Fig. 3
Fig. 3
Types of critical data (n = 875). (a) Distribution of data types among disciplines (blue, dataset; purple, image; black, model); (b) Partial (light shades) and full (dark shades) data availability among disciplines depending on the type of critical data (DS, dataset; Img, image; Mod, model) before contacting the author(s).
Fig. 4
Fig. 4
Differences in partial (grey) and full (black) data availability among disciplines after data requests (n = 672) depending on the type of critical data (DS, dataset; image; model) and publishing period (P1, 2000–2009; P2, 2010–2019). Numbers above bars indicate statistically significant difference groups among disciplines in full data availability.
Fig. 5
Fig. 5
Histogram of time for receiving data from authors upon request within the 60-day reasonable time period (blue bars) and beyond (purple bar; data excluded from analyses; n = 199 requests). Note the 2-base logarithmic scale until 60 days.
Fig. 6
Fig. 6
Authors’ response to data request (n = 199) depending on discipline (blue, declined; orange, ignored; purple, obtained). Bars indicate 95% CI of Sison and Glaz. Letters above bars indicate statistically significant difference groups in frequency of data availability by each category based on Tukey post-hoc test and Bonferroni correction.
Fig. 7
Fig. 7
Decay in critical data availability initially (blue circles; n = 672), at the end of a 60-day contacting period (purple circles; n = 672) and upon request from the authors (black circles; n = 310).
Fig. 8
Fig. 8
Frequency distribution of authors’ (a) Concerns and requests (n = 199) and (b) reasons for declining data sharing (n = 67). White bars indicate answers where no concerns or reasons were specified.
Fig. 9
Fig. 9
Preferred ways of data storage in articles (n = 875) representing different disciplines (blue, text and supplement; purple, data archive; yellow, authors’ homepage; vermillion, previous publications; grey, museum; black, upon (reasonable) request; white, none declared.

References

    1. Fan J, et al. Challenges of big data analysis. Nat. Sci. Rev. 2014;1:293–314. doi: 10.1093/nsr/nwt032. - DOI - PMC - PubMed
    1. Kitchin R. The data revolution: Big data, open data, data infrastructures and their consequences. London: Sage Publications; 2014.
    1. Science Staff Challenges and opportunities. Science. 2011;331:692–693. doi: 10.1126/science.331.6018.692. - DOI - PubMed
    1. Cech, T. R. et al. Sharing publication-related data and materials: responsibilities of authorship in the life sciences. National Academies Press, Washington, D.C. (2003). - PubMed
    1. Fischer BA, Zigmond MJ. The essential nature of sharing in science. Sci. Engineer. Ethics. 2010;16:783–799. doi: 10.1007/s11948-010-9239-x. - DOI - PubMed