Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 27;289(1979):20220938.
doi: 10.1098/rspb.2022.0938. Epub 2022 Jul 20.

Data rescue: saving environmental data from extinction

Affiliations

Data rescue: saving environmental data from extinction

Ellen K Bledsoe et al. Proc Biol Sci. .

Abstract

Historical and long-term environmental datasets are imperative to understanding how natural systems respond to our changing world. Although immensely valuable, these data are at risk of being lost unless actively curated and archived in data repositories. The practice of data rescue, which we define as identifying, preserving, and sharing valuable data and associated metadata at risk of loss, is an important means of ensuring the long-term viability and accessibility of such datasets. Improvements in policies and best practices around data management will hopefully limit future need for data rescue; these changes, however, do not apply retroactively. While rescuing data is not new, the term lacks formal definition, is often conflated with other terms (i.e. data reuse), and lacks general recommendations. Here, we outline seven key guidelines for effective rescue of historically collected and unmanaged datasets. We discuss prioritization of datasets to rescue, forming effective data rescue teams, preparing the data and associated metadata, and archiving and sharing the rescued materials. In an era of rapid environmental change, the best policy solutions will require evidence from both contemporary and historical sources. It is, therefore, imperative that we identify and preserve valuable, at-risk environmental data before they are lost to science.

Keywords: data archiving; historical data; long-term ecological data; open science; reproducibility; transparency.

PubMed Disclaimer

Conflict of interest statement

We declare we have no competing interests.

Figures

Box 2.1.
Box 2.1.
Photograph of loose data sheets, maps, reports and picture slides; these items and many more filled the boxes of research material left behind by Dr La Roi. Image credit: A. Hesketh.
Box 2.2.
Box 2.2.
Example of non-standard data to be rationalized and digitized, representing the significance of correlations between habitat features. These symbols were converted to numeric factors during digitization. Reproduced with modification from Lancaster [, see Appendix 4, p. 103–104 therein].
Figure 1.
Figure 1.
Prioritizing data for rescue: balancing the value of the data and its risk of loss. With many datasets in need of preservation and limited resources, the first step in the data rescue process requires developing a list of priorities for consideration and identifying relevant datasets (figure 2). We consider data prioritization to be a balance between the assessed value of a dataset in question and the potential risk of its loss in the absence of intervention (see Data prioritization under Guidelines). Alt-text is available in the electronic supplementary material. (Online version in colour.)
Figure 2.
Figure 2.
Steps in the data rescue assembly line. First, data must be prioritized for rescue (Step 1). After team creation (Step 2) and metadata creation (Step 3), the data must be transferred and compiled into a logical format (Step 4). After data cleaning and validation (Step 5) is complete, the finalized data and metadata should be archived on a long-term data repository (Step 6). The ultimate goal is to have the rescued data openly available for re-use (Step 7). Alt-text is available in the electronic supplementary material.

References

    1. McClenachan L, Ferretti F, Baum JK. 2012. From archives to conservation: why historical data are needed to set baselines for marine animals and ecosystems. Conserv. Lett. 5, 349-359. (10.1111/j.1755-263X.2012.00253.x) - DOI
    1. Gatti G, Bianchi CN, Parravicini V, Rovere A, Peirano A, Montefalcone M, Massa F, Morri C. 2015. Ecological change, sliding baselines and the importance of historical data: lessons from combining observational and quantitative data on a temperate reef over 70 years. PLoS ONE 10, e0123268. (10.1371/journal.pone.0118581) - DOI - PMC - PubMed
    1. Willis KJ, Araùjo MB, Bennett KD, Figueroa-Rangel B, Freud CA, Myers N. 2007. How can a knowledge of the past help to conserve the future? Biodiversity conservation and the relevance of long-term ecological data. Phil. Trans. R. Soc. B 362, 175-187. (10.1098/rstb.2006.1977) - DOI - PMC - PubMed
    1. Renaut S, Budden AE, Gravel D, Poisot T, Peres-Neto P. 2018. Management, archiving, and sharing for biologists and the role of research institutions in the technology-oriented age. Bioscience 68, 400-411. (10.1093/biosci/biy038) - DOI
    1. Vines TH, et al. 2014. The availability of research data declines rapidly with article age. Curr. Biol. 24, 94-97. (10.1016/j.cub.2013.11.014) - DOI - PubMed

LinkOut - more resources