A story of data won, data lost and data re-found: the realities of ecological data preservation
- PMID: 30473618
- PMCID: PMC6235994
- DOI: 10.3897/BDJ.6.e28073
Abstract
This paper discusses the process of retrieving and updating legacy data to allow on-line discovery and delivery. There are many pitfalls of institutional and non-institutional ecological data conservation over the long term. Interruptions to custodianship, old media, lost knowledge and the continuous evolution of species names make resurrection of old data challenging. We caution against technological arrogance and emphasise the importance of international standards. We use a case study of a compiled set of continent-wide vegetation survey data for which, although the analyses had been published, the raw data had not. In the original study, publications containing plot data collected from the 1880s onwards had been gathered, interpreted, digitised and integrated for the classification of vegetation and analysis of its conservation status across Australia. These compiled data are an extremely valuable national collection that demanded publishing in open, readily accessible online repositories, such as the Terrestrial Ecosystem Research Network (http://www.tern.org.au) and the Atlas of Living Australia (ALA: http://www.ala.org.au), the Australian node of the Global Biodiversity Information Facility (GBIF: http://www.gbif.org). It is hoped that the lessons learnt from this project may trigger a sober review of the value of endangered data, the cost of retrieval and the importance of suitable and timely archiving through the vicissitudes of technological change, so the initial unique collection investment enables multiple re-use in perpetuity.
Keywords: data conservation; data curation; data retrieval; legacy data; long-term data accessibility.