Data integration enables global biodiversity synthesis
- PMID: 33526679
- PMCID: PMC8017944
- DOI: 10.1073/pnas.2018093118
Data integration enables global biodiversity synthesis
Abstract
The accessibility of global biodiversity information has surged in the past two decades, notably through widespread funding initiatives for museum specimen digitization and emergence of large-scale public participation in community science. Effective use of these data requires the integration of disconnected datasets, but the scientific impacts of consolidated biodiversity data networks have not yet been quantified. To determine whether data integration enables novel research, we carried out a quantitative text analysis and bibliographic synthesis of >4,000 studies published from 2003 to 2019 that use data mediated by the world's largest biodiversity data network, the Global Biodiversity Information Facility (GBIF). Data available through GBIF increased 12-fold since 2007, a trend matched by global data use with roughly two publications using GBIF-mediated data per day in 2019. Data-use patterns were diverse by authorship, geographic extent, taxonomic group, and dataset type. Despite facilitating global authorship, legacies of colonial science remain. Studies involving species distribution modeling were most prevalent (31% of literature surveyed) but recently shifted in focus from theory to application. Topic prevalence was stable across the 17-y period for some research areas (e.g., macroecology), yet other topics proportionately declined (e.g., taxonomy) or increased (e.g., species interactions, disease). Although centered on biological subfields, GBIF-enabled research extends surprisingly across all major scientific disciplines. Biodiversity data mobilization through global data aggregation has enabled basic and applied research use at temporal, spatial, and taxonomic scales otherwise not possible, launching biodiversity sciences into a new era.
Keywords: Global Biodiversity Information Facility (GBIF); biodiversity informatics; biological collections; community science; scientometrics.
Copyright © 2021 the Author(s). Published by PNAS.
Conflict of interest statement
Competing interest statement: GBIF opened a call for proposals to carry out a contracted analysis of scientific research published based on GBIF-mediated data in 2016 to 2019. J.M.H. was selected to carry out and lead author this study; J.T.M., D.N., and D.S. are employees of GBIF. GBIF as a funder of the study took part in scoping and setting the study goals, but had no influence on study analysis, results, or conclusions.
Figures
References
-
- IPBES , Summary for policymakers of the global assessment report on biodiversity and ecosystem services of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services. 10.5281/zenodo.3553579. Accessed 10 June 2020. - DOI
-
- Proença V., et al. ., Global biodiversity monitoring: From data sources to essential biodiversity variables. Biol. Conserv. 213, 256–263 (2017).
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
