Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 18:2023:baad048.
doi: 10.1093/database/baad048.

Unified and pluralistic ideals for data sharing and reuse in biodiversity

Affiliations

Unified and pluralistic ideals for data sharing and reuse in biodiversity

Beckett Sterner et al. Database (Oxford). .

Abstract

How should billions of species observations worldwide be shared and made reusable? Many biodiversity scientists assume the ideal solution is to standardize all datasets according to a single, universal classification and aggregate them into a centralized, global repository. This ideal has known practical and theoretical limitations, however, which justifies investigating alternatives. To support better community deliberation and normative evaluation, we develop a novel conceptual framework showing how different organizational models, regulative ideals and heuristic strategies are combined to form shared infrastructures supporting data reuse. The framework is anchored in a general definition of data pooling as an activity of making a taxonomically standardized body of information available for community reuse via digital infrastructure. We describe and illustrate unified and pluralistic ideals for biodiversity data pooling and show how communities may advance toward these ideals using different heuristic strategies. We present evidence for the strengths and limitations of the unification and pluralistic ideals based on systemic relationships of power, responsibility and benefit they establish among stakeholders, and we conclude the pluralistic ideal is better suited for biodiversity data.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
A typology for suitable management approaches to resources, modified from Ostrom et al. (62).
Figure 2.
Figure 2.
Five epistemic-organizational models for data pooling: data aggregators, search index portals, research contractors, peer data commons and custom databases. The properties of these models are illustrated by analogous approaches to how businesses or social groups make food for broader consumption. The models rely on data standardization and preprocessing to an increasing degree from left to right.
Figure 3.
Figure 3.
Four possible heuristics (A–D) for data pooling and their relationship to the unified and pluralistic ideals. See main text for discussion of A-D. White boxes represent distinct data portals, and colored symbols within the box represent data standardized to a local, potentially unique taxonomic classification that is not deliberately harmonized with other taxonomies. The vertical overlap in white boxes indicates where portals overlap in the scope of data they include.
Figure 4.
Figure 4.
Two heuristic approaches to unified data pooling. (A) Image from Bisby (28), illustrating his envisioned unified hierarchy of classificatory theories for taxonomic domains. The hierarchy would enable a centralized aggregator or modular system of data pools. (B) Image from Gallagher et al. (52), illustrating a modular hub-and-spoke system for integrating trait data about species by taxonomic domains. Black circles at the edges of the network show primary data publishers or sources. Intermediate nodes show projects harmonizing data and developing standards. The central, largest node represents a trait dataset registry that functions as a basic search index portal across domains.
Figure 5.
Figure 5.
Selected examples of North American and global biodiversity data pooling projects illustrate the current partially coordinated flow of biodiversity data (uni- or bi-directional arrows). Projects are distinguished by the geographic scale (y-axis) and type of domain used to characterize their scope (colors by column on the x-axis). The aggregator column represents some important biodiversity data pooling projects whose scopes cut across the other domain categories.

Similar articles

Cited by

References

    1. Leonelli S. (2016) Data-Centric Biology: A Philosophical Study. University of Chicago Press, Chicago, IL.
    1. Berman F., Rutenbar R., Christensen H.. et al. (2016) Realizing the Potential of Data Science. National Science Foundation Computer and Information Science and Engineering Advisory Committee Data Science Working Group.
    1. Feng X., Enquist B.J., Park D.S.. et al. (2022) A review of the heterogeneous landscape of biodiversity databases: opportunities and challenges for a synthesized biodiversity knowledge base. Glob. Ecol. Biogeogr., 31, 1242–1260.
    1. Lendemer J., Thiers B., Monfils A.K.. et al. (2020) The extended specimen network: a strategy to enhance us biodiversity collections, promote research and education. BioScience, 70, 23–30. - PMC - PubMed
    1. Heberling J.M., Miller J.T., Noesgaard D.. et al. (2021) Data integration enables global biodiversity synthesis. Proc. Natl. Acad. Sci., 118, e2018093118. - PMC - PubMed

Publication types