Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug;28(8):e70188.
doi: 10.1111/ele.70188.

Natural History Collections at the Crossroads: Shifting Priorities and Data-Driven Opportunities

Affiliations

Natural History Collections at the Crossroads: Shifting Priorities and Data-Driven Opportunities

Owen Forbes et al. Ecol Lett. 2025 Aug.

Abstract

Natural history collections face a critical juncture as environmental change and biodiversity crises accelerate. While collections data are key components of eco-evolutionary and environmental research in both fundamental and applied contexts, collecting strategies remain primarily taxonomically motivated. We argue that sampling strategies must evolve to better address broader ecological challenges, through emerging applications enabled by advances in data science and digital technology. Here, we propose a flexible framework using modern statistical approaches to optimise sampling design and research prioritisation. By considering biodiversity, environmental, spatial and temporal dimensions, we demonstrate how information theory and decision science tools can support strategic collecting, databasing and digitisation efforts. Our framework provides an evidence-based pathway for collections to enhance their role as critical research infrastructure for addressing 21st century environmental challenges. To illustrate this data-driven approach to research prioritisation, we present an example based on sampling design for wombats (Vombatus ursinus) in Australia.

Keywords: Bayesian decision theory; FAIR data principles; adaptive sampling design; conservation planning; data science; digital extended specimen; museum genomics; natural history collections; research prioritisation; value of information.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Information theoretic framework for research prioritisation in collections science. (A) Conceptual illustration of information gain, showing how uncertainty decreases as new data are collected. When data with higher information value is collected, the posterior distribution becomes more concentrated with additional observations, improving precision for outcomes of interest (e.g., species distributions, phenological patterns, or morphological variation). (B) Decision support framework combining Value of Information (VOI) and Need for Information (NFI) metrics to guide sampling design. This consensus plot illustrates a quadrant contrasting high and low VOI against high and low NFI, providing a structured approach for balancing information gain potential against ecological priorities when allocating sampling effort.
FIGURE 2
FIGURE 2
Data‐driven research prioritisation example—sampling design for the bare‐nosed wombat ( Vombatus ursinus ) in New South Wales, Australia, based on observation and specimen data from GBIF. (A) Value of Information (VOI) map showing expected information gain from additional sampling (darker blue = higher gain), calculated using a binomial model of annual observations in each 50 km grid, with expected information gain calculated using Kullback–Leibler divergence between current and updated probability distributions after simulated sampling. (B) Need for Information (NFI) map based on habitat loss percentiles (darker green = greater habitat loss), derived from the Habitat Condition Assessment System. (C) Cost of Information (COI) map using the Australian Bureau of Statistics' remoteness classification system (darker purple = metropolitan areas with higher accessibility and lower expected sampling costs). (D) Consensus diagram—VOI‐NFI quadrant analysis revealing priority sampling locations (upper right quadrant) where both information value and ecological need are high. In Panel A, grid cells (50 km) with insufficient data appear in grey (fewer than 2 total observations per grid). See Data S1 for details on methodology, data sources, and implementation.

Similar articles

References

    1. Akinlotan, M. D. , Warne D. J., Helmstedt K. J., et al. 2024. “Beyond Expected Values: Making Environmental Decisions Using Value of Information Analysis When Measurement Outcome Matters.” Ecological Indicators 160: 111828.
    1. Anderson, R. P. , Araújo M. B., Guisan A., et al. 2020. “Optimizing Biodiversity Informatics to Improve Information Flow, Data Quality, and Utility for Science and Society.” Frontiers of Biogeography 12, no. 3: e47839.
    1. Australian Bureau of Statistics . 2021. “Australian Statistical Geography Standard (ASGS) Edition 3”.
    1. Bakker, F. T. , Antonelli A., Clarke J. A., et al. 2020. “The Global Museum: Natural History Collections and the Future of Evolutionary Science and Public Education.” PeerJ 8: e8225. - PMC - PubMed
    1. Ball, L. , Rodríguez‐Machado S., Paredes‐Burneo D., et al. 2025. “What ‘Unexplored’ Means: Mapping Regions With Digitized Natural History Records to Look for ‘Biodiversity Blindspots’.” PeerJ 13: e18511. - PMC - PubMed

LinkOut - more resources