Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 10:13:e141562.
doi: 10.3897/BDJ.13.e141562. eCollection 2025.

Commentary on "Preliminary Species Hypotheses" in Entomological Taxonomy: A Global Data and FAIR Infrastructure Perspective

Affiliations

Commentary on "Preliminary Species Hypotheses" in Entomological Taxonomy: A Global Data and FAIR Infrastructure Perspective

Sharif Islam. Biodivers Data J. .

Abstract

What if early taxonomic findings were treated like preprints, open to iterative improvement or managed with practices from the open-source community, such as Git branching, merging and patch management? Prompted by Buckley's article Charting a Future for Entomological Taxonomy in New Zealand (2024), this commentary explores these possibilities in the context of biodiversity informatics. In response to the need for rapid, scalable biodiversity monitoring, Buckley introduces preliminary species hypotheses (PSH) as a bridge between quick identification tools and the rigorous Linnaean system, leveraging DNA barcoding and AI-assisted image recognition to produce provisional classifications that can later be validated. Expanding on Buckley's framework, this commentary emphasises the critical role of data linking, versioning and integration to support evolving taxonomic data. Borrowing from software and open-source practices, I explore the idea of managing PSH with an infrastructure that treats each taxonomic update as a versioned "commit", which can be tracked, refined and integrated over time. Drawing insights from FAIR (Findable, Accessible, Interoperable, Reusable) principles and Digital Extended Specimens, I identify infrastructure requirements for PSH, including robust data standards, persistent identifiers and interoperability to support global biodiversity repositories. Additionally, Taxonomic Data Objects offer a model for dynamically integrating PSH into adaptable taxonomies that can evolve with new data and tools. By positioning PSH within an open, infrastructure-focused framework, this commentary advocates for scalable, hypothesis-driven biodiversity data that meets modern conservation needs, bridging traditional and emerging practices in taxonomy.

Keywords: FAIR; data integration; interoperability; open source; species; taxonomy.

PubMed Disclaimer

Conflict of interest statement

No conflict of interest to declare Disclaimer: This article is (co-)authored by any of the Editors-in-Chief, Managing Editors or their deputies in this journal.

Figures

Figure 1.
Figure 1.
A simplified conceptual framework for version-controlled taxonomic data management This diagram illustrates the parallel between hypothesis-driven taxonomic workflows and Git-based version control systems. Drawing inspiration from software development practices, the framework demonstrates how version control concepts could be applied to manage and track the evolution of taxonomic hypotheses. The actual processes involved are much more complex, as described in Pyle's paper "An Introduction to Scientific Names of Organisms and the Taxon Concepts they Represent (Pyle 2022).

References

    1. Addink Wouter, Kyriakopoulou Niki, Penev Lyubomir, Fichtmueller David, Norton Ben, Shorthouse David. Deliverable D1.3 Best practice manual for findability, re-use and accessibility of infrastructures. ARPHA Preprints. 2023 doi: 10.3897/arphapreprints.e107169. - DOI
    1. Bánki Olaf, Döring Markus, Jeppesen Thomas. Name IDs and Name Matching for Catalogue of Life: Existing Services and Prospects. Biodiversity Information Science and Standards. 2023;7 doi: 10.3897/biss.7.111662. - DOI
    1. Bentley Andrew, Thiers Barbara, Moser William E, Watkins-Colwell Gregory J, Zimkus Breda M, Monfils Anna K, Franz Nico M, Bates John M, Boundy-Mills Kyria, Lomas Michael W, Ellwood Elizabeth R, Poo Sinlan, Contreras Dori L, Webster Michael S, Nelson Gil, Pandey Jyotsna L. Community Action: Planning for Specimen Management in Funding Proposals. BioScience. 2024;74(7):435–439. doi: 10.1093/biosci/biae032. - DOI
    1. Borgman Christine L., Sands Ashley E., Darch Peter T., Golshan Milena S. The durability and fragility of knowledge infrastructures: Lessons learned from astronomy. Proceedings of the Association for Information Science and Technology. 2016;53(1):1–10. doi: 10.1002/pra2.2016.14505301057. - DOI
    1. Borgman Christine L., Wofford Morgan F. From Data Processes to Data Products: Knowledge Infrastructures in Astronomy. Harvard Data Science Review. 2021 doi: 10.1162/99608f92.4e792052. - DOI

LinkOut - more resources