Commentary on "Preliminary Species Hypotheses" in Entomological Taxonomy: A Global Data and FAIR Infrastructure Perspective
- PMID: 39967725
- PMCID: PMC11833303
- DOI: 10.3897/BDJ.13.e141562
Commentary on "Preliminary Species Hypotheses" in Entomological Taxonomy: A Global Data and FAIR Infrastructure Perspective
Abstract
What if early taxonomic findings were treated like preprints, open to iterative improvement or managed with practices from the open-source community, such as Git branching, merging and patch management? Prompted by Buckley's article Charting a Future for Entomological Taxonomy in New Zealand (2024), this commentary explores these possibilities in the context of biodiversity informatics. In response to the need for rapid, scalable biodiversity monitoring, Buckley introduces preliminary species hypotheses (PSH) as a bridge between quick identification tools and the rigorous Linnaean system, leveraging DNA barcoding and AI-assisted image recognition to produce provisional classifications that can later be validated. Expanding on Buckley's framework, this commentary emphasises the critical role of data linking, versioning and integration to support evolving taxonomic data. Borrowing from software and open-source practices, I explore the idea of managing PSH with an infrastructure that treats each taxonomic update as a versioned "commit", which can be tracked, refined and integrated over time. Drawing insights from FAIR (Findable, Accessible, Interoperable, Reusable) principles and Digital Extended Specimens, I identify infrastructure requirements for PSH, including robust data standards, persistent identifiers and interoperability to support global biodiversity repositories. Additionally, Taxonomic Data Objects offer a model for dynamically integrating PSH into adaptable taxonomies that can evolve with new data and tools. By positioning PSH within an open, infrastructure-focused framework, this commentary advocates for scalable, hypothesis-driven biodiversity data that meets modern conservation needs, bridging traditional and emerging practices in taxonomy.
Keywords: FAIR; data integration; interoperability; open source; species; taxonomy.
Sharif Islam.
Conflict of interest statement
No conflict of interest to declare Disclaimer: This article is (co-)authored by any of the Editors-in-Chief, Managing Editors or their deputies in this journal.
Figures

References
-
- Addink Wouter, Kyriakopoulou Niki, Penev Lyubomir, Fichtmueller David, Norton Ben, Shorthouse David. Deliverable D1.3 Best practice manual for findability, re-use and accessibility of infrastructures. ARPHA Preprints. 2023 doi: 10.3897/arphapreprints.e107169. - DOI
-
- Bánki Olaf, Döring Markus, Jeppesen Thomas. Name IDs and Name Matching for Catalogue of Life: Existing Services and Prospects. Biodiversity Information Science and Standards. 2023;7 doi: 10.3897/biss.7.111662. - DOI
-
- Bentley Andrew, Thiers Barbara, Moser William E, Watkins-Colwell Gregory J, Zimkus Breda M, Monfils Anna K, Franz Nico M, Bates John M, Boundy-Mills Kyria, Lomas Michael W, Ellwood Elizabeth R, Poo Sinlan, Contreras Dori L, Webster Michael S, Nelson Gil, Pandey Jyotsna L. Community Action: Planning for Specimen Management in Funding Proposals. BioScience. 2024;74(7):435–439. doi: 10.1093/biosci/biae032. - DOI
-
- Borgman Christine L., Sands Ashley E., Darch Peter T., Golshan Milena S. The durability and fragility of knowledge infrastructures: Lessons learned from astronomy. Proceedings of the Association for Information Science and Technology. 2016;53(1):1–10. doi: 10.1002/pra2.2016.14505301057. - DOI
-
- Borgman Christine L., Wofford Morgan F. From Data Processes to Data Products: Knowledge Infrastructures in Astronomy. Harvard Data Science Review. 2021 doi: 10.1162/99608f92.4e792052. - DOI
LinkOut - more resources
Full Text Sources