Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data
- PMID: 39771857
- PMCID: PMC11679253
- DOI: 10.3390/s24248122
Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data
Abstract
Camera traps offer enormous new opportunities in ecological studies, but current automated image analysis methods often lack the contextual richness needed to support impactful conservation outcomes. Integrating vision-language models into these workflows could address this gap by providing enhanced contextual understanding and enabling advanced queries across temporal and spatial dimensions. Here, we present an integrated approach that combines deep learning-based vision and language models to improve ecological reporting using data from camera traps. We introduce a two-stage system: YOLOv10-X to localise and classify species (mammals and birds) within images and a Phi-3.5-vision-instruct model to read YOLOv10-X bounding box labels to identify species, overcoming its limitation with hard-to-classify objects in images. Additionally, Phi-3.5 detects broader variables, such as vegetation type and time of day, providing rich ecological and environmental context to YOLO's species detection output. When combined, this output is processed by the model's natural language system to answer complex queries, and retrieval-augmented generation (RAG) is employed to enrich responses with external information, like species weight and IUCN status (information that cannot be obtained through direct visual analysis). Combined, this information is used to automatically generate structured reports, providing biodiversity stakeholders with deeper insights into, for example, species abundance, distribution, animal behaviour, and habitat selection. Our approach delivers contextually rich narratives that aid in wildlife management decisions. By providing contextually rich insights, our approach not only reduces manual effort but also supports timely decision making in conservation, potentially shifting efforts from reactive to proactive.
Keywords: biodiversity monitoring; deep learning; large language models; object detection; vision transformers; wildlife conservation.
Conflict of interest statement
There are no conflicts of interest.
Figures
























Similar articles
-
Large-scale and long-term wildlife research and monitoring using camera traps: a continental synthesis.Biol Rev Camb Philos Soc. 2025 Apr;100(2):530-555. doi: 10.1111/brv.13152. Epub 2025 Jan 17. Biol Rev Camb Philos Soc. 2025. PMID: 39822039 Free PMC article. Review.
-
Temporal insights into ecological community: Advancing waterbird monitoring with dome camera and deep learning.J Environ Manage. 2025 Jul;387:125769. doi: 10.1016/j.jenvman.2025.125769. Epub 2025 May 21. J Environ Manage. 2025. PMID: 40403671
-
Camera trap surveys of Atlantic Forest mammals: A data set for analyses considering imperfect detection (2004-2020).Ecology. 2024 May;105(5):e4298. doi: 10.1002/ecy.4298. Epub 2024 Apr 12. Ecology. 2024. PMID: 38610092
-
Estimating species richness and modelling habitat preferences of tropical forest mammals from camera trap data.PLoS One. 2014 Jul 23;9(7):e103300. doi: 10.1371/journal.pone.0103300. eCollection 2014. PLoS One. 2014. PMID: 25054806 Free PMC article.
-
An overview of remote monitoring methods in biodiversity conservation.Environ Sci Pollut Res Int. 2022 Nov;29(53):80179-80221. doi: 10.1007/s11356-022-23242-y. Epub 2022 Oct 5. Environ Sci Pollut Res Int. 2022. PMID: 36197618 Free PMC article. Review.
References
-
- O’Connell A.F., Nichols J.D., Karanth K.U. Camera Traps in Animal Ecology: Methods and Analyses. Vol. 271 Springer; Berlin/Heidelberg, Germany: 2011.
-
- Villa A.G., Salazar A., Vargas F. Towards automatic wild animal monitoring: Identification of animal species in camera-trap images using very deep convolutional neural networks. Ecol. Inform. 2017;41:24–32. doi: 10.1016/j.ecoinf.2017.07.004. - DOI
-
- Nazir S., Kaleem M. Advances in image acquisition and processing technologies transforming animal ecological studies. Ecol. Inform. 2021;61:101212. doi: 10.1016/j.ecoinf.2021.101212. - DOI
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous