Improving the accessibility and transferability of machine learning algorithms for identification of animals in camera trap images: MLWIC2
- PMID: 33072266
- PMCID: PMC7548173
- DOI: 10.1002/ece3.6692
Abstract
Motion-activated wildlife cameras (or "camera traps") are frequently used to remotely and noninvasively observe animals. The vast number of images collected from camera trap projects has prompted some biologists to employ machine learning algorithms to automatically recognize species in these images, or at least filter out images that do not contain animals. These approaches are often limited by model transferability, as a model trained to recognize species from one location might not work as well for the same species in different locations. Furthermore, these methods often require advanced computational skills, making them inaccessible to many biologists. We used 3 million camera trap images from 18 studies in 10 states across the United States of America to train two deep neural networks: the "species model," which recognizes 58 species, and the "empty-animal model," which determines whether an image is empty or contains an animal. Our species model and empty-animal model had accuracies of 96.8% and 97.3%, respectively. Furthermore, the models performed well on some out-of-sample datasets, as the species model had 91% accuracy on species from Canada (accuracy range 36%-91% across all out-of-sample datasets) and the empty-animal model achieved an accuracy of 91%-94% on out-of-sample datasets from different continents. Our software addresses some of the limitations of using machine learning to classify images from camera traps. By including many species from several locations, our species model is potentially applicable to many camera trap studies in North America. We also found that our empty-animal model can facilitate removal of images without animals globally.
We provide the trained models in an R package (MLWIC2: Machine Learning for Wildlife Image Classification in R), which contains Shiny Applications that allow scientists with minimal programming experience to use trained models and train new models in six neural network architectures with varying depths.
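The accuracy figures above are reported both overall and as a range across out-of-sample datasets. As a minimal sketch of that kind of summary (this is not the MLWIC2 code or its API; the function name `accuracy_by_dataset`, the dataset names, and the toy labels are all hypothetical), per-dataset accuracy can be computed from true and predicted labels like this:

```python
from collections import defaultdict


def accuracy_by_dataset(records):
    """Return {dataset: fraction of images whose predicted label matches the true label}.

    `records` is an iterable of (dataset, true_label, predicted_label) tuples.
    This is an illustrative helper, not part of MLWIC2.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for dataset, truth, predicted in records:
        total[dataset] += 1
        if truth == predicted:
            correct[dataset] += 1
    return {name: correct[name] / total[name] for name in total}


# Hypothetical toy labels for an empty-vs-animal classifier on two datasets.
records = [
    ("site_a", "animal", "animal"),
    ("site_a", "empty", "empty"),
    ("site_a", "empty", "animal"),   # a misclassified empty image
    ("site_a", "animal", "animal"),
    ("site_b", "empty", "empty"),
    ("site_b", "animal", "animal"),
]

scores = accuracy_by_dataset(records)
lowest, highest = min(scores.values()), max(scores.values())
print(scores)
print(f"accuracy range across datasets: {lowest:.0%}-{highest:.0%}")
```

Reporting the range alongside a single headline number makes it visible when a model transfers well to some locations and poorly to others.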
Keywords: R package; computer vision; deep convolutional neural networks; image classification; machine learning; motion‐activated camera; remote sensing; species identification.
© 2020 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.
Conflict of interest statement
The authors have no conflicts of interest to declare.
