Vocal complexity in the long calls of Bornean orangutans
- PMID: 38766489
- PMCID: PMC11100477
- DOI: 10.7717/peerj.17320
Vocal complexity in the long calls of Bornean orangutans
Abstract
Vocal complexity is central to many evolutionary hypotheses about animal communication. Yet, quantifying and comparing complexity remains a challenge, particularly when vocal types are highly graded. Male Bornean orangutans (Pongo pygmaeus wurmbii) produce complex and variable "long call" vocalizations comprising multiple sound types that vary within and among individuals. Previous studies described six distinct call (or pulse) types within these complex vocalizations, but none quantified their discreteness or the ability of human observers to reliably classify them. We studied the long calls of 13 individuals to: (1) evaluate and quantify the reliability of audio-visual classification by three well-trained observers, (2) distinguish among call types using supervised classification and unsupervised clustering, and (3) compare the performance of different feature sets. Using 46 acoustic features, we used machine learning (i.e., support vector machines, affinity propagation, and fuzzy c-means) to identify call types and assess their discreteness. We additionally used Uniform Manifold Approximation and Projection (UMAP) to visualize the separation of pulses using both extracted features and spectrogram representations. Supervised approaches showed low inter-observer reliability and poor classification accuracy, indicating that pulse types were not discrete. We propose an updated pulse classification approach that is highly reproducible across observers and exhibits strong classification accuracy using support vector machines. Although the low number of call types suggests long calls are fairly simple, the continuous gradation of sounds seems to greatly boost the complexity of this system. This work responds to calls for more quantitative research to define call types and quantify gradedness in animal vocal systems and highlights the need for a more comprehensive framework for studying vocal complexity vis-à-vis graded repertoires.
Keywords: Acoustic communication; Affinity propagation; Fuzzy clustering; Graded signals; Machine learning; Supervised classification; Support vector machines; Uniform manifold approximation and projection (UMAP); Unsupervised clustering; Vocal repertoire.
©2024 Erb et al.
Conflict of interest statement
The authors declare there are no competing interests.
Figures








Similar articles
-
Validation of an acoustic location system to monitor Bornean orangutan (Pongo pygmaeus wurmbii) long calls.Am J Primatol. 2015 Jul;77(7):767-76. doi: 10.1002/ajp.22398. Epub 2015 Mar 16. Am J Primatol. 2015. PMID: 25773926
-
Characterizing Vocal Repertoires--Hard vs. Soft Classification Approaches.PLoS One. 2015 Apr 27;10(4):e0125785. doi: 10.1371/journal.pone.0125785. eCollection 2015. PLoS One. 2015. PMID: 25915039 Free PMC article.
-
Filling in the gaps: Acoustic gradation increases in the vocal ontogeny of chimpanzees (Pan troglodytes).Am J Primatol. 2021 May;83(5):e23249. doi: 10.1002/ajp.23249. Epub 2021 Apr 1. Am J Primatol. 2021. PMID: 33792937
-
Unsupervised discovery of family specific vocal usage in the Mongolian gerbil.Elife. 2024 Dec 16;12:RP89892. doi: 10.7554/eLife.89892. Elife. 2024. PMID: 39680425 Free PMC article.
-
Call type signals caller goal: a new take on ultimate and proximate influences in vocal production.Biol Rev Camb Philos Soc. 2018 Nov;93(4):2071-2082. doi: 10.1111/brv.12437. Epub 2018 Jun 12. Biol Rev Camb Philos Soc. 2018. PMID: 29896860 Review.
References
-
- Alloghani M, Al-Jumeily D, Mustafina J, Hussain A, Aljaaf AJ. Unsupervised and semi-supervised learning: supervised and unsupervised learning for data science. Springer; Cham: 2020. A systematic review on supervised and unsupervised machine learning algorithms for data science; pp. 3–21.
-
- Araya-Salas M, Smith-Vidaurre G. WarbleR: an r package to streamline analysis of animal acoustic signals. Methods in Ecology and Evolution. 2017;8(2):184–191. doi: 10.1111/2041-210X.12624. - DOI
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous