Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 17;21(6):e1013135.
doi: 10.1371/journal.pcbi.1013135. eCollection 2025 Jun.

The role of learned song in the evolution and speciation of Eastern and Spotted towhees

Affiliations

The role of learned song in the evolution and speciation of Eastern and Spotted towhees

Ximena León Du'Mottuchi et al. PLoS Comput Biol. .

Abstract

Oscine songbirds learn vocalizations that function in mate attraction and territory defense; sexual selection pressures on these learned songs could thus accelerate speciation. The Eastern and Spotted towhees are recently diverged sister species that now have partially overlapping ranges with evidence of some hybridization. Widespread community-science recordings of these species, including songs within their zone of overlap and from potential hybrids, enable us to investigate whether song differentiation might facilitate their reproductive isolation. Here, we quantify 16 song features to analyze geographic variation in Spotted and Eastern towhee songs and assess species-level differences. We then use several machine learning models to measure how accurately their songs can be classified by species. While no single song feature reliably distinguishes the two species, machine learning models classified songs with relatively high accuracy (random forest: 89.5%, deep learning: 90%, gradient boosting machine: 88%, convolutional neural network: 88%); interestingly, species classification was less accurate in their zone of overlap. Finally, our analysis of the limited publicly available genetic data from each species supports the hypothesis that the species are reproductively isolated. Together, our results suggest that small variations in multiple features may contribute to these sister species' ability to recognize their species-specific songs.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Example spectrograms of Spotted and Eastern towhee songs.
Examples of Spotted towhee songs (A-E) and Eastern towhee songs (F-J) from Macaulay Library (ML) and Xeno-canto (XC). The repeated syllable at the end of each song is often called the ‘trill’. Recording ID numbers: (A) ML191126; (B) ML90009051; (C) XC575461; (D) XC577591; (E) XC577593; (F) ML15276; (G) ML200973; (H) ML54283841; (I) ML76939751; (J) ML450438861; see metadata (https://github.com/CreanzaLab/TowheeAnalysis) for detailed recording information.
Fig 2
Fig 2. Spectrogram illustrating the definition of specific song elements.
On this spectrogram of an Eastern towhee song, we indicate a syllable, silence, bout, maximum syllable frequency, and minimum syllable frequency. The color of the syllable indicates its unique syllable type. Spectrogram generated from Macaulay Library recording 52506381, https://macaulaylibrary.org/asset/52506381.
Fig 3
Fig 3. Results of a Linear Discriminant Analysis trained on towhee song data.
Plot of LD1 results of a subset of towhee song bouts (Ntest = 697) using a Linear Discriminant Analysis trained on raw song feature data from a balanced training set of Spotted and Eastern towhee bouts (Ntrain = 1592). The model revealed 86.8% prediction accuracy (balanced accuracy = 86.9%; Cohen’s κ = 0.73). Points are jittered vertically for visualization.
Fig 4
Fig 4. Spatial distribution of Spotted towhee and Eastern towhee song data and genetic data.
Song data is shown in panels A–C (Ntotal_recordings = 2785; NSpotted_towhee = 1067; NEastern_towhee = 1718) and genetic data is shown in panels D–F (Ntotal = 23; NSpotted_towhee = 18; NEastern_towhee = 5). (A) Distribution of song recordings in North America. (B) Principal component analysis (PCA) of song bouts using 16 song features. (C) Procrustes analysis of song data using PC1 and PC2 from the PCA in panel B. (D) Distribution of genetic sequences obtained from the Barcode of Life Data Systems database and NCBI. (E) PCA using single nucleotide polymorphisms of aligned sequences of the cytochrome oxidase subunit I regions of the mitochondrial genome. (F) Procrustes analysis of genetic data using PC1 and PC2 from the PCA analysis in panel E. Ellipses indicate 95% confidence intervals. Base maps (panels A, C, D, and F) were made with Natural Earth (http://www.naturalearthdata.com/).
Fig 5
Fig 5. UMAP projection of Eastern and Spotted towhee song-feature data.
Each point represents an analyzed song bout (Ntotal_bouts = 2785; NSpotted_towhee = 1067; NEastern_towhee = 1718), with Eastern towhee songs shown in shades of red and Spotted towhee songs in shades of blue. The lighter colors represent recordings from the zone of species overlap. Black dots indicate the 27 recordings from individuals that were classified as potential hybrids (“hybrid/unsure”).
Fig 6
Fig 6. Geographic distribution of random forest model predictions of Spotted and Eastern towhee species identity.
Predictions were based on a model trained on 16 song features from samples of Spotted towhees and Eastern towhees. (A) We trained a model on song data from the entire geographic range of both species (NSpotted_towhee = 796; NEastern_towhee = 796) and tested how well it predicted the species identification of a subset of all song samples (Ntest = 697; accuracy = 89.5%). (B) The same results from (A) are plotted based on the percent of trees in the random forest classifier that supported the correct species identification; when less than 50% of trees supported the correct identification, the classifier made an incorrect species prediction. The average confidence in species classifications tended to decrease toward the zone of overlap for both species. (C) We then trained a second model on a subset of samples obtained only from the non-overlap zone (NSpotted_towhee = 796; NEastern_towhee = 796) and tested it on a random subsample of song bouts from both the zone of non-overlap (Ntest_nonoverlap = 216; accuracy = 93.5%) and the zone of overlap (i.e., 102°W - 91°W; Ntest_overlap = 216; accuracy = 84.3%). (D) The same results from (C) are plotted based on the percent of trees in the random forest classifier that supported the correct species identification; again, the average confidence of species identifications tended to decrease toward the zone of overlap for both species. (E) We used the same model from panels C and D to predict species identity of song bouts from recordings of “hybrid/unsure” towhees (Npredict = 27). The model predicted that 16 of these “hybrid/unsure” recordings were Spotted towhees and 11 were Eastern towhees, with no discernable longitudinal gradient in the predictions. The dotted line represents the zone of overlap determined by the co-occurence of Eastern towhee and Spotted towhee song recordings (102°W - 91°W). Base maps (panels A, C, and E) were made with Natural Earth (http://www.naturalearthdata.com/).

Similar articles

References

    1. Catchpole CK, Slater PJB. Bird song: biological themes and variations. Cambridge University Press. 1995.
    1. Searfoss AM, Liu WC, Creanza N. Geographically well-distributed citizen science data reveals range-wide variation in the chipping sparrow’s simple song. Anim Behav. 2020;161:63–76.
    1. Podos J, Warren PS. The Evolution of Geographic Variation in Birdsong. Advances in the Study of Behavior. Academic Press. 2007. p. 403–58.
    1. Rivera‐Gutierrez HF, Matthysen E, Adriaensen F, Slabbekoorn H. Repertoire sharing and song similarity between great tit males decline with distance between forest fragments. Ethology. 2010;116(10):951–60. doi: 10.1111/j.1439-0310.2010.01813.x - DOI
    1. Price T. Speciation in Birds. 1st ed. W. H. Freeman. 2007.

LinkOut - more resources