Review

. 2020 Feb;43(2):115-126.

doi: 10.1016/j.tins.2019.12.006. Epub 2020 Jan 16.

A Hierarchy of Autonomous Systems for Vocal Production

Yisi S Zhang¹, Asif A Ghazanfar²

Affiliations

¹ Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA. Electronic address: yz9@princeton.edu.
² Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA; Department of Psychology, Princeton University, Princeton, NJ 08544, USA; Department of Ecology & Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA. Electronic address: asifg@princeton.edu.

PMID: 31955902
PMCID: PMC7213988
DOI: 10.1016/j.tins.2019.12.006

Review

A Hierarchy of Autonomous Systems for Vocal Production

Yisi S Zhang et al. Trends Neurosci. 2020 Feb.

. 2020 Feb;43(2):115-126.

doi: 10.1016/j.tins.2019.12.006. Epub 2020 Jan 16.

Authors

Yisi S Zhang¹, Asif A Ghazanfar²

Affiliations

¹ Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA. Electronic address: yz9@princeton.edu.
² Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA; Department of Psychology, Princeton University, Princeton, NJ 08544, USA; Department of Ecology & Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA. Electronic address: asifg@princeton.edu.

PMID: 31955902
PMCID: PMC7213988
DOI: 10.1016/j.tins.2019.12.006

Abstract

Vocal production is hierarchical in the time domain. These hierarchies build upon biomechanical and neural dynamics across various timescales. We review studies in marmoset monkeys, songbirds, and other vertebrates. To organize these data in an accessible and across-species framework, we interpret the different timescales of vocal production as belonging to different levels of an autonomous systems hierarchy. The first level accounts for vocal acoustics produced on short timescales; subsequent levels account for longer timescales of vocal output. The hierarchy of autonomous systems that we put forth accounts for vocal patterning, sequence generation, dyadic interactions, and context dependence by sequentially incorporating central pattern generators, intrinsic drives, and sensory signals from the environment. We then show the framework's utility by providing an integrative explanation of infant vocal production learning in which social feedback modulates infant vocal acoustics through the tuning of a drive signal.

Keywords: Mayer wave; biomechanics; birdsong; dynamical system; marmoset monkey; parental feedback; slow oscillations; social reinforcement; timescales; vocalizations.

PubMed Disclaimer

Conflict of interest statement

Competing Interests

We have no competing interests.

Figures

**Figure 1.**
Temporal hierarchies of vocalizations, and the corresponding brain structure and vocal apparatus. (A) A segment of infant marmoset monkey vocalization comprised of utterances of different types of calls (contributed by Ghazanfar lab). (B) A segment of Bengalese finch song comprised of fixed and variable syllable sequences (contributed by Yisi Zhang). (C) A segment of midshipman fish grunts (courtesy of Dr. Andrew Bass). (D) Vocal communication system and vocal apparatus of nonhuman primates. Abbreviations: ACC, anterior cingulate cortex; AC, auditory cortex; Am, amygdala; Hy, hypothalamus; PAG, periaqueductal gray; LRF, lateral reticular formation; PB, parabrachial nucleus; MN, motor nuclei; VC, visual cortex (adapted from [125] and [126]). (E) Song system and vocal apparatus of songbirds. Abbreviations: HVC (as a proper name); RA, robust nucleus of the arcopallium; NIf, nucleus interfacialis of the nidopallium; LMAN, lateral magnocellular nucleus of the anterior nidopallium; Area X, striato-palliadal basal ganglia nucleus; DLM, dorsolateral thalamic nucleus; Uva, nucleus uvaeformis; nXIIts, tracheosyringeal part of the hypoglossal motor nucleus (adapted from [127]). (F) Vocal network and vocal apparatus of fish. Abbreviations: POA, preoptic area; AT, anterior tuberal nucleus; VT, ventral tuberal nucleus; PL, paralemniscal midbrain tegmentum; TS, torus semicircularis; VPP, vocal prepacemaker nucleus; VPN, vocal pacemaker nucleus; VMN vocal motor nucleus (adapted from [128]). Solid lines indicate vocal pathways and dotted lines indicate sensory pathways.

**Figure 2.**
A hierarchical structure of the vocal production system based on time-scales. (A) Two-level system: the vocal biomechanics (lungs and larynx) and two coupled central pattern generators (CPGs). (B) Three-level system: adding a drive signal on top of the CPGs enables the continuous production of various calls. (C) Four-level system: a fourth layer provides the modulation to the drive and allows animals to adjust the vocal output with respect to the environment.

**Figure 3.**
Vocal development through social feedback. Here we illustrate vocal development through marmoset monkey parent-infant vocal interaction. The parent responds at the transitions where the infant starts producing more mature-sounding vocalizations [117]. As more mature-sounding vocalizations indicate a greater underlying drive signal [53], parental responses occur in the rising phase of the oscillatory drive. The contingent parental calls have a cumulative effect on the infant vocal production towards more high-energy calls, accelerating vocal development on the timescale of days [116]. This process can be a consequence of shaping the drive signal of infant vocal production. The social feedback process described here compactly illustrates its cumulative influence on cumulative changes in the drive signal.

See this image and copyright information in PMC

Cited by

Evolutionary continuity and divergence of auditory dorsal and ventral pathways in primates revealed by ultra-high field diffusion MRI.
Zhang Y, Shen SX, Bibic A, Wang X. Zhang Y, et al. Proc Natl Acad Sci U S A. 2024 Feb 27;121(9):e2313831121. doi: 10.1073/pnas.2313831121. Epub 2024 Feb 20. Proc Natl Acad Sci U S A. 2024. PMID: 38377216 Free PMC article.
Echolocation-related reversal of information flow in a cortical vocalization network.
García-Rosales F, López-Jury L, González-Palomares E, Wetekam J, Cabral-Calderín Y, Kiai A, Kössl M, Hechavarría JC. García-Rosales F, et al. Nat Commun. 2022 Jun 25;13(1):3642. doi: 10.1038/s41467-022-31230-6. Nat Commun. 2022. PMID: 35752629 Free PMC article.
Generative vocal plasticity in chimpanzees.
Lameira AR, Caneco B, Kershenbaum A, Santamaría-Bonfil G, Call J. Lameira AR, et al. iScience. 2025 Apr 8;28(5):112381. doi: 10.1016/j.isci.2025.112381. eCollection 2025 May 16. iScience. 2025. PMID: 40322082 Free PMC article.
Arousal elevation drives the development of oscillatory vocal output.
Zhang YS, Alvarez JL, Ghazanfar AA. Zhang YS, et al. J Neurophysiol. 2022 Jun 1;127(6):1519-1531. doi: 10.1152/jn.00007.2022. Epub 2022 Apr 27. J Neurophysiol. 2022. PMID: 35475704 Free PMC article.
A novel reticular node in the brainstem synchronizes neonatal mouse crying with breathing.
Wei XP, Collie M, Dempsey B, Fortin G, Yackle K. Wei XP, et al. Neuron. 2022 Feb 16;110(4):644-657.e6. doi: 10.1016/j.neuron.2021.12.014. Epub 2022 Jan 7. Neuron. 2022. PMID: 34998469 Free PMC article.

See all "Cited by" articles

References

1. Bass AH (2014) Central pattern generator for vocalization: is there a vertebrate morphotype? Current opinion In neurobiology 28, 94–100 - PMC - PubMed
1. Hage SR and Nieder A (2016) Dual Neural Network Model for the Evolution of Speech and Language. Trends in Neurosciences 39, 813–829 - PubMed
1. Kiebel SJ, et al. (2008) A Hierarchy of Time-Scales and the Brain. Plos Comput Biol 4 - PMC - PubMed
1. Flack JC, et al. (2013) Timescales, symmetry, and uncertainty reduction in the origins of hierarchy in biological systems. Evolution cooperation and complexity, 45–74
1. Deng L, et al. (2006) Structured speech modeling. IEEE Transactions on Audio, Speech, and Language Processing 14, 1492–1504

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

R01 NS054898/NS/NINDS NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Hierarchy of Autonomous Systems for Vocal Production

Affiliations

A Hierarchy of Autonomous Systems for Vocal Production

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources