Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(12):e28317.
doi: 10.1371/journal.pone.0028317. Epub 2011 Dec 14.

The anatomy of onomatopoeia

Affiliations

The anatomy of onomatopoeia

María Florencia Assaneo et al. PLoS One. 2011.

Abstract

Virtually every human faculty engage with imitation. One of the most natural and unexplored objects for the study of the mimetic elements in language is the onomatopoeia, as it implies an imitative-driven transformation of a sound of nature into a word. Notably, simple sounds are transformed into complex strings of vowels and consonants, making difficult to identify what is acoustically preserved in this operation. In this work we propose a definition for vocal imitation by which sounds are transformed into the speech elements that minimize their spectral difference within the constraints of the vocal system. In order to test this definition, we use a computational model that allows recovering anatomical features of the vocal system from experimental sound data. We explore the vocal configurations that best reproduce non-speech sounds, like striking blows on a door or the sharp sounds generated by pressing on light switches or computer mouse buttons. From the anatomical point of view, the configurations obtained are readily associated with co-articulated consonants, and we show perceptual evidence that these consonants are positively associated with the original sounds. Moreover, the pairs vowel-consonant that compose these co-articulations correspond to the most stable syllables found in the knock and click onomatopoeias across languages, suggesting a mechanism by which vocal imitation naturally embeds single sounds into more complex speech structures. Other mimetic forces received extensive attention by the scientific community, such as cross-modal associations between speech and visual categories. The present approach helps building a global view of the mimetic forces acting on language and opens a new venue for a quantitative study of word formation in terms of vocal imitation.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Sketch of the vocal model.
The figure in the middle represents the concatenation of tubes that approximate the vocal tract. The upper panel represents, from left to right, the voiced source spectrum of fundamental frequency formula image, the vocal tract transfer function for a tube of about 17.5 cm and the multiplication of both, corresponding to the resulting voiced sound. In the lower panel, a colored noise sound source characterizing the turbulent flow at the exit of the constriction at the section formula image of the vocal tract and the resulting fricative sound, filtered by the vocal tract.
Figure 2
Figure 2. Anatomy of vowels.
Each point in the graph corresponds to a vowel sample (formula image ms) taken from normal speech recordings of 20 Spanish speakers of different age and sex. We performed a Fast Fourier Transform to the time series to get the vowel spectrum and plot the first two formants formula image and formula image. The points naturally cluster into five groups, associated with the Spanish vowels formula image. The figures defined by the black lines are vocal tract shapes taken from a corpus of MRI-based anatomical data reported in . In each case, we selected from the corpus the vowels that were closer, from a phonetic point of view, to the Spanish vowels: formula image. MRI-based data consists of 44 area functions taken from equally spaced slices of vocal tract shapes formula image, formula image. The shapes drawn here correspond to the solid of revolution of radius formula image. On the other hand, the gray shapes are the reconstructed vocal tracts from our model (see Methods).
Figure 3
Figure 3. Co-articulated fricatives.
From top to bottom, reconstructed vocal tract configurations of co-articulated fricatives formula image, formula image, formula image, formula image and formula image (gray shapes) and their associated MRI vowel data (black contours). The obtained shapes are a combination of the preceding vowel and a constriction at the velar level (located around half the vocal tract length), indicated by the watermark. These vocal tract configurations along with the source parameters formula image are: formula image, formula image, formula image, formula image, formula image generate sounds having the spectra in black, to be compared with the experimental spectra, in gray.
Figure 4
Figure 4. Anatomy of onomatopoeias.
We compare sound time series, spectra and anatomy of the click (panel a) and knock (panel b) onomatopoeias and their corresponding sounds. As evident from the time series for the knock and click words (upper insets), the occlusive consonants formula image are naturally isolated from the rest of the speech sounds during the pronunciation of the onomatopoeias in normal speech. However, co-articulation strongly affects their spectral content (medium insets): the occlusive consonants formula image and formula image consist of superimposing a velar constriction on a vocal tract that globally resembles the vowels formula image in click and formula image in knock (lower insets). The figures to the right within the frame represent the best vocal tracts imitating the click and knock sounds as retrieved by our model, without anatomical restrictions. To the right, outside the frame, we show the area functions for the occlusive consonants formula image (black) and formula image (gray) for the click (dotted) and formula image (black) and formula image (light gray) for the knock (gray). In the bottom panel we show the first two components formula image of the PCA for the co-articulated consonants and best imitations: formula image; formula image; formula image; formula image; formula image; knock = formula image and click = formula image. The distances between the knock vocal tract and the coarticulated consonants are: formula image = 0.90; formula image = 0.82; formula image = 1.00; formula image = 0.26; formula image = 0.38. The distances between the click vocal tract and the coarticulated consonants are: formula image = 0.95; formula image = 0.26; formula image = 0.21; formula image = 1.07; formula image = 0.50.
Figure 5
Figure 5. Associations between co-articulated consonants, knocks and clicks.
We evaluate the similitude of formula image sounds with respect to the knock (solid line) and click (dotted line) sounds. Participants graded the audio files using a scale from 1 (poor or no association) to 10 (perfect identification). The left panel summarizes the responses of 20 participants to 7 synthetic sounds: the 5 co-articulated formula image, using the parameters of formula image (figure 3 and table 2) modulated by an experimental formula image envelope (see Methods). The other 2 sounds were generated using the best vocal tracts for the knock and click sounds, modulated by the same formula image envelope (points in light gray). The stronger associations with the click and knock sounds are formula image and formula image respectively. The best vocal tracts performed better than the consonants. In the right panel, we show the results of the experiment for 20 subjects using experimental isolated fricatives formula image. The trend is the same as before, but grades are systematically higher.

References

    1. de Saussure F. Course in general linguistics. New York City: Columbia University Press; 2011. 336
    1. Hashimoto T, Usui N, Taira M, Nose I, Haji T, et al. The neural mechanism associated with the processing of onomatopoeic sounds. Neuro Image. 2006;31:1762–70. - PubMed
    1. Rizzolatti G, Arbib MA. Language within our grasp. Trends in Neuroscience. 1998;21:188–194. - PubMed
    1. Stevens KN. Acoustic Phonetics. Massachussets: MIT Press, new ed edi edition; 2000. 617
    1. Sitt J, Amador a, Goller F, Mindlin G. Dynamical origin of spectrally rich vocalizations in birdsong. Physical Review E. 2008;78:1–6. - PubMed

Publication types