Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov;126(5):EL160-5.
doi: 10.1121/1.3213452.

Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation

Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation

Vikram Ramanarayanan et al. J Acoust Soc Am. 2009 Nov.

Abstract

It is hypothesized that pauses at major syntactic boundaries (i.e., grammatical pauses), but not ungrammatical (e.g., word search) pauses, are planned by a high-level cognitive mechanism that also controls the rate of articulation around these junctures. Real-time magnetic resonance imaging is used to analyze articulation at and around grammatical and ungrammatical pauses in spontaneous speech. Measures quantifying the speed of articulators were developed and applied during these pauses as well as during their immediate neighborhoods. Grammatical pauses were found to have an appreciable drop in speed at the pause itself as compared to ungrammatical pauses, which is consistent with our hypothesis that grammatical pauses are indeed choreographed by a central cognitive planner.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An illustration of the gradient energy calculation process: first, contour outlines are obtained from the MRI images in panel A and are then converted to binary masks (panel B); these are then used to compute the “gradient” images (panel C), the energy of which is then calculated by a simple addition operation (of all white pixels).
Figure 2
Figure 2
Pause length distributions for grammatical and ungrammatical pauses.
Figure 3
Figure 3
Time-normalized average gradient frame energies (in squared pixels) of grammatical and ungrammatical pauses and their neighborhoods pooled across all seven speakers (standard deviation bars are plotted on top of each energy bar in a lighter color). Corresponding average local phone rates (phones/s) are also shown to the right of the gradient frame energy panel. Each panel consists of two pause groups on the x-axis: (1) grammatical and (2) ungrammatical. Group 1 consists of, in order, bars for two neighborhoods immediately before the grammatical pause (∼250 ms), followed by one bar for the pause itself (not shown for phone rate graph), followed by two bars for the neighborhoods following the pause (∼250 ms); this set of five bars is followed by a parallel sequence of five bars for the ungrammatical pauses (Group 2).
Figure 4
Figure 4
A schematic depicting the levels (grammatical and ungrammatical) and sites (prepause, pause, and postpause) at which the ANOVA statistical analyses were performed.

References

    1. Bresch, E. and Narayanan, S. (2009). “Region segmentation in the frequency domain applied to upper airway real-time magnetic resonance images,” IEEE Trans. Med. Imaging ITMID4 28, 323–338.10.1109/TMI.2008.928920 - DOI - PMC - PubMed
    1. Bresch, E., Nielsen, J., Nayak, K., and Narayanan, S. (2006). “Synchronized and noise-robust audio recordings during realtime MRI scans,” J. Acoust. Soc. Am. JASMAN 120, 1791–1794.10.1121/1.2335423 - DOI - PMC - PubMed
    1. Browman, C. P. and Goldstein, L. (1992). “Articulatory phonology: An overview,” Phonetica PHNTAW 49, 155–180. - PubMed
    1. Browman, C. P. and Goldstein, L. (1995). “Dynamics and articulatory phonology,” in Mind as Motion: Dynamics, Behavior, and Cognition, edited by Port R. and van Gelder T. (MIT, Cambridge, MA: ), pp. 175–193.
    1. Byrd, D. and Saltzman, E. (2003). “The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening,” J. Phonetics JPHNB9 31, 149–180.10.1016/S0095-4470(02)00085-2 - DOI

Publication types