Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jul;134(1):510-9.
doi: 10.1121/1.4807639.

An investigation of articulatory setting using real-time magnetic resonance imaging

Affiliations

An investigation of articulatory setting using real-time magnetic resonance imaging

Vikram Ramanarayanan et al. J Acoust Soc Am. 2013 Jul.

Abstract

This paper presents an automatic procedure to analyze articulatory setting in speech production using real-time magnetic resonance imaging of the moving human vocal tract. The procedure extracts frames corresponding to inter-speech pauses, speech-ready intervals and absolute rest intervals from magnetic resonance imaging sequences of read and spontaneous speech elicited from five healthy speakers of American English and uses automatically extracted image features to quantify vocal tract posture during these intervals. Statistical analyses show significant differences between vocal tract postures adopted during inter-speech pauses and those at absolute rest before speech; the latter also exhibits a greater variability in the adopted postures. In addition, the articulatory settings adopted during inter-speech pauses in read and spontaneous speech are distinct. The results suggest that adopted vocal tract postures differ on average during rest positions, ready positions and inter-speech pauses, and might, in that order, involve an increasing degree of active control by the cognitive speech planning mechanism.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(Color online) (a) Contour outlines extracted for each image of the vocal tract. Note the template definition such that each articulator is described by a separate contour. (b) A schematic depicting the concept of vocal tract area descriptors or VTADs [adapted from Bresch and Narayanan (2009)]. These VTADs are bounded by cross-distances (depicted by white lines), and are, in order, from lips to glottis: lip aperture, tongue tip constriction degree, tongue dorsum constriction degree, velic aperture, tongue root constriction degree, and the epiglottal-pharyngeal wall cross-distance, respectively.
Figure 2
Figure 2
(Color online) Schematic showing how the jaw angle (denoted by α) is computed.
Figure 3
Figure 3
(Color online) Mean vocal tract images for all speakers calculated on all frames corresponding to different positions in the utterance and speaking style.

References

    1. Bockman, S. (1989). “ Generalizing the formula for areas of polygons to moments,” Am. Math. Monthly 96, 131–132.10.2307/2323197 - DOI
    1. Bresch, E., and Narayanan, S. (2009). “ Region segmentation in the frequency domain applied to upper airway real-time magnetic resonance images,” IEEE Trans. Med. Imaging 28, 323–338.10.1109/TMI.2008.928920 - DOI - PMC - PubMed
    1. Bresch, E., Nielsen, J., Nayak, K., and Narayanan, S. (2006). “ Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans,” J. Acoust. Soc. Am. 120, 1791–1794.10.1121/1.2335423 - DOI - PMC - PubMed
    1. Byrd, D., and Saltzman, E. (2003). “ The elastic phrase: modeling the dynamics of boundary-adjacent lengthening,” J. Phonetics 31, 149–180.10.1016/S0095-4470(02)00085-2 - DOI
    1. Esling, J., and Wong, R. (1983). “ Voice quality settings and the teaching of pronunciation,” TESOL Q. 17, 89–95.10.2307/3586426 - DOI

Publication types