Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar/Apr;43(2):554-562.
doi: 10.1097/AUD.0000000000001114.

The Recognition of Whispered Speech in Real-Time

Affiliations

The Recognition of Whispered Speech in Real-Time

Kristi Hendrickson et al. Ear Hear. 2022 Mar/Apr.

Abstract

Objectives: Whispered speech offers a unique set of challenges to speech perception and word recognition. The goals of the present study were twofold: First, to determine how listeners recognize whispered speech. Second, to inform major theories of spoken word recognition by considering how recognition changes when major cues to phoneme identity are reduced or largely absent compared with normal voiced speech.

Design: Using eye tracking in the Visual World Paradigm, we examined how listeners recognize whispered speech. After hearing a target word (normal or whispered), participants selected the corresponding image from a display of four-a target (e.g., money), a word that shares sounds with the target at the beginning (cohort competitor, e.g., mother), a word that shares sounds with the target at the end (rhyme competitor, e.g., honey), and a phonologically unrelated word (e.g., whistle). Eye movements to each object were monitored to measure (1) how fast listeners process whispered speech, and (2) how strongly they consider lexical competitors (cohorts and rhymes) as the speech signal unfolds.

Results: Listeners were slower to recognize whispered words. Compared with normal speech, listeners displayed slower reaction times to click the target image, were slower to fixate the target, and fixated the target less overall. Further, we found clear evidence that the dynamics of lexical competition are altered during whispered speech recognition. Relative to normal speech, words that overlapped with the target at the beginning (cohorts) displayed slower, reduced, and delayed activation, whereas words that overlapped with the target at the end (rhymes) exhibited faster, more robust, and longer lasting activation.

Conclusion: When listeners are confronted with whispered speech, they engage in a "wait-and-see" approach. Listeners delay lexical access, and by the time they begin to consider what word they are hearing, the beginning of the word has largely come and gone, and activation for cohorts is reduced. However, delays in lexical access actually increase consideration of rhyme competitors; the delay pushes lexical activation to a point later in processing, and the recognition system puts more weight on the word-final overlap between the target and the rhyme.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflicts of interest to disclose.

References

    1. Abramson A. S. Tonal experiments with whispered Thai. (1972). Papers in Linguistics and Phonetics to the Memory of Pierre Delattre. (pp. 31–44).
    1. Allen J., Haggard M. Perception of voicing and place features in whispered speech: A dichotic choice analysis. Percep Psychophys, (1977). 21, 315–322.
    1. Allopenna P., Magnuson J., Tanenhaus M. Tracking the time course of spoken word recognition using eye movements: Evidence for continuous mapping models. J Mem Lang, (1998). 38, 419–439.
    1. Apfelbaum K. S., Blumstein S. E., McMurray B. Semantic priming is affected by real-time phonological competition: Evidence for continuous cascading systems. Psychon Bull Rev, (2011). 18, 141–149.
    1. Ben-David B. M., Chambers C. G., Daneman M., Pichora-Fuller M. K., Reingold E. M., Schneider B. A. Effects of aging and noise on real-time spoken word recognition: Evidence from eye movements. J Speech Lang Hear Res, (2011). 54, 243–262.

LinkOut - more resources