Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun;149(6):4013.
doi: 10.1121/10.0005191.

Face masks and speaking style affect audio-visual word recognition and memory of native and non-native speech

Affiliations

Face masks and speaking style affect audio-visual word recognition and memory of native and non-native speech

Rajka Smiljanic et al. J Acoust Soc Am. 2021 Jun.

Abstract

Though necessary, protective mask wearing in response to the COVID-19 pandemic presents communication challenges. The present study examines how signal degradation and loss of visual information due to masks affects intelligibility and memory for native and non-native speech. We also test whether clear speech can alleviate perceptual difficulty for masked speech. One native and one non-native speaker of English recorded video clips in conversational speech without a mask and conversational and clear speech with a mask. Native English listeners watched video clips presented in quiet or mixed with competing speech. The results showed that word recognition and recall of speech produced with a mask can be as accurate as without a mask in optimal listening conditions. Masks affected non-native speech processing at easier noise levels than native speech. Clear speech with a mask significantly improved accuracy in all listening conditions. Speaking clearly, reducing noise, and using surgical masks as well as good signal amplification can help compensate for the loss of intelligibility due to background noise, lack of visual cues, physical distancing, or non-native speech. The findings have implications for communication in classrooms and hospitals where listeners interact with teachers and healthcare providers, oftentimes non-native speakers, through their protective barriers.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
(Color online) Experiment overview. Each participant heard one of the talkers (native talker or non-native) and one noise condition (quiet, +5 dB SNR, and 0 dB SNR; an additional −5 dB SNR was used for the native talker only) (A). Each listener saw video clips from all three speaking style and mask conditions: conversational speech no mask; conversational speech mask, clear speech mask. Speaking style and mask were counterbalanced across listeners (B). Immediately after each audio-video clip, listeners were instructed to type what they heard to test intelligibility. At the end of each block, they were asked to answer content-related questions to test recall.
FIG. 2.
FIG. 2.
Intelligibility (top panels) and recall (bottom panels) for the native talker in quiet (n = 32), in 6T +5 dB SNR (n = 40), in 6T 0 dB SNR (n = 38), and in 6T −5 dB SNR (n = 32). ***, p < 0.001; **, p < 0.01; *, p < 0.05; ns, no statistical difference (mixed-effect logistic regression simple effects analysis).
FIG. 3.
FIG. 3.
Intelligibility (top panels) and recall (bottom panels) for the non-native talker in quiet (n = 45), in 6T +5 dB SNR (n = 32), and in 6T 0 dB SNR (n = 32). ***, p < 0.001; **, p < 0.01; *, p < 0.05; ns, no statistical difference (mixed-effect logistic regression simple effects analysis).

Similar articles

Cited by

References

    1. Anderson‐Hsieh, J. , Johnson, R. , and Koehler, K. (1992). “ The relationship between native speaker judgments of nonnative pronunciation and deviance in segmentais, prosody, and syllable structure,” Lang. Learn. 42, 529–555.10.1111/j.1467-1770.1992.tb01043.x - DOI
    1. Anwyl-Irvine, A. L. , Massonnié, J. , Flitton, A. , Kirkham, N. , and Evershed, J. K. (2020). “ Gorilla in our midst: An online behavioral experiment builder,” Behav. Res. Methods 52(1), 388–407.10.3758/s13428-019-01237-x - DOI - PMC - PubMed
    1. Baese-Berk, M. M. , and Bradlow, A. R. (2021). “ Variability in speaking rate of native and nonnative speech,” in Second Language Speech Learning: Theoretical and Empirical Progress, edited by Wayland R. ( Cambridge University, Cambridge, UK: ), pp. 312–334.
    1. Bates, D. , Mächler, M. , Bolker, B. M. , and Walker, S. C. (2015). “ Fitting linear mixed-effects models using lme4,” J. Stat. Softw. 67(1), 1–48.10.18637/jss.v067.i01 - DOI
    1. Best, C. T. , and Tyler, M. D. (2007). “ Nonnative and second-language speech perception: Commonalities and complementarities,” in Language Experience in Second Language Speech Learning: In Honor of James Emil Flege ( John Benjamins, Amsterdam: ), pp. 1–47.