Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2025 Jan-Dec:29:23312165251324266.
doi: 10.1177/23312165251324266. Epub 2025 Mar 25.

Understanding the Lombard Effect for Mandarin: Relation Between Speech Recognition Thresholds and Acoustic Parameters

Affiliations
Comparative Study

Understanding the Lombard Effect for Mandarin: Relation Between Speech Recognition Thresholds and Acoustic Parameters

Fei Chen et al. Trends Hear. 2025 Jan-Dec.

Abstract

The present work quantifies the Lombard effect across native speakers of Mandarin Chinese using the Matrix sentence test, which is optimized for precisely assessing speech recognition thresholds (SRTs) in noise. Specifically, we studied the effects of speaker gender, fundamental frequency (F0), formant frequencies (F1 and F2), the duration and rate of voiced segments, and frequency-specific energy redistribution characterized by alpha ratio and speech-weighted signal-to-noise ratio (swSNR) on the recognition of Mandarin in plain and Lombard speech. The Mandarin Chinese matrix test was recorded with plain and Lombard speech from 11 native-Mandarin speakers. SRTs in stationary noise were measured with native-Mandarin, normal-hearing listeners. Results showed that on average, Mandarin Lombard speech was more intelligible than Mandarin plain speech for both female and male speakers, and the Mandarin Lombard gain of female speakers was larger than that of males. In addition, various acoustic analyses involving all speakers showed that (a) only swSNR was significantly correlated with the SRT of the Mandarin plain speech; (b) most acoustic measures were significantly correlated with the SRT of the Mandarin Lombard speech; and (c) alpha ratio and swSNR were significantly correlated with the SRT Lombard gain. In addition, a gender effect was found in the correlational analysis between acoustic parameters and SRT as well as Lombard gain in SRT. The findings highlight the impact of increased high-frequency energy on the observed Lombard gain in Mandarin speech, whereas the changes in individual acoustic parameters (e.g., F0 and F1) appear to play only a minor role.

Keywords: Lombard effect; SRT Lombard gain; alpha ratio; speech recognition threshold; speech-weighted signal-to-noise ratio (swSNR).

PubMed Disclaimer

Conflict of interest statement

Conflicting InterestsThe authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
(a) Speech recognition thresholds (SRTs) averaged across speakers and listeners under all listening conditions. Asterisks denote a significant (P < 0.05) difference between the SRTs of the paired conditions. (b) Empirical Lombard gains for female and male speakers across all listeners. The error bars denote ±1 standard error of the mean.
Figure 2.
Figure 2.
SRTs with corresponding standard errors of the Mandarin plain and Lombard speech of all speakers. Asterisks denote that the SRTs difference between plain and Lombard speech are significant (P < 0.0045) or non-significant (P > 0.0045), respectively. ‘SF’ and ‘SM’ denote female and male speakers, respectively.
Figure 3.
Figure 3.
Mean values of acoustic parameters for Mandarin plain and Lombard speech of different speaker groups. Panels (a), (b), (c), (d), (e), (f) and (g) are for alpha ratio, swSNR, F0, F1, F2, duration of voiced segments, and rate of voiced segments, respectively. Asterisks denote a significant (P < 0.05) difference between the SRTs of the paired conditions. The error bars denote ±1 standard error of the mean. ‘all’, ‘male’ and ‘female’ denote the comparisons involving all speakers, male speakers, and female speakers, respectively.
Figure 4.
Figure 4.
Scatter plots between SRT and swSNR for the plain speech of (a) all speakers, (b) male speakers, and (c) female speakers, and for the Lombard speech of (d) all speakers, (e) male speakers, and (f) female speakers. The lowest row depicts scatter plots between empirical Lombard gain in SRT and Lombard gain in swSNR for (g) all speakers, (h) male speakers, and (i) female speakers.

Similar articles

References

    1. Akeroyd M. A., Arlinger S., Bentler R. A., Boothroyd A., Dillier N., Dreschler W. A., Gagné J., Lutman M., Wouters J., Wong L., Kollmeier B. (2015). International collegium of rehabilitative audiology (ICRA) recommendations for the construction of multilingual speech tests. International Journal of Audiology, 54(Suppl. 2), 17–22. 10.3109/14992027.2015.1030513 - DOI - PubMed
    1. Alghamdi N., Maddock S., Marxer R., Barker J., Brown G. J. (2018). A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 143(6), EL523–EL529. 10.1121/1.5042758 - DOI - PubMed
    1. ANSI. (1997). American National Standard Methods for the calculation of the speech intelligibility index. American National Standards Institute.
    1. Beutelmann R., Brand T., Kollmeier B. (2010). Revision, extension, and evaluation of a binaural speech intelligibility model. Journal of the Acoustical Society of America, 127(4), 2479–2497. 10.1121/1.3295575 - DOI - PubMed
    1. Bosker H. R., Cooke M. (2020). Enhanced amplitude modulations contribute to the Lombard intelligibility benefit: Evidence from the Nijmegen corpus of Lombard speech. Journal of the Acoustical Society of America, 147(2), 721–730. 10.1121/10.0000646 - DOI - PubMed

Publication types

LinkOut - more resources