Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms
- PMID: 21877811
- PMCID: PMC3190662
- DOI: 10.1121/1.3605668
Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms
Abstract
The conventional articulation index (AI) measure cannot be applied in situations where non-linear operations are involved and additive noise is present. This is because the definitions of the target and masker signals become vague following non-linear processing, as both the target and masker signals are affected. The aim of the present work is to modify the basic form of the AI measure to account for non-linear processing. This was done using a new definition of the output or effective SNR obtained following non-linear processing. The proposed output SNR definition for a specific band was designed to handle cases where the non-linear processing affects predominantly the target signal rather than the masker signal. The proposed measure also takes into consideration the fact that the input SNR in a specific band cannot be improved following any form of non-linear processing. Overall, the proposed measure quantifies the proportion of input band SNR preserved or transmitted in each band after non-linear processing. High correlation (r = 0.9) was obtained with the proposed measure when evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech corrupted in four different real-world maskers.
Figures





Similar articles
-
Modifying the normalized covariance metric measure to account for nonlinear distortions introduced by noise-reduction algorithms.J Acoust Soc Am. 2013 May;133(5):EL405-11. doi: 10.1121/1.4800189. J Acoust Soc Am. 2013. PMID: 23656101
-
Analysis of a simplified normalized covariance measure based on binary weighting functions for predicting the intelligibility of noise-suppressed speech.J Acoust Soc Am. 2010 Dec;128(6):3715-23. doi: 10.1121/1.3502473. J Acoust Soc Am. 2010. PMID: 21218903 Free PMC article.
-
Gain-induced speech distortions and the absence of intelligibility benefit with existing noise-reduction algorithms.J Acoust Soc Am. 2011 Sep;130(3):1581-96. doi: 10.1121/1.3619790. J Acoust Soc Am. 2011. PMID: 21895096 Free PMC article.
-
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502. J Acoust Soc Am. 2011. PMID: 21895088
-
Spectro-temporal modulation glimpsing for speech intelligibility prediction.Hear Res. 2022 Dec;426:108620. doi: 10.1016/j.heares.2022.108620. Epub 2022 Sep 21. Hear Res. 2022. PMID: 36175300 Free PMC article. Review.
Cited by
-
En route to sound coding strategies for optical cochlear implants.iScience. 2023 Aug 25;26(10):107725. doi: 10.1016/j.isci.2023.107725. eCollection 2023 Oct 20. iScience. 2023. PMID: 37720089 Free PMC article. Review.
References
-
- ANSI S3.5-1997 (1997). “Methods for calculation of the speech intelligibility index,” (American National Standards Institute, NY).
-
- Boll, S. F. (1979). “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust. Speech Signal Proc., 27(2), 113–120. 10.1109/TASSP.1979.1163209 - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical