Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms

Philipos C Loizou¹, Jianfen Ma

PMID: 21877811
PMCID: PMC3190662
DOI: 10.1121/1.3605668

J Acoust Soc Am. 2011 Aug;130(2):986-95. doi: 10.1121/1.3605668.

Affiliation

¹ Department of Electrical Engineering, University of Texas at Dallas, Richardson, TX 75083-0688, USA. loizou@utdallas.edu

Abstract

The conventional articulation index (AI) measure cannot be applied in situations where non-linear operations are involved and additive noise is present. This is because the definitions of the target and masker signals become vague following non-linear processing, as both the target and masker signals are affected. The aim of the present work is to modify the basic form of the AI measure to account for non-linear processing. This was done using a new definition of the output or effective SNR obtained following non-linear processing. The proposed output SNR definition for a specific band was designed to handle cases where the non-linear processing affects predominantly the target signal rather than the masker signal. The proposed measure also takes into consideration the fact that the input SNR in a specific band cannot be improved following any form of non-linear processing. Overall, the proposed measure quantifies the proportion of input band SNR preserved or transmitted in each band after non-linear processing. High correlation (r = 0.9) was obtained with the proposed measure when evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech corrupted in four different real-world maskers.
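The idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's formula: the abstract does not give the output-SNR definition, so everything below is an assumption made for illustration. The sketch assumes the commonly used AI/SII convention of clipping band SNR to the range -15 to +15 dB and mapping it linearly to [0, 1]. It assumes per-band input and output SNRs are already available, and that the band weights are placeholder importance values. The names `band_audibility`, `preserved_fraction`, and `preserved_snr_index` are hypothetical.

```python
def band_audibility(snr_db, floor_db=-15.0, ceil_db=15.0):
    """Clip a band SNR (dB) to [floor_db, ceil_db] and map it linearly to [0, 1].

    The 30 dB range is the usual AI/SII convention, assumed here.
    """
    clipped = max(floor_db, min(ceil_db, snr_db))
    return (clipped - floor_db) / (ceil_db - floor_db)


def conventional_ai(band_snrs_db, weights):
    """Importance-weighted average of band audibility (basic AI form)."""
    total = sum(weights)
    return sum(w * band_audibility(s) for s, w in zip(band_snrs_db, weights)) / total


def preserved_fraction(snr_in_db, snr_out_db):
    """Hypothetical per-band term: share of the input audibility that survives.

    The output SNR is capped at the input SNR, reflecting the abstract's
    statement that processing cannot improve the input band SNR.
    """
    a_in = band_audibility(snr_in_db)
    if a_in == 0.0:
        return 0.0  # nothing audible at the input, so nothing to preserve
    a_out = band_audibility(min(snr_out_db, snr_in_db))
    return a_out / a_in


def preserved_snr_index(snr_in_db, snr_out_db, weights):
    """Illustrative index: weighted mean of the per-band preserved fractions."""
    total = sum(weights)
    terms = zip(snr_in_db, snr_out_db, weights)
    return sum(w * preserved_fraction(i, o) for i, o, w in terms) / total


if __name__ == "__main__":
    snr_in = [15.0, 0.0]    # made-up input band SNRs in dB
    snr_out = [0.0, 5.0]    # made-up output band SNRs after processing
    weights = [1.0, 1.0]    # placeholder band-importance weights
    print(preserved_snr_index(snr_in, snr_out, weights))
```

In this sketch a band whose output SNR exceeds its input SNR contributes at most 1, so apparent SNR gains from the suppression stage never raise the index above what the unprocessed noisy input would allow.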

Figures

**Figure 1**
Signal-processing framework used in the present study for analyzing non-linear operations in the presence of noise. The dashed block shows the additional stage used in most noise-reduction applications to compute parameters such as band SNR, modulation rate, etc. These parameters are in turn used to construct a noise-suppressive gain function. The function f(·) generally represents the gain function used in noise reduction or the non-linear function (e.g., a compression function) used in hearing-aid applications.

**Figure 2**
(Color online) Histogram of band SNRs for corresponding bands in which $\hat{S} > S$ after noise suppression. Band SNRs were determined for each time-frequency (T-F) unit and accumulated over the duration of a sentence.

**Figure 3**
(Color online) Panel (a) shows the wideband spectrogram of the IEEE sentence “The young kid jumped the rusty gate” in quiet, and panel (b) shows the sentence processed via a spectral-subtractive algorithm. The input sentence was originally corrupted by babble at 0 dB SNR. Panel (c) shows the corresponding short-term fAI values computed every 50 ms. The resulting average fAI value was 0.032.

**Figure 4**
Scatter plot of speech intelligibility scores and predicted fAI values for 72 noisy conditions involving noise-suppressed speech in four different masker conditions (babble, car, train and street interferences) and two SNR levels.

**Figure 5**
Scatter plot of observed intelligibility scores (expressed in percentage) and predicted scores for the 72 noisy conditions tested.

Cited by

En route to sound coding strategies for optical cochlear implants.
Khurana L, Harczos T, Moser T, Jablonski L. iScience. 2023 Aug 25;26(10):107725. doi: 10.1016/j.isci.2023.107725. eCollection 2023 Oct 20. PMID: 37720089. Review.

Grants and funding

R01 DC010494/DC/NIDCD NIH HHS/United States
