Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Aug;130(2):986-95.
doi: 10.1121/1.3605668.

Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms

Affiliations

Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms

Philipos C Loizou et al. J Acoust Soc Am. 2011 Aug.

Abstract

The conventional articulation index (AI) measure cannot be applied in situations where non-linear operations are involved and additive noise is present. This is because the definitions of the target and masker signals become vague following non-linear processing, as both the target and masker signals are affected. The aim of the present work is to modify the basic form of the AI measure to account for non-linear processing. This was done using a new definition of the output or effective SNR obtained following non-linear processing. The proposed output SNR definition for a specific band was designed to handle cases where the non-linear processing affects predominantly the target signal rather than the masker signal. The proposed measure also takes into consideration the fact that the input SNR in a specific band cannot be improved following any form of non-linear processing. Overall, the proposed measure quantifies the proportion of input band SNR preserved or transmitted in each band after non-linear processing. High correlation (r = 0.9) was obtained with the proposed measure when evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech corrupted in four different real-world maskers.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Signal-processing framework used in the present study for analyzing non-linear operations in the presence of noise. The dashed block shows the additional stage used in most noise-reduction applications to compute parameters such as band SNR, modulation rate, etc. These parameters are in turn used to construct a noise-suppressive gain function. The function f(.) represents generally the gain function used in noise-reduction or the non-linear function (e.g., compression function) used in hearing-aid applications.
Figure 2
Figure 2
(Color online) Histogram of band SNRs for corresponding bands in which S > S after noise- suppression. Band SNRs were determined for each time-frequency (T-F) unit, and accumulated over the duration of a sentence.
Figure 3
Figure 3
(Color online) Panel (a) shows the wideband spectrogram of the IEEE sentence “The young kid jumped the rusty gate.” in quiet, and panel (b) shows the sentence processed via a spectral- subtractive algorithm. The input sentence was originally corrupted by babble at 0 dB SNR. Panel (c) shows the corresponding short-term fAI values computed every 50 ms. The resulting average fAI value was 0.032.
Figure 4
Figure 4
Scatter plot of speech intelligibility scores and predicted fAI values for 72 noisy conditions involving noise-suppressed speech in four different masker conditions (babble, car, train and street interferences) and two SNR levels.
Figure 5
Figure 5
Scatter plot of observed intelligibility scores (expressed in percentage) and predicted scores for the 72 noisy conditions tested.

Similar articles

Cited by

References

    1. Amlani, A., Punch, J., and Ching, T. (2002). “Methods and applications of the audibility index in hearing aid selections and fitting,” Trends Ampl. 6, 81–129. 10.1177/108471380200600302 - DOI - PMC - PubMed
    1. ANSI S3.5-1997 (1997). “Methods for calculation of the speech intelligibility index,” (American National Standards Institute, NY).
    1. Bentler, R., Wu, Y., Kettel, J., and Hurtig, R. (2008). “Digital noise reduction: Outcomes from laboratory and field studies,” Intern. J. Audiology 47, 447–460. 10.1080/14992020802033091 - DOI - PubMed
    1. Boll, S. F. (1979). “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust. Speech Signal Proc., 27(2), 113–120. 10.1109/TASSP.1979.1163209 - DOI
    1. Chung, K. (2004). “Challenges and recent developments in hearing aids: Part, I. Speech understanding in noise, microphone technologies and noise reduction algorithms,” Trends. Amplif. 8, 83–124. 10.1177/108471380400800302 - DOI - PMC - PubMed

Publication types