Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 1;156(6):3862-3876.
doi: 10.1121/10.0034545.

Impact of reduced spectral resolution on temporal-coherence-based source segregation

Affiliations

Impact of reduced spectral resolution on temporal-coherence-based source segregation

Vibha Viswanathan et al. J Acoust Soc Am. .

Abstract

Hearing-impaired listeners struggle to understand speech in noise, even when using cochlear implants (CIs) or hearing aids. Successful listening in noisy environments depends on the brain's ability to organize a mixture of sound sources into distinct perceptual streams (i.e., source segregation). In normal-hearing listeners, temporal coherence of sound fluctuations across frequency channels supports this process by promoting grouping of elements belonging to a single acoustic source. We hypothesized that reduced spectral resolution-a hallmark of both electric/CI (from current spread) and acoustic (from broadened tuning) hearing with sensorineural hearing loss-degrades segregation based on temporal coherence. This is because reduced frequency resolution decreases the likelihood that a single sound source dominates the activity driving any specific channel; concomitantly, it increases the correlation in activity across channels. Consistent with our hypothesis, our physiologically inspired computational model of temporal-coherence-based segregation predicts that CI current spread reduces comodulation masking release (CMR; a correlate of temporal-coherence processing) and speech intelligibility in noise. These predictions are consistent with our online behavioral data with simulated CI listening. Our model also predicts smaller CMR with increasing levels of outer-hair-cell damage. These results suggest that reduced spectral resolution relative to normal hearing impairs temporal-coherence-based segregation and speech-in-noise outcomes.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflicts to disclose.

Figures

FIG. 1.
FIG. 1.
(Color online) Comodulation making release (CMR) stimuli used for computational modeling and behavioral measurements. The stimuli consisted of a target signal (shown in green) in a 100% sinusoidally amplitude-modulated (SAM) tonal complex masker (shown in orange). The masker was composed of three 10 Hz SAM tones at carrier frequencies of 3022 Hz [on-frequency component (OFC)], 2142 Hz (first flanking component), and 4264 Hz (second flanking component). All three SAM tones were presented at the same sound level. In the Comodulated condition, the flanking components were modulated in phase with the OFC, while in the Codeviant condition, they were modulated 180° out of phase with the OFC. The target signal was a 3022 Hz pure tone presented at different signal-to-noise ratios (SNRs). The level of the OFC was fixed while that of the target signal was varied according to the SNR. The total duration of each stimulus was 0.5 s.
FIG. 2.
FIG. 2.
Across-channel temporal-coherence–based source-segregation model (Viswanathan et al., 2022) to predict CMR (A) and consonant confusions in noise (B) in different vocoding conditions (Intact, Vocoded, Current Spread) and as a function of outer-hair-cell (OHC) damage. (C) The CMR circuit model shown in (A) and (B). Detailed descriptions and model parameters are provided in the main text.
FIG. 3.
FIG. 3.
(Color online) (A) Model AN-fiber threshold tuning curves, (B), rate-level curves, and (C) thresholds (in dB hearing level, i.e., relative to COHC = 1) plotted against ERB [ERB was derived from (A) tuning data, and expressed as a ratio relative to the ERB of a normal-hearing ear (COHC = 1)]. Data are shown for varying degrees of OHC damage [simulated by varying the model parameter COHC from 1 (normal) to 0 (complete OHC dysfunction)]. All data were obtained for a 3225 Hz CF (i.e., the CF at which we derived CMR predictions).
FIG. 4.
FIG. 4.
(Color online) CMR as a function of simulated CI vocoding and current spread. (A) Estimated d′ values (mean and standard error across stimulus repetitions) from the temporal-coherence–based source-segregation model [Fig. 2(A)] for different SNRs and CMR conditions (Comodulated, Codeviant). (B) CMR predictions from the temporal-coherence model (mean and standard error across stimulus repetitions), calculated from (A) data as the mean difference in SNR threshold between Codeviant and Comodulated conditions across the d′ values predicted by the model. (C) Behaviorally measured proportion trials correct (mean and standard error across N = 20 samples) for different SNRs and CMR conditions. (D) Behaviorally measured CMR (mean and standard error across N = 20 samples), which was calculated for each sample as the SNR threshold difference between Codeviant and Comodulated conditions at a percent-correct score of 66%.
FIG. 5.
FIG. 5.
Speech intelligibility in speech-shaped noise as a function of simulated CI vocoding and current spread. The first (leftmost) plot shows predictions from a model of purely within-channel masking (mean and standard error across stimulus repetitions). The second (middle) plot shows predictions from the across-channel temporal-coherence–based source-segregation model [Fig. 2(B); mean and standard error across stimulus repetitions]. The third (rightmost) plot shows behavioral measurements (mean and standard error across N = 48 samples).
FIG. 6.
FIG. 6.
(Color online) CMR predictions at a fixed OFC level (83 dB SPL) as a function of degree of simulated OHC damage [COHC varied from 1 (normal) to 0 (complete OHC dysfunction)]. The sensation level (SL) of the OFC was 63 dB at COHC = 1, 43 dB at COHC = 0.25, 28 dB at COHC = 0.0625, 18 dB at COHC = 0.0156, and 18 dB at COHC = 0.0039 [derived from Fig. 3(A)]. (A) Estimated d′ values (mean and standard error across stimulus repetitions) from the temporal-coherence–based source-segregation model [Fig. 2(A)] for different SNRs and CMR conditions (Comodulated, Codeviant). (B) CMR predictions from the temporal-coherence model (mean and standard error across stimulus repetitions), calculated as the mean difference in SNR threshold between Codeviant and Comodulated conditions across the d′ values predicted by the model.
FIG. 7.
FIG. 7.
(Color online) CMR predictions at a fixed OFC loudness as a function of degree of simulated OHC damage [COHC varied from 1 (normal) to 0 (complete OHC dysfunction)]. The sound pressure level (SPL) of the OFC was 48 dB at COHC = 1, 57 dB at COHC = 0.25, 64 dB at COHC = 0.0625, 69 dB at COHC = 0.0156, and 69 dB at COHC = 0.0039. (A) d′ estimates (mean and standard error across stimulus repetitions) from the temporal-coherence–based source-segregation model [Fig. 2(A)] for different SNRs and CMR conditions (Comodulated, Codeviant). (B) CMR predictions from the temporal-coherence model (mean and standard error across stimulus repetitions), calculated as the mean difference in SNR threshold between Codeviant and Comodulated conditions across the d′ values predicted by the model.

Update of

References

    1. Bacon, S. P., and Opie, J. M. (2002). “ Modulation detection interference in listeners with normal and impaired hearing,” J. Speech. Lang. Hear. Res. 45(2), 392–402. 10.1044/1092-4388(2002/031) - DOI - PubMed
    1. Bacon, S. P., Opie, J. M., and Montoya, D. Y. (1998). “ The effects of hearing loss and noise masking on the masking release for speech in temporally complex backgrounds,” J. Speech. Lang. Hear. Res. 41(3), 549–563. 10.1044/jslhr.4103.549 - DOI - PubMed
    1. Bacon, S. P., and Wesley Grantham, D. (1989). “ Modulation masking: Effects of modulation frequency, depth, and phase,” J. Acoust. Soc. Am. 85(6), 2575–2580. 10.1121/1.397751 - DOI - PubMed
    1. Baer, T., and Moore, B. C. (1993). “ Effects of spectral smearing on the intelligibility of sentences in noise,” J. Acoust. Soc. Am. 94(3), 1229–1241. 10.1121/1.408176 - DOI - PubMed
    1. Baer, T., and Moore, B. C. (1994). “ Effects of spectral smearing on the intelligibility of sentences in the presence of interfering speech,” J. Acoust. Soc. Am. 95(4), 2277–2280. 10.1121/1.408640 - DOI - PubMed

Publication types