Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Oct:171:49-61.
doi: 10.1016/j.neures.2021.01.007. Epub 2021 Jan 20.

Estimating the reduced benefit of infant-directed speech in cochlear implant-related speech processing

Affiliations

Estimating the reduced benefit of infant-directed speech in cochlear implant-related speech processing

Meisam Arjmandi et al. Neurosci Res. 2021 Oct.

Abstract

Caregivers modify their speech when talking to infants, a specific type of speech known as infant-directed speech (IDS). This speaking style facilitates language learning compared to adult-directed speech (ADS) in infants with normal hearing (NH). While infants with NH and those with cochlear implants (CIs) prefer listening to IDS over ADS, it is yet unknown how CI processing may affect the acoustic distinctiveness between ADS and IDS, as well as the degree of intelligibility of these. This study analyzed speech of seven female adult talkers to model the effects of simulated CI processing on (1) acoustic distinctiveness between ADS and IDS, (2) estimates of intelligibility of caregivers' speech in ADS and IDS, and (3) individual differences in caregivers' ADS-to-IDS modification and estimated speech intelligibility. Results suggest that CI processing is substantially detrimental to the acoustic distinctiveness between ADS and IDS, as well as to the intelligibility benefit derived from ADS-to-IDS modifications. Moreover, the observed variability across individual talkers in acoustic implementation of ADS-to-IDS modification and the estimated speech intelligibility was significantly reduced due to CI processing. The findings are discussed in the context of the link between IDS and language learning in infants with CIs.

Keywords: Acoustic distance; Cochlear implant; Individual differences; Infant-directed speech; Speech intelligibility.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Schematic diagram of the approach used in the present study for measuring (1) MD between pairs of MFCCs derived from pairs of IDS-ADS stimuli, and (2) intelligibility of IDS and ADS stimuli as estimated by SRMR value for listeners with NH and SRMR-CI for those with CIs. Note that the dashed line denotes the process for creating and analyzing the noise-vocoded versions of the same pairs of stimuli, while N stands for the number of spectral channels in the noise-excited envelope vocoder. Blocks and lines with blue (dark gray) color indicate paths for processing ADS, while those with orange (light gray) color indicate paths for processing IDS. Note that SRMR and SRMR-CI were calculated only for unprocessed stimuli in order to estimate intelligibility for NH and CI listeners, respectively. In this figure, the waveforms and their corresponding MFCCs are from the utterance “See the modi?” spoken by one of the seven talkers both in IDS and ADS speaking styles. SIDSij and SADSij are the ith pair of IDS and ADS stimuli (i = {1,2,3,…,15}) for talker j (j ={1,2,3,…,7}). MFCCsIDSij and MFCCsADSij are MFCC features derived from SIDSij and SADSij speech stimuli, respectively. The middle two panels show MFCCs obtained from the frames of these IDS and ADS stimuli. MDij is the MD calculated to measure the acoustic distance between the two matrices for MFCCsIDSij and MFCCsADSij. SRMRIDSij and SRMRADSij are the estimated intelligibility for SIDSij and SADSij speech stimuli, respectively, as heard by listeners with NH. SRMR-CIIDSij and SRMR-CIADSij are the estimated speech intelligibility for the same SIDSij and SADSij speech stimuli, respectively, as heard by a listener with CIs.
FIG. 2.
FIG. 2.
The mean (bar) and ±1 standard error (vertical error bar in black) of Mahalanobis distance (MD) across groups of IDS and ADS stimuli at seven levels of spectro-temporal degradation, ranging from no-degradation (natural/unprocessed) to 4-channel noise-vocoded stimuli. Green (dark gray) circles show the mean MDs for each talker derived by averaging MDs over 15 pairs of IDS-ADS stimuli for that talker.
FIG. 3.
FIG. 3.
Estimated intelligibility of speech stimuli spoken in IDS (orange; light gray) and in ADS (blue; dark gray) styles (simulated) groups of listeners with different hearing statuses: NH (estimated by SRMR) and those with CIs (estimated by SRMR-CI). The bar graphs represent average values of SRMR or SRMR-CI over the seven talkers. The vertical error bars in black show ±1 standard error.
FIG. 4.
FIG. 4.
Variability across seven female talkers in (A) acoustic distinctiveness between their IDS and ADS for unprocessed stimuli (Natural) and simulated CI speech within a 22-channel noise vocoder (22CH), and (B) the change in talkers’ speech intelligibility (SI) due to a change in their speaking style (ADS to IDS), as heard for two (simulated) listener groups with either NH (estimated by SRMR) or CIs (estimated by SRMR-CI). The data points (gray circles) are laid over a 1.96 standard error of the mean (95% confidence interval) in red (rectangle area with light gray) and 1 standard deviation shown by blue lines (vertical dark gray lines). The solid and dotted red lines (horizontal solid and dotted dark gray lines) show the mean and median, respectively.

References

    1. Allen JS, Miller JL, & DeSteno D (2009). Individual talker differences in voice-onset-time: Contextual influences. The Journal of the Acoustical Society of America, 125(6), 3974–3982. - PMC - PubMed
    1. Arenberg Bierer J (2010). Probing the electrode-neuron interface with focused cochlear implant stimulation. Trends in Amplification, 14(2), 84–95. - PMC - PubMed
    1. Arjmandi M, Dilley LC, & Wagner SE (2018). Investigation of acoustic dimension use in dialect production: machine learning of sonorant sounds for modeling acoustic cues of African American dialect. 11th International Conference on Voice Physiology and Biomechanics, 12–13. East Lansing, USA.
    1. Baskent D, & Gaudrain E (2016). Perception and psychoacoustics of speech in cochlear implant users. Scientific Foundations of Audiology. Perspectives from Physics, Biology, Modelling, and Medicine, 185–320.
    1. Beauchemin M, Gonzalez-Frankenberger B, Tremblay J, Vannasing P, Martínez-Montes E, Belin P, Beland R, Francoeur D, Carceller AM, Wallois F and Lassonde M (2011). Mother and stranger: An electrophysiological study of voice processing in newborns. Cerebral Cortex, 21(8), 1705–1711. - PubMed