Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Sep:352:30-39.
doi: 10.1016/j.heares.2017.01.006. Epub 2017 Jan 11.

Voice emotion perception and production in cochlear implant users

Affiliations
Review

Voice emotion perception and production in cochlear implant users

N T Jiam et al. Hear Res. 2017 Sep.

Abstract

Voice emotion is a fundamental component of human social interaction and social development. Unfortunately, cochlear implant users are often forced to interface with highly degraded prosodic cues as a result of device constraints in extraction, processing, and transmission. As such, individuals with cochlear implants frequently demonstrate significant difficulty in recognizing voice emotions in comparison to their normal hearing counterparts. Cochlear implant-mediated perception and production of voice emotion is an important but relatively understudied area of research. However, a rich understanding of the voice emotion auditory processing offers opportunities to improve upon CI biomedical design and to develop training programs benefiting CI performance. In this review, we will address the issues, current literature, and future directions for improved voice emotion processing in cochlear implant users.

Keywords: Cochlear implant; Speech prosody; Voice emotion; Voice emotion perception; Voice emotion production.

PubMed Disclaimer

Conflict of interest statement

Declaration of Interest: There are no potential conflicts of interests.

Figures

Figure 1
Figure 1
17 school-aged children were asked to discriminate sinusoidal amplitude modulation rate (AMR) of broadband noise and to discriminate the fundamental frequency (F0) of broadband sine-phase harmonic complexes (Deroche, et al., 2012). Children who had adult-like sensitivity were not necessarily the oldest listeners. For example, the youngest research subject was 6.5 years old and demonstrated a standardized threshold of only 1.08 semitones for AMR discrimination and 12 cents for F0 discrimination, suggesting that children’s sensitivity to pitch (regardless of the underlying cue) does not systematically improve beyond 6 years of age.
Figure 2
Figure 2
Percent correct scores for CI users (left) and NH listeners (right) on four subtests of PEPS-C, including turn-end, affect, chunking, and contrastive stress (Kalathottukaren, et al., 2015). As indicated by the asterisk, chunking was the only task for which CI users did not perform significantly worse than NH listeners.
Figure 3
Figure 3
Mean emotion recognition scores for adult NH, adult CI, child NH, and child CI study groups across full spectrum speech and speech presented with three levels of spectral degradation (Chatterjee, et al., 2015). Child and adult CI users’ performance were similar to one another’s and comparable to adult NH’s performance in 8-channel degraded speech.
Figure 4
Figure 4
Cochlear implant (CI) versus normal hearing (NH) group mean proportions of question judgements as a function of an acoustic dimension (F0 height, F0 contour, peak intensity ratio, and duration ratio) (Peng, et al., 2012). The NH group was also subject to different listening conditions (full-spectrum, 8-, and 4-channel conditions) as depicted by different symbols.
Figure 5
Figure 5
Accuracy rate in identification of neutral, angry, and happy prosody in NH listeners and CI users using ACE and MP3000 strategies (Agrawal, et al., 2013).
Figure 6
Figure 6
Graphical representation of improvements in question-statement discrimination (as measured by the PEPS-C test) in CI users after participation in melodic contour training (Yhun Lo, et al., 2015). Two programs of melodic contour training were administered. The “Interval” program incorporated manipulations of note intervals, whereas the “Duration” program changed the note durations, in order to make the tests more challenging. The dashed line represents chance performance.

Similar articles

Cited by

References

    1. Agrawal D, Thorne JD, Viola FC, et al. Electrophysiological responses to emotional prosody perception in cochlear implant users. Neuroimage Clin. 2013;2:229–238. doi: 10.1016/j.nicl.2013.01.001. - DOI - PMC - PubMed
    1. Agrawal D, Timm L, Viola FC, et al. ERP evidence for the recognition of emotional prosody through simulated cochlear implant strategies. BMC Neurosci. 2012;13(113):1–10. doi: 10.1186/1471-2202-13-113. - DOI - PMC - PubMed
    1. Andrews FM, Diehl NC. Development of a technique for identifying elementary school children’s musical concepts. J Res Music Ed. 1970;18:214–222.
    1. Apple W, Streeter LA Krauss RM. Effects of pitch and speech rate on personal attributions. J Pers Soc Psychol. 1979;37:715–27.
    1. Artières F, Vieu A, Mondain M, Uziel A, Venail F. Impact of early cochlear implantation on the linguistic development of the deaf child. Otol Neurotol. 2009;30(6):736–742. doi: 10.1097/MAO.0b013e3181b2367b. - DOI - PubMed