Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan-Dec:27:23312165221141142.
doi: 10.1177/23312165221141142.

Musical Emotion Categorization with Vocoders of Varying Temporal and Spectral Content

Affiliations

Musical Emotion Categorization with Vocoders of Varying Temporal and Spectral Content

Eleanor E Harding et al. Trends Hear. 2023 Jan-Dec.

Abstract

While previous research investigating music emotion perception of cochlear implant (CI) users observed that temporal cues informing tempo largely convey emotional arousal (relaxing/stimulating), it remains unclear how other properties of the temporal content may contribute to the transmission of arousal features. Moreover, while detailed spectral information related to pitch and harmony in music - often not well perceived by CI users- reportedly conveys emotional valence (positive, negative), it remains unclear how the quality of spectral content contributes to valence perception. Therefore, the current study used vocoders to vary temporal and spectral content of music and tested music emotion categorization (joy, fear, serenity, sadness) in 23 normal-hearing participants. Vocoders were varied with two carriers (sinewave or noise; primarily modulating temporal information), and two filter orders (low or high; primarily modulating spectral information). Results indicated that emotion categorization was above-chance in vocoded excerpts but poorer than in a non-vocoded control condition. Among vocoded conditions, better temporal content (sinewave carriers) improved emotion categorization with a large effect while better spectral content (high filter order) improved it with a small effect. Arousal features were comparably transmitted in non-vocoded and vocoded conditions, indicating that lower temporal content successfully conveyed emotional arousal. Valence feature transmission steeply declined in vocoded conditions, revealing that valence perception was difficult for both lower and higher spectral content. The reliance on arousal information for emotion categorization of vocoded music suggests that efforts to refine temporal cues in the CI user signal may immediately benefit their music emotion perception.

Keywords: arousal; cochlear implants; music emotion perception; valence; vocoders.

PubMed Disclaimer

Conflict of interest statement

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
Example stimuli spectrograms shown in non-vocoded and vocoded conditions. Fear, a high-arousal emotion, shows more energy at frequencies across the spectrum in all conditions. Sadness, in comparison, is lower arousal and accordingly displays lower spectral energy across frequencies, especially compared to higher frequencies in the fear condition. Compared to non-vocoded spectrograms, vocoders with sinewave carriers contain energy more localized to channel frequencies whereas in the noise carrier, energy is smeared across multiple frequencies.
Figure 2.
Figure 2.
Participants were presented with the scenario that they were an astronaut on an alien planet who was helping aliens to understand earth music. (Illustration by Kristin Hrycyk, image published under the CC BY NC 4.0 license, https://creativecommons.org/licenses/by-nc/4.0/).
Figure 3.
Figure 3.
Sensitivity in perception of presented emotion categories (d’) as a function of vocoder manipulations (x-axis), and shown for different emotions (color and symbol). The horizontal line in the boxplot shows the median sensitivity across participants. The box extends from the 25th to the 75th percentile, and the whiskers extend to the value most remote from the median within 1.5 times the interquartile range. Individual data points are overlaid on top of the boxplots. d' = 0 denotes the chance level.
Figure 4.
Figure 4.
Confidence ratings per vocoder. In all except the non-vocoded condition, confidence was reported higher as sensitivity increased.
Figure 5.
Figure 5.
Confusion matrices for emotion categorization shown for each vocoder condition. The size and color of each dot is proportional to the number of relevant responses. The presented emotion categories are listed on the x-axis, while the responded emotion categories are listed on the y-axis.
Figure 6.
Figure 6.
Relative transmission in percent of the total available information, shown as a function of feature (hatched vs. solid) and vocoder (x-axis). The sinewave and noise carriers are further marked by color (sinewave: blue; noise: red). The signification of the boxplot details is identical to that of Figure 3.

References

    1. Ambert-Dahan E., Giraud A. L., Sterkers O., Samson S. (2015). Judgment of musical emotions after cochlear implantation in adults with progressive deafness. Frontiers in Psychology, 6, 181. 10.3389/fpsyg.2015.00181 - DOI - PMC - PubMed
    1. Bakeman R. (2005). Recommended effect size statistics for repeated measures designs. Behavior Research Methods, 37(3), 379–384. 10.3758/BF03192707 - DOI - PubMed
    1. Bates D., Maechler M., Bolker B., Walker S., Christensen R. H., Singmann H., Dai B. (2015). lme4: Linear mixed-effects models using Eigen and S4[computer program]. R package version 1.1–7. 2014.
    1. de Balthasar C., Boëx C., Cosendai G., Valentini G., Sigrist A., Pelizzone M. (2003). Channel interactions with high-rate biphasic electrical stimulation in cochlear implant subjects. Hearing Research, 182(1), 77–87. 10.1016/S0378-5955(03)00174-6 - DOI - PubMed
    1. Başkent D., Gaudrain E., Tamati T., Wagner A. E. (2016). Perception and Psychoacoustics of Speech in Cochlear Implant Users. In Cacace A. T., de Kleine E., Holt A. G., van Dijk P. (Eds.), Scientific Foundations of Audiology: Perspectives from Physics, Biology, Modeling, and Medicine (pp. 285–319). Plural Publishing.

Publication types