Exploring Hierarchical Auditory Representation via a Neural Encoding Model
- PMID: 35401085
- PMCID: PMC8987159
- DOI: 10.3389/fnins.2022.843988
Exploring Hierarchical Auditory Representation via a Neural Encoding Model
Abstract
By integrating hierarchical feature modeling of auditory information using deep neural networks (DNNs), recent functional magnetic resonance imaging (fMRI) encoding studies have revealed the hierarchical neural auditory representation in the superior temporal gyrus (STG). Most of these studies adopted supervised DNNs (e.g., for audio classification) to derive the hierarchical feature representation of external auditory stimuli. One possible limitation is that the extracted features could be biased toward discriminative features while ignoring general attributes shared by auditory information in multiple categories. Consequently, the hierarchy of neural acoustic processing revealed by the encoding model might be biased toward classification. In this study, we explored the hierarchical neural auditory representation via an fMRI encoding framework in which an unsupervised deep convolutional auto-encoder (DCAE) model was adopted to derive the hierarchical feature representations of the stimuli (naturalistic auditory excerpts in different categories) in fMRI acquisition. The experimental results showed that the neural representation of hierarchical auditory features is not limited to previously reported STG, but also involves the bilateral insula, ventral visual cortex, and thalamus. The current study may provide complementary evidence to understand the hierarchical auditory processing in the human brain.
Keywords: deep convolutional auto-encoder; fMRI; hierarchical auditory representation; naturalistic experience; neural encoding.
Copyright © 2022 Wang, Liu, Zhang, Zhao, Guo, Han and Hu.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures









Similar articles
-
Cortical processing of pitch: Model-based encoding and decoding of auditory fMRI responses to real-life sounds.Neuroimage. 2018 Oct 15;180(Pt A):291-300. doi: 10.1016/j.neuroimage.2017.11.020. Epub 2017 Nov 13. Neuroimage. 2018. PMID: 29146377
-
Hierarchical Individual Naturalistic Functional Brain Networks with Group Consistency uncovered by a Two-Stage NAS-Volumetric Sparse DBN Framework.eNeuro. 2022 Aug 19;9(5):ENEURO.0200-22.2022. doi: 10.1523/ENEURO.0200-22.2022. Online ahead of print. eNeuro. 2022. PMID: 35995557 Free PMC article.
-
A Visual Encoding Model Based on Contrastive Self-Supervised Learning for Human Brain Activity along the Ventral Visual Stream.Brain Sci. 2021 Jul 29;11(8):1004. doi: 10.3390/brainsci11081004. Brain Sci. 2021. PMID: 34439623 Free PMC article.
-
Stimulus-dependent activations and attention-related modulations in the auditory cortex: a meta-analysis of fMRI studies.Hear Res. 2014 Jan;307:29-41. doi: 10.1016/j.heares.2013.08.001. Epub 2013 Aug 11. Hear Res. 2014. PMID: 23938208 Review.
-
The functional neuroanatomy of face perception: from brain measurements to deep neural networks.Interface Focus. 2018 Aug 6;8(4):20180013. doi: 10.1098/rsfs.2018.0013. Epub 2018 Jun 15. Interface Focus. 2018. PMID: 29951193 Free PMC article. Review.
Cited by
-
Preliminary Evidence for Global Properties in Human Listeners During Natural Auditory Scene Perception.Open Mind (Camb). 2024 Mar 26;8:333-365. doi: 10.1162/opmi_a_00131. eCollection 2024. Open Mind (Camb). 2024. PMID: 38571530 Free PMC article.
References
LinkOut - more resources
Full Text Sources