Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 24:16:843988.
doi: 10.3389/fnins.2022.843988. eCollection 2022.

Exploring Hierarchical Auditory Representation via a Neural Encoding Model

Affiliations

Exploring Hierarchical Auditory Representation via a Neural Encoding Model

Liting Wang et al. Front Neurosci. .

Abstract

By integrating hierarchical feature modeling of auditory information using deep neural networks (DNNs), recent functional magnetic resonance imaging (fMRI) encoding studies have revealed the hierarchical neural auditory representation in the superior temporal gyrus (STG). Most of these studies adopted supervised DNNs (e.g., for audio classification) to derive the hierarchical feature representation of external auditory stimuli. One possible limitation is that the extracted features could be biased toward discriminative features while ignoring general attributes shared by auditory information in multiple categories. Consequently, the hierarchy of neural acoustic processing revealed by the encoding model might be biased toward classification. In this study, we explored the hierarchical neural auditory representation via an fMRI encoding framework in which an unsupervised deep convolutional auto-encoder (DCAE) model was adopted to derive the hierarchical feature representations of the stimuli (naturalistic auditory excerpts in different categories) in fMRI acquisition. The experimental results showed that the neural representation of hierarchical auditory features is not limited to previously reported STG, but also involves the bilateral insula, ventral visual cortex, and thalamus. The current study may provide complementary evidence to understand the hierarchical auditory processing in the human brain.

Keywords: deep convolutional auto-encoder; fMRI; hierarchical auditory representation; naturalistic experience; neural encoding.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
The schematic illustration of the study. (A) fMRI acquisition using naturalistic auditory excerpts as stimuli. (B) Hierarchical feature representation of the naturalistic auditory stimuli via an unsupervised DCAE model. (C) Hierarchical acoustic features were correlated to fMRI brain activities using an encoding model based on LASSO to infer hierarchical auditory representation in the brain.
FIGURE 2
FIGURE 2
The DCAE model used for the hierarchical feature modeling.
FIGURE 3
FIGURE 3
The architecture of the supervised DNNs for audio classification. GAP, global average pooling.
FIGURE 4
FIGURE 4
Visualization of learned filters in the DCAE model. (A) Examples of the learned filters in each layer. (B) The power-spectrum patterns of learned filters. The x-axis represents the index of filters, the y-axis represents the frequency ranging from 0 to 8000 Hz.
FIGURE 5
FIGURE 5
Encoding performance of the trained DCAE model. (A) The distribution of Pearson correlation coefficients (PCC) between the input audio signals and reconstructed ones in the MagnaTagATune dataset and LibriSpeech Corpus. (B) The distribution of PCC in the auditory samples from the fMRI stimuli.
FIGURE 6
FIGURE 6
The encoding performance for each layer in the unsupervised DCAE model (A) and supervised classification model (B).
FIGURE 7
FIGURE 7
The comparison of encoding performance between the unsupervised DCAE model and supervised classification model in each layer. A1, primary auditory cortex; aSTG, anterior superior temporal gyrus; pSTG, posterior superior temporal gyrus; MTG, middle temporal gyrus; VC, visual cortex; PFC, prefrontal cortex.
FIGURE 8
FIGURE 8
Brain regions that are selectively activated by the hierarchical acoustic features represented in each encoder layer of the unsupervised DCAE model. Panels (A–D) represent the first four layers in the unsupervised DCAE model.
FIGURE 9
FIGURE 9
Brain regions that are selectively activated by the hierarchical acoustic features represented in each layer of the supervised classification model. Panels (A–D) represent the first four layers in the supervised classification model.

Similar articles

Cited by

References

    1. Ahveninen J., Jaaskelainen I. P., Raij T., Bonmassar G., Devore S., Hamalainen M., et al. (2006). Task-modulated “what” and “where” pathways in human auditory cortex. Proc. Natl. Acad. Sci. U.S.A. 103 14608–14613. 10.1073/pnas.0510480103 - DOI - PMC - PubMed
    1. Bilgic B., Chatnuntawech I., Fan A. P., Setsompop K., Cauley S. F., Wald L. L., et al. (2014). Fast image reconstruction with L2-regularization. J. Magn. Reson. Imaging 40 181–191. 10.1002/jmri.24365 - DOI - PMC - PubMed
    1. Bizley J. K., Cohen Y. E. (2013). The what, where and how of auditory-object perception. Nat. Rev. Neurosci. 14 693–707. 10.1038/nrn3565 - DOI - PMC - PubMed
    1. Cate A. D., Herron T. J., Yund E. W., Stecker G. C., Rinne T., Kang X., et al. (2009). Auditory attention activates peripheral visual cortex. PLoS One 4:e4645. 10.1371/journal.pone.0004645 - DOI - PMC - PubMed
    1. Chevillet M. A., Riesenhuber M., Rauschecker J. P. (2011). Functional correlates of the anterolateral processing hierarchy in human auditory cortex. J. Neurosci. 31 9345–9352. 10.1523/JNEUROSCI.1448-11.2011 - DOI - PMC - PubMed

LinkOut - more resources