Brain Sci. 2024 Sep 28;14(10):988.
doi: 10.3390/brainsci14100988.

Efficient Neural Decoding Based on Multimodal Training

Yun Wang. Brain Sci. 2024.

Abstract

Background/objectives: Neural decoding methods are often limited by the performance of brain encoders, which map complex brain signals into a latent representation space of perception information. These brain encoders are constrained by the limited amount of paired brain and stimuli data available for training, making it challenging to learn rich neural representations.

Methods: To address this limitation, we present a novel multimodal training approach using paired image and functional magnetic resonance imaging (fMRI) data to establish a brain masked autoencoder that learns the interactions between images and brain activities. Subsequently, we employ a diffusion model conditioned on brain data to decode realistic images.
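
As a rough illustration of the masking step behind a masked autoencoder, the sketch below splits a token sequence into visible and masked subsets; an encoder would see only the visible tokens and a decoder would be trained to reconstruct the masked ones. The abstract does not specify the tokenization, the mask ratio, or the architecture, so the function names `random_mask` and `select_visible`, the 16-token sequence, and the 0.75 ratio are all hypothetical choices made for illustration, not the authors' implementation.

```python
import random


def random_mask(num_tokens, mask_ratio, seed=0):
    """Randomly partition token indices into visible and masked sets.

    All parameters are illustrative assumptions; the paper's actual
    masking scheme is not described in the abstract.
    """
    if not 0.0 <= mask_ratio <= 1.0:
        raise ValueError("mask_ratio must be in [0, 1]")
    rng = random.Random(seed)          # local RNG for reproducibility
    order = list(range(num_tokens))
    rng.shuffle(order)                 # random permutation of token indices
    num_masked = int(round(num_tokens * mask_ratio))
    masked = sorted(order[:num_masked])
    visible = sorted(order[num_masked:])
    return visible, masked


def select_visible(tokens, visible):
    """Keep only the tokens that the encoder is allowed to see."""
    return [tokens[i] for i in visible]


# Hypothetical example: 16 fMRI (or image) tokens, 75% of them masked.
tokens = [f"tok{i}" for i in range(16)]
visible, masked = random_mask(len(tokens), mask_ratio=0.75)
encoder_input = select_visible(tokens, visible)
```

In a multimodal setting, the same partitioning could be applied separately to image tokens and fMRI tokens before they are fused, so that each modality helps reconstruct the masked part of the other; whether the paper does this is not stated in the abstract.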

Results: Our method achieves high-quality decoding results in both semantic content and low-level visual attributes, outperforming previous methods both qualitatively and quantitatively while maintaining computational efficiency. Additionally, we apply our method to decode artificial patterns across regions of interest (ROIs) to explore their functional properties. We not only validate existing knowledge concerning ROIs but also unveil new insights, such as the synergy between the early visual cortex and higher-level scene ROIs, as well as the competition within the higher-level scene ROIs.

Conclusions: These findings provide valuable insights for future directions in the field of neural decoding.

Keywords: diffusion model; fusion transformer; multimodal pre-training; neural decoding; scene reconstruction.

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure A1. Qualitative results.
Figure 1. Neural decoding of visual images. Subjects undergo fMRI scanning while viewing visual stimuli. The brain activity corresponding to the stimuli is recorded and transformed into features. Computational models reconstruct the original stimuli based on the features.
Figure 2. Proposed pipeline of decoding with multimodal training.
Figure 3. Qualitative results.
Figure 4. 50-way accuracy and computational complexity.
Figure 5. Images derived from synthetic fMRI patterns generated through the activation of one ROI.
Figure 6. Images derived from synthetic fMRI patterns generated through the activation of two ROIs.
