Brain Sci. 2024 Sep 28;14(10):988.

doi: 10.3390/brainsci14100988.

Efficient Neural Decoding Based on Multimodal Training

Yun Wang¹,²

Affiliations

¹ Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China.
² Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai 200433, China.

PMID: 39452003
PMCID: PMC11506634
DOI: 10.3390/brainsci14100988

Yun Wang. Brain Sci. 2024.

Abstract

Background/objectives: Neural decoding methods are often limited by the performance of brain encoders, which map complex brain signals into a latent representation space of perceptual information. These brain encoders are constrained by the limited amount of paired brain and stimulus data available for training, which makes it difficult to learn rich neural representations.

Methods: To address this limitation, we present a novel multimodal training approach using paired image and functional magnetic resonance imaging (fMRI) data to establish a brain masked autoencoder that learns the interactions between images and brain activities. Subsequently, we employ a diffusion model conditioned on brain data to decode realistic images.
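
As a rough illustration of the masking step that a masked autoencoder relies on, the NumPy sketch below hides a random subset of image and fMRI tokens and scores reconstruction only on the hidden fMRI tokens. It is not the paper's implementation: the token counts, the embedding size, the 0.75 mask ratio, the independent per-modality masking, and the helper names `random_mask` and `masked_mse` are all assumptions made for this example, and an all-zero placeholder stands in for the output of a real decoder.

```python
import numpy as np

def random_mask(tokens, mask_ratio, rng):
    """Split a (num_tokens, dim) array into visible tokens plus index sets."""
    num_tokens = tokens.shape[0]
    num_masked = int(round(num_tokens * mask_ratio))
    perm = rng.permutation(num_tokens)
    masked_idx = np.sort(perm[:num_masked])
    visible_idx = np.sort(perm[num_masked:])
    return tokens[visible_idx], visible_idx, masked_idx

def masked_mse(prediction, target, masked_idx):
    """Mean squared reconstruction error over the masked tokens only."""
    diff = prediction[masked_idx] - target[masked_idx]
    return float(np.mean(diff ** 2))

rng = np.random.default_rng(0)
image_tokens = rng.normal(size=(16, 8))  # hypothetical: 16 image patch embeddings
fmri_tokens = rng.normal(size=(12, 8))   # hypothetical: 12 fMRI token embeddings

# Mask each modality independently (an assumption), then join the visible
# tokens so a fusion encoder could attend across both modalities.
img_vis, img_vis_idx, img_mask_idx = random_mask(image_tokens, 0.75, rng)
fmri_vis, fmri_vis_idx, fmri_mask_idx = random_mask(fmri_tokens, 0.75, rng)
joint_visible = np.concatenate([img_vis, fmri_vis], axis=0)

# Placeholder for a decoder's reconstruction of the full fMRI token sequence.
fmri_prediction = np.zeros_like(fmri_tokens)
loss = masked_mse(fmri_prediction, fmri_tokens, fmri_mask_idx)
```

Restricting the loss to masked tokens is the usual masked-autoencoder convention: it forces the encoder to infer hidden content from what remains visible, which here includes tokens from the other modality.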

Results: Our method achieves high-quality decoding of both semantic content and low-level visual attributes, outperforming previous methods both qualitatively and quantitatively while maintaining computational efficiency. Additionally, we apply our method to decode artificial patterns across regions of interest (ROIs) to explore their functional properties. We not only validate existing knowledge concerning ROIs but also unveil new insights, such as the synergy between the early visual cortex and higher-level scene ROIs, as well as competition within the higher-level scene ROIs.
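
The abstract does not say how the artificial patterns are constructed, so the following is only one plausible reading: a synthetic fMRI vector in which the voxels of one or two chosen ROIs are set to a fixed amplitude and all other voxels stay at baseline. The ROI names, the 10-voxel layout, the amplitude and baseline values, and the function `synthetic_pattern` are hypothetical and chosen purely for illustration.

```python
import numpy as np

def synthetic_pattern(roi_masks, active_rois, num_voxels, amplitude=1.0, baseline=0.0):
    """Build a 1-D fMRI-like vector with the chosen ROIs set to `amplitude`."""
    pattern = np.full(num_voxels, baseline, dtype=float)
    for name in active_rois:
        pattern[roi_masks[name]] = amplitude
    return pattern

num_voxels = 10
voxel_ids = np.arange(num_voxels)

# Hypothetical ROI layout: boolean masks over a flattened voxel vector.
roi_masks = {
    "early_visual": voxel_ids < 4,
    "scene_roi_a": (voxel_ids >= 4) & (voxel_ids < 7),
    "scene_roi_b": voxel_ids >= 7,
}

# One active ROI, then two active ROIs, mirroring the two kinds of
# synthetic pattern described in the figure captions.
single = synthetic_pattern(roi_masks, ["early_visual"], num_voxels)
pair = synthetic_pattern(roi_masks, ["early_visual", "scene_roi_a"], num_voxels)
```

Vectors like `single` and `pair` would then be passed to the trained decoder in place of recorded fMRI data, so that the resulting images can be compared across ROI combinations.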

Conclusions: These findings provide valuable insights for future directions in the field of neural decoding.

Keywords: diffusion model; fusion transformer; multimodal pre-training; neural decoding; scene reconstruction.

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

**Figure 1**
Neural decoding of visual images. Subjects undergo fMRI scanning while viewing visual stimuli. The brain activity corresponding to the stimuli is recorded and transformed into features. Computational models reconstruct the original stimuli based on the features.

**Figure 2**
Proposed pipeline of decoding with multimodal training.

**Figure 4**
50-way accuracy and computational complexity.

**Figure 5**
Images derived from synthetic fMRI patterns generated through the activation of one ROI.

**Figure 6**
Images derived from synthetic fMRI patterns generated through the activation of two ROIs.
