Brain-inspired multisensory integration neural network for cross-modal recognition through spatiotemporal dynamics and deep learning
- PMID: 39712112
- PMCID: PMC11655826
- DOI: 10.1007/s11571-023-09932-4
Brain-inspired multisensory integration neural network for cross-modal recognition through spatiotemporal dynamics and deep learning
Abstract
The integration and interaction of cross-modal senses in brain neural networks can facilitate high-level cognitive functionalities. In this work, we proposed a bioinspired multisensory integration neural network (MINN) that integrates visual and audio senses for recognizing multimodal information across different sensory modalities. This deep learning-based model incorporates a cascading framework of parallel convolutional neural networks (CNNs) for extracting intrinsic features from visual and audio inputs, and a recurrent neural network (RNN) for multimodal information integration and interaction. The network was trained using synthetic training data generated for digital recognition tasks. It was revealed that the spatial and temporal features extracted from visual and audio inputs by CNNs were encoded in subspaces orthogonal with each other. In integration epoch, network state evolved along quasi-rotation-symmetric trajectories and a structural manifold with stable attractors was formed in RNN, supporting accurate cross-modal recognition. We further evaluated the robustness of the MINN algorithm with noisy inputs and asynchronous digital inputs. Experimental results demonstrated the superior performance of MINN for flexible integration and accurate recognition of multisensory information with distinct sense properties. The present results provide insights into the computational principles governing multisensory integration and a comprehensive neural network model for brain-inspired intelligence.
Keywords: Cross-modal recognition; Deep learning; Multisensory integration; Neural networks.
© The Author(s), under exclusive licence to Springer Nature B.V. 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Similar articles
-
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142. Br J Dermatol. 2024. PMID: 38581445
-
Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage.J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6. J Prev Alzheimers Dis. 2025. PMID: 39920001 Free PMC article.
-
A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w. Sci Rep. 2025. PMID: 40596197 Free PMC article.
-
EEG-based classification of individuals with neuropsychiatric disorders using deep neural networks: A systematic review of current status and future directions.Comput Methods Programs Biomed. 2023 Oct;240:107683. doi: 10.1016/j.cmpb.2023.107683. Epub 2023 Jun 20. Comput Methods Programs Biomed. 2023. PMID: 37406421
-
Integrating Patient Data Into Skin Cancer Classification Using Convolutional Neural Networks: Systematic Review.J Med Internet Res. 2021 Jul 2;23(7):e20708. doi: 10.2196/20708. J Med Internet Res. 2021. PMID: 34255646 Free PMC article.
Cited by
-
An efficient memory reserving-and-fading strategy for vector quantization based 3D brain segmentation and tumor extraction using an unsupervised deep learning network.Cogn Neurodyn. 2023 Apr 26;18(3):1-22. doi: 10.1007/s11571-023-09965-9. Online ahead of print. Cogn Neurodyn. 2023. PMID: 37362765 Free PMC article.
-
Brain-Inspired Multisensory Learning: A Systematic Review of Neuroplasticity and Cognitive Outcomes in Adult Multicultural and Second Language Acquisition.Biomimetics (Basel). 2025 Jun 12;10(6):397. doi: 10.3390/biomimetics10060397. Biomimetics (Basel). 2025. PMID: 40558367 Free PMC article. Review.
References
-
- Alais D, Newell F, Mamassian P (2010) Multisensory processing in review: from physiology to behaviour. See Perceiv 23(1):3–38. 10.1163/187847510X488603 - PubMed
-
- Alvarado JC, Vaughan JW, Stanford TR, Stein BE (2007) Multisensory versus unisensory integration: contrasting modes in the superior colliculus. J Neurophysiol 97(5):3193–3205. 10.1152/jn.00018.2007 - PubMed
-
- Barak O (2017) Recurrent neural networks as versatile tools of neuroscience research. Curr Opin Neurobiol 46:1–6. 10.1016/j.conb.2017.06.003 - PubMed
LinkOut - more resources
Full Text Sources