Modeling Structure-Building in the Brain With CCG Parsing and Large Language Models
- PMID: 37417470
- DOI: 10.1111/cogs.13312
Modeling Structure-Building in the Brain With CCG Parsing and Large Language Models
Abstract
To model behavioral and neural correlates of language comprehension in naturalistic environments, researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFGs), yet such formalisms are not sufficiently expressive for human languages. Combinatory categorial grammars (CCGs) are sufficiently expressive directly compositional models of grammar with flexible constituency that affords incremental interpretation. In this work, we evaluate whether a more expressive CCG provides a better model than a CFG for human neural signals collected with functional magnetic resonance imaging (fMRI) while participants listen to an audiobook story. We further test between variants of CCG that differ in how they handle optional adjuncts. These evaluations are carried out against a baseline that includes estimates of next-word predictability from a transformer neural network language model. Such a comparison reveals unique contributions of CCG structure-building predominantly in the left posterior temporal lobe: CCG-derived measures offer a superior fit to neural signals compared to those derived from a CFG. These effects are spatially distinct from bilateral superior temporal effects that are unique to predictability. Neural effects for structure-building are thus separable from predictability during naturalistic listening, and those effects are best characterized by a grammar whose expressive power is motivated on independent linguistic grounds.
Keywords: Grammar; Language modeling; Neural networks; Parsing; Surprisal; Syntax; fMRI.
© 2023 The Authors. Cognitive Science published by Wiley Periodicals LLC on behalf of Cognitive Science Society (CSS).
Similar articles
-
Localizing syntactic predictions using recurrent neural network grammars.Neuropsychologia. 2020 Sep;146:107479. doi: 10.1016/j.neuropsychologia.2020.107479. Epub 2020 May 16. Neuropsychologia. 2020. PMID: 32428530
-
Negative correlation between word-level surprisal and intersubject neural synchronization during narrative listening.Cortex. 2022 Oct;155:132-149. doi: 10.1016/j.cortex.2022.07.005. Epub 2022 Aug 1. Cortex. 2022. PMID: 35985124
-
Semantics-weighted lexical surprisal modeling of naturalistic functional MRI time-series during spoken narrative listening.Neuroimage. 2020 Nov 15;222:117281. doi: 10.1016/j.neuroimage.2020.117281. Epub 2020 Aug 21. Neuroimage. 2020. PMID: 32828929
-
The neural basis of combinatory syntax and semantics.Science. 2019 Oct 4;366(6461):62-66. doi: 10.1126/science.aax0050. Science. 2019. PMID: 31604303 Review.
-
Cognitive control mediates age-related changes in flexible anticipatory processing during listening comprehension.Brain Res. 2021 Oct 1;1768:147573. doi: 10.1016/j.brainres.2021.147573. Epub 2021 Jun 30. Brain Res. 2021. PMID: 34216583 Free PMC article. Review.
Cited by
-
Grammatical Parallelism in Aphasia: A Lesion-Symptom Mapping Study.Neurobiol Lang (Camb). 2023 Oct 31;4(4):550-574. doi: 10.1162/nol_a_00117. eCollection 2023. Neurobiol Lang (Camb). 2023. PMID: 37946730 Free PMC article.
-
Localizing Syntactic Composition with Left-Corner Recurrent Neural Network Grammars.Neurobiol Lang (Camb). 2024 Apr 1;5(1):201-224. doi: 10.1162/nol_a_00118. eCollection 2024. Neurobiol Lang (Camb). 2024. PMID: 38645619 Free PMC article.
-
Mapping the learning curves of deep learning networks.PLoS Comput Biol. 2025 Feb 10;21(2):e1012286. doi: 10.1371/journal.pcbi.1012286. eCollection 2025 Feb. PLoS Comput Biol. 2025. PMID: 39928655 Free PMC article.
-
Shared functional specialization in transformer-based language models and the human brain.Nat Commun. 2024 Jun 29;15(1):5523. doi: 10.1038/s41467-024-49173-5. Nat Commun. 2024. PMID: 38951520 Free PMC article.
-
Language-specific neural dynamics extend syntax into the time domain.PLoS Biol. 2025 Jan 21;23(1):e3002968. doi: 10.1371/journal.pbio.3002968. eCollection 2025 Jan. PLoS Biol. 2025. PMID: 39836653 Free PMC article.
References
-
- Abney, S., & Johnson, M. (1991). Memory requirements and local ambiguities of parsing strategies. Journal of Psycholinguistic Research, 20(3), 233-250.
-
- Abraham, A., Pedregosa, F., Eickenberg, M., Gervais, P., Mueller, A., Kossaifi, J., Gramfort, A., Thirion, B., & Varoquaux, G. (2014). Machine learning for neuroimaging with scikit-learn. Frontiers in Neuroinformatics, 8.
-
- Altmann, G., & Steedman, M. (1988). Interaction with context during human sentence processing. Cognition, 30(3), 191.
-
- Amici, S., Brambati, S. M., Wilkins, D. P., Ogar, J., Dronkers, N. L., Miller, B. L., & Gorno-Tempini, M. L. (2007). Anatomical correlates of sentence comprehension and verbal working memory in neurodegenerative disease. Journal of Neuroscience, 27(23), 6282-6290.
-
- Barker, C., & Jacobson, P. I. (2007). Direct compositionality. Oxford linguistics. Oxford University Press.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources