Modeling Structure-Building in the Brain With CCG Parsing and Large Language Models

Miloš Stanojević¹, Jonathan R Brennan², Donald Dunagan³, Mark Steedman⁴, John T Hale¹,³

Affiliations

¹ Google DeepMind.
² Department of Linguistics, University of Michigan.
³ Department of Linguistics, University of Georgia.
⁴ School of Informatics, University of Edinburgh.

PMID: 37417470
DOI: 10.1111/cogs.13312

Miloš Stanojević et al. Cogn Sci. 2023 Jul;47(7):e13312.

Abstract

To model behavioral and neural correlates of language comprehension in naturalistic environments, researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFGs), yet such formalisms are not sufficiently expressive for human languages. Combinatory categorial grammars (CCGs) are sufficiently expressive, directly compositional models of grammar with flexible constituency that affords incremental interpretation. In this work, we evaluate whether a more expressive CCG provides a better model than a CFG for human neural signals collected with functional magnetic resonance imaging (fMRI) while participants listen to an audiobook story. We further test between variants of CCG that differ in how they handle optional adjuncts. These evaluations are carried out against a baseline that includes estimates of next-word predictability from a transformer neural network language model. Such a comparison reveals unique contributions of CCG structure-building predominantly in the left posterior temporal lobe: CCG-derived measures offer a superior fit to neural signals compared to those derived from a CFG. These effects are spatially distinct from bilateral superior temporal effects that are unique to predictability. Neural effects for structure-building are thus separable from predictability during naturalistic listening, and those effects are best characterized by a grammar whose expressive power is motivated on independent linguistic grounds.

Keywords: Grammar; Language modeling; Neural networks; Parsing; Surprisal; Syntax; fMRI.

Cited by

Grammatical Parallelism in Aphasia: A Lesion-Symptom Mapping Study.
Matchin W, den Ouden DB, Basilakos A, Stark BC, Fridriksson J, Hickok G. Neurobiol Lang (Camb). 2023 Oct 31;4(4):550-574. doi: 10.1162/nol_a_00117. eCollection 2023. PMID: 37946730 Free PMC article.
Localizing Syntactic Composition with Left-Corner Recurrent Neural Network Grammars.
Sugimoto Y, Yoshida R, Jeong H, Koizumi M, Brennan JR, Oseki Y. Neurobiol Lang (Camb). 2024 Apr 1;5(1):201-224. doi: 10.1162/nol_a_00118. eCollection 2024. PMID: 38645619 Free PMC article.
Mapping the learning curves of deep learning networks.
Jiang Y, Dale R. PLoS Comput Biol. 2025 Feb 10;21(2):e1012286. doi: 10.1371/journal.pcbi.1012286. eCollection 2025 Feb. PMID: 39928655 Free PMC article.
Shared functional specialization in transformer-based language models and the human brain.
Kumar S, Sumers TR, Yamakoshi T, Goldstein A, Hasson U, Norman KA, Griffiths TL, Hawkins RD, Nastase SA. Nat Commun. 2024 Jun 29;15(1):5523. doi: 10.1038/s41467-024-49173-5. PMID: 38951520 Free PMC article.
Language-specific neural dynamics extend syntax into the time domain.
Coopmans CW, de Hoop H, Tezcan F, Hagoort P, Martin AE. PLoS Biol. 2025 Jan 21;23(1):e3002968. doi: 10.1371/journal.pbio.3002968. eCollection 2025 Jan. PMID: 39836653 Free PMC article.
