PLoS One. 2019 Jan 16;14(1):e0207741. doi: 10.1371/journal.pone.0207741. eCollection 2019.

Hierarchical structure guides rapid linguistic predictions during naturalistic listening

Jonathan R Brennan et al. PLoS One.

Abstract

The grammar, or syntax, of human language is typically understood in terms of abstract hierarchical structures. However, theories of language processing that emphasize sequential information, not hierarchy, successfully model diverse phenomena. Recent work probing brain signals has shown mixed evidence for hierarchical information in some tasks. We ask whether sequential or hierarchical information guides the expectations that a human listener forms about a word's part-of-speech when simply listening to everyday language. We compare the predictions of three computational models against electroencephalography signals recorded from human participants who listen passively to an audiobook story. We find that predictions based on hierarchical structure correlate with the human brain response above and beyond predictions based only on sequential information. This establishes a link between hierarchical linguistic structure and neural signals that generalizes across the range of syntactic structures found in everyday language.


Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Surprisal distributions from each of three models along with mean surprisal ±1 standard error of the mean (black bars).
Surprisal from a model in which each POS tag is uniformly probable is indicated by the dashed line.
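For reference, surprisal is the negative log probability of an event, so a model that assigns every part-of-speech tag equal probability yields a constant surprisal of log2(N) bits for a tag set of size N. A minimal sketch (the tag-set size below is an illustrative assumption, not a value taken from the paper):

```python
import math

def surprisal(p):
    """Surprisal in bits: -log2 of the probability assigned to the observed event."""
    return -math.log2(p)

# Under a uniform model over N part-of-speech tags, every tag has
# probability 1/N, so surprisal is constant at log2(N) bits.
n_tags = 36  # hypothetical tag-set size for illustration
uniform_surprisal = surprisal(1 / n_tags)  # log2(36), roughly 5.17 bits
```

This uniform value is the baseline marked by the dashed line in Fig 1; model-derived surprisal distributions are compared against it.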
Fig 2
Fig 2. Language models and data analysis.
Word-by-word surprisal values (top) estimated from one hierarchy-based and two sequential language models are time-aligned to the audiobook stimulus (middle). Epochs aligned with the onset of each word are extracted from filtered EEG data and amplitudes from each time-point and electrode serve as the dependent measure of a regression model (right) that includes surprisal values and low-level covariates as predictors.
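The word-by-word surprisal values in this pipeline come from trained language models. As a toy illustration of the sequential case, a bigram model with add-one smoothing can assign each word a surprisal given its predecessor (the tiny corpus and the smoothing scheme here are illustrative assumptions, not the models used in the study):

```python
import math
from collections import Counter

def bigram_surprisal(words):
    """Surprisal (bits) of each word given the previous word, from a
    bigram model with add-one smoothing trained on the same sequence."""
    vocab = set(words)
    unigrams = Counter(words)
    bigrams = Counter(zip(words, words[1:]))
    out = []
    for prev, w in zip(words, words[1:]):
        # Add-one (Laplace) smoothed conditional probability P(w | prev)
        p = (bigrams[(prev, w)] + 1) / (unigrams[prev] + len(vocab))
        out.append(-math.log2(p))
    return out

tokens = "the dog chased the cat".split()
values = bigram_surprisal(tokens)  # one surprisal per word after the first
```

In the actual analysis, per-word surprisal values like these, time-aligned to word onsets, enter a regression alongside low-level covariates to predict EEG amplitudes at each time-point and electrode.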
Fig 3
Fig 3. Whole-head regression results.
β coefficients (M ± CI95) and model-reconstructed ERPs. (A) Regression time-series for log-transformed word frequency fit against content-word measurements. Dark grey shading indicates significant time-points and the inset shows significant channels and coefficient averages across significant time-points. (B) Estimated content-word ERP for three word frequency values ranging from low (200 corpus counts) to high (2,000,000 corpus counts) from a representative central channel (inset) shows a classic N400 effect. (C) Regression time-series for hierarchical CFG surprisal fit against content-word data (inset and shading as in (A)). (D) Estimated content-word ERP for three CFG surprisal values from a representative channel (inset). (E) Regression time-series for sequential Ngram surprisal fit against function-word data (inset and shading as in (A)). (F) Estimated function-word ERP for three Ngram surprisal values from a representative channel (inset).
Fig 4
Fig 4. Model comparison results.
WAIC difference scores (± standard error) indicate changes in model fit across six ROIs (columns). Each set of rows tests a different statistical question using step-wise model comparison. Terms that are being evaluated are indicated to the left of “>”; interactions with word-class are indicated with “:WC”. For each row-set, the baseline model includes all control covariates along with the indicated surprisal term(s) and interactions between word-class and those surprisal terms. The WAIC values are scaled so that positive numbers represent improvements for the larger model, while negative numbers indicate that the added complexity of the larger model is not matched by a better fit. Bold-face indicates WAIC improvements that are more than two standard errors from zero.
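WAIC is computed from pointwise log-likelihoods across posterior samples, and difference scores with standard errors follow from the pointwise contributions. A generic sketch of the conventional computation (not necessarily the authors' exact implementation):

```python
import math
from statistics import mean, variance

def waic(loglik):
    """WAIC from a matrix of pointwise log-likelihoods:
    loglik[s][i] = log p(y_i | theta_s) for posterior sample s, observation i.
    Returns (waic, pointwise) on the deviance scale (-2 * elpd)."""
    n_obs = len(loglik[0])
    pointwise = []
    for i in range(n_obs):
        col = [row[i] for row in loglik]
        # Log pointwise predictive density via log-mean-exp (numerically stable)
        m = max(col)
        lppd_i = m + math.log(mean(math.exp(c - m) for c in col))
        p_i = variance(col)  # effective-parameter penalty (variance of log-lik)
        pointwise.append(-2 * (lppd_i - p_i))
    return sum(pointwise), pointwise

def waic_difference(ll_small, ll_large):
    """WAIC difference (smaller minus larger model) with its standard error,
    estimated from the pointwise contributions; positive favors the larger model."""
    _, pw_small = waic(ll_small)
    _, pw_large = waic(ll_large)
    diffs = [a - b for a, b in zip(pw_small, pw_large)]
    se = math.sqrt(len(diffs) * variance(diffs))
    return sum(diffs), se
```

As in the caption's convention, a positive difference indicates that the larger model fits better; differences more than two standard errors from zero are the ones the paper treats as reliable.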
