Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 1;14(1):7789.
doi: 10.1038/s41467-023-43490-x.

Emergence of the cortical encoding of phonetic features in the first year of life

Affiliations

Emergence of the cortical encoding of phonetic features in the first year of life

Giovanni M Di Liberto et al. Nat Commun. .

Abstract

Even prior to producing their first words, infants are developing a sophisticated speech processing system, with robust word recognition present by 4-6 months of age. These emergent linguistic skills, observed with behavioural investigations, are likely to rely on increasingly sophisticated neural underpinnings. The infant brain is known to robustly track the speech envelope, however previous cortical tracking studies were unable to demonstrate the presence of phonetic feature encoding. Here we utilise temporal response functions computed from electrophysiological responses to nursery rhymes to investigate the cortical encoding of phonetic features in a longitudinal cohort of infants when aged 4, 7 and 11 months, as well as adults. The analyses reveal an increasingly detailed and acoustically invariant phonetic encoding emerging over the first year of life, providing neurophysiological evidence that the pre-verbal human cortex learns phonetic categories. By contrast, we found no credible evidence for age-related increases in cortical tracking of the acoustic spectrogram.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Increasing low-frequency EEG tracking of phonetic features but not sound spectrogram in the first year of life.
A Schematic diagram of the analysis paradigm. Multivariate Temporal Response Function (TRF) models were fit to describe the forward relationship between stimulus features and the low-frequency EEG signal recorded from infants (4, 7, and 11 mo) and adults. TRF models were fit for acoustic spectrogram (S; green) and phonetic feature categories (F; orange) separately. EEG prediction correlations (Pearson’s correlation) were calculated for the TRFS and TRFF models with cross-validation. B EEG prediction correlations of the TRFS and TRFF models (average across all channels) for the lowΔ-band (0.1–1 Hz), Δ-band (1–4 Hz), and Θ-band (4–8 Hz; violin plots with mean value across participants). Stars indicate significant effects of age in infants (one-way repeated measures ANOVA; *p ≤ 0.05; **p ≤ 0.01). Statistically significant effects were measured for S in lowΔ-band (p = 0.002) and Δ-band (p = 0.002), but not Θ-band (p = 0.580). Statistically significant effects were measured for F in Δ-band (p = 0.013)and Θ-band (p = 0.004), but not lowΔ-band (p = 0.427). The violin plots show the result distributions and the mean value. C Topographical patterns of the EEG prediction correlations in infants and adults for the F and S models. D Individual-participant trajectories of the EEG prediction correlations for the longitudinal infant cohort. Colours indicate increasing vs. decreasing patterns with age (red and blue respectively). The figure was built using MATLAB software.
Fig. 2
Fig. 2. Low-frequency (Δ-band) cortical encoding of categorical phonetic features in the first year of life.
A Schematic diagram of the analysis paradigm. Forward TRF models were fit to describe the relationship between speech features (including nuisance regressors) and low-frequency EEG signals. Speech features included the acoustic spectrogram (S), half-way rectified envelope derivative (D), visual motion (V), and phonetic features (F). B (Left) Hypotheses: The cortical encoding of phonetic feature categories was expected to progressively increase across the first year of life. Hp1-3: phonetic encoding emerging from 11, 7, and 4 months of age respectively. (Right) Phonetic feature encoding measured as the EEG prediction correlation gain when including phonetic features in the TRF (violin plots with mean value across participants). Only the frequency bands showing significant effects of age for TRFF were studied. Stars indicate significant effects of age in infants (one-way repeated measures ANOVA; *p ≤ 0.05; **p ≤ 0.01, ***p ≤ 0.001). Statistically significant effects were measured in the Δ-band (p = 0.003) and in the Θ-bands (p = 0.009). C TRF weights corresponding to phonetic features for the TRF in Δ- and Θ-bands (1–8 Hz). The figure was built using MATLAB software.

References

    1. Kuhl PK. Early language acquisition: cracking the speech code. Nat. Rev. Neurosci. 2004;5:831–843. doi: 10.1038/nrn1533. - DOI - PubMed
    1. Kuhl P, Rivera-Gaxiola M. Neural substrates of language acquisition. Annu. Rev. Neurosci. 2008;31:511–534. doi: 10.1146/annurev.neuro.30.051606.094321. - DOI - PubMed
    1. Kuhl PK. Brain mechanisms in early language acquisition. Neuron. 2010;67:713–727. doi: 10.1016/j.neuron.2010.08.038. - DOI - PMC - PubMed
    1. Hoehl S, Wahl S. Recording infant ERP data for cognitive research. Dev. Neuropsychol. 2012;37:187–209. doi: 10.1080/87565641.2011.627958. - DOI - PubMed
    1. Wu YJ, et al. Rapid learning of a phonemic discrimination in the first hours of life. Nat. Hum. Behav. 2022;6:1169–1179. doi: 10.1038/s41562-022-01355-1. - DOI - PMC - PubMed

Publication types