Decoding Hearing-Related Changes in Older Adults' Spatiotemporal Neural Processing of Speech Using Machine Learning

Md Sultan Mahmud et al. Front Neurosci. 2020 Jul 16;14:748. doi: 10.3389/fnins.2020.00748. eCollection 2020.

Abstract

Speech perception in noisy environments depends on complex interactions between sensory and cognitive systems. In older adults, such interactions may be affected, especially in those individuals who have more severe age-related hearing loss. Using a data-driven approach, we assessed the temporal (when in time) and spatial (where in the brain) characteristics of cortical speech-evoked responses that distinguish older adults with or without mild hearing loss. We performed source analyses to estimate cortical surface signals from EEG recordings during a phoneme discrimination task conducted under clear and noise-degraded conditions. We computed source-level ERPs (i.e., mean activation within each ROI) from each of the 68 ROIs of the Desikan-Killiany (DK) atlas, averaged over 100 randomly chosen trials (selected without replacement), to form feature vectors. We adopted a multivariate feature selection method, stability selection and control, to choose features that are consistent over a range of model parameters, and used a parameter-optimized support vector machine (SVM) as the classifier to investigate the time course and brain regions that segregate groups and speech clarity. For clear speech perception, whole-brain data revealed a classification accuracy of 81.50% [area under the curve (AUC) 80.73%; F1-score 82.00%], distinguishing groups within ∼60 ms after speech onset (i.e., as early as the P1 wave). We observed lower accuracy of 78.12% [AUC 77.64%; F1-score 78.00%] and delayed classification performance when speech was embedded in noise, with group segregation at 80 ms. Separate analyses of left (LH) and right hemisphere (RH) regions showed that LH speech activity was better at distinguishing hearing groups than activity measured in the RH. Moreover, stability selection analysis identified 12 brain regions (among 1428 total spatiotemporal features from 68 regions) where source activity segregated groups with >80% accuracy for clear speech, whereas 16 regions were needed to achieve a comparable level of group segregation (78.7% accuracy) for noise-degraded speech. Our results identify critical time courses and brain regions that distinguish mild hearing loss from normal hearing in older adults and confirm a larger number of active areas, particularly in the RH, when processing noise-degraded speech information.
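The following is a minimal, self-contained sketch (Python with NumPy and scikit-learn, run on synthetic data) of the analysis pipeline the abstract describes: trial-averaged source ERPs flattened into 68 ROI × 21 time-point feature vectors, a stability-selection-style screening step implemented here with L1-penalized logistic regression on random subsamples, and a parameter-optimized SVM evaluated with cross-validation. All variable names, data shapes, hyperparameter grids, and the choice of L1 base learner are illustrative assumptions, not the authors' implementation.

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import GridSearchCV, cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import SVC

    rng = np.random.default_rng(0)

    # Synthetic stand-in for source-level ERPs: subjects x trials x ROIs x time points.
    # 68 ROIs x 21 time points gives the 1428 spatiotemporal features noted above.
    n_subjects, n_trials, n_rois, n_times = 30, 400, 68, 21
    erps = rng.standard_normal((n_subjects, n_trials, n_rois, n_times))
    y = np.repeat([0, 1], n_subjects // 2)  # 0 = NH, 1 = HI (synthetic labels)

    # Feature vectors: average 100 randomly chosen trials (without replacement),
    # then flatten ROI x time into one vector per subject.
    def make_features(subject_erps, n_pick=100):
        idx = rng.choice(subject_erps.shape[0], size=n_pick, replace=False)
        return subject_erps[idx].mean(axis=0).ravel()

    X = np.stack([make_features(s) for s in erps])

    # Stability-selection-style screening: fit a sparse (L1) model on many random
    # half-samples and record how often each feature receives a nonzero weight.
    def stability_scores(X, y, n_rounds=50, frac=0.5, C=0.1):
        counts = np.zeros(X.shape[1])
        for _ in range(n_rounds):
            idx = rng.choice(len(y), size=int(frac * len(y)), replace=False)
            clf = LogisticRegression(penalty="l1", solver="liblinear", C=C)
            clf.fit(X[idx], y[idx])
            counts += np.abs(clf.coef_).ravel() > 1e-8
        return counts / n_rounds

    scores = stability_scores(X, y)
    stable = scores >= 0.5                      # e.g., a 0.50 stability threshold
    if stable.sum() < 2:                        # fallback for this synthetic demo
        stable = np.isin(np.arange(X.shape[1]), np.argsort(scores)[-12:])

    # Parameter-optimized SVM on the stable features, nested inside cross-validation.
    svm = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
    grid = GridSearchCV(svm, {"svc__C": [0.1, 1, 10], "svc__gamma": ["scale", 0.01]}, cv=5)
    acc = cross_val_score(grid, X[:, stable], y, cv=5, scoring="accuracy")
    print(f"{int(stable.sum())} stable features; mean CV accuracy = {acc.mean():.2f}")

A time-resolved analysis of the kind reported in the abstract would repeat the screening and classification steps at each time point, and separately for left- and right-hemisphere ROIs, to obtain the time-varying and hemisphere-specific accuracies described above.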

Keywords: aging; event-related potentials; hearing loss; machine learning; speech perception; stability selection and control; support vector machine.


Figures

FIGURE 1
Behavioral audiograms (hearing thresholds) per group. NH, normal-hearing listeners; HI, mild hearing loss listeners; PTA, pure-tone average threshold.
FIGURE 2
Source-level ERPs for the NH and HI groups in representative ROIs. Solid lines = HI; dotted lines = NH. (A) Clear speech responses. (B) Noise-degraded speech responses. Baseline was corrected to the prestimulus interval. NH, normal hearing; HI, hearing impaired; L, Left; R, Right; lPT, parstriangularis L; lPRC, precentral L; rPRC, precentral R; rTRANS, transverse temporal R.
FIGURE 3
Time-varying group classification (NH vs. HI) as a function of neural data (clear and noise conditions) and hemisphere. Group classification accuracy from (A) whole-brain data (all 68 ROIs), (B) LH data alone (34 ROIs), and (C) RH data alone (34 ROIs). LH, left hemisphere; RH, right hemisphere. 0 ms = stimulus onset. The green solid line indicates group segregation during clear speech perception; the red dotted line indicates group segregation during noise-degraded speech perception.
FIGURE 4
Maximum classifier accuracy (y axis) and corresponding latency (x axis) for distinguishing NH and HI listeners using source amplitudes from the whole brain (blue triangle) and from the LH (orange square) vs. RH (green circle) separately. (A) Clear speech responses. (B) Noise-degraded speech responses.
FIGURE 5
Effect of stability score threshold on model performance. The x-axis carries four label rows: Stability score, the stability score range of each bin (scores: 0∼1); Number of features, the number of features in each bin; % features, the corresponding percentage of selected features; ROIs, the cumulative number of unique brain regions up to the lower boundary of the bin. (A) Clear speech. (B) Noise-degraded speech.
FIGURE 6
Stable (most consistent) neural network distinguishing NH and HI listeners during clear speech processing. Visualization of brain ROIs corresponding to the 0.50 stability threshold (the 12 top selected ROIs, which segregate groups at 81.8%) for clear speech perception. (A) LH; (B) RH; (C) Posterior view; (D) Anterior view. Stability score (color legend): (0.70 ≤ pink ≤ 1.0); (0.60 ≤ blue < 0.70); (0.50 ≤ white < 0.60). L, Left; R, Right; rTP, temporal pole R; rFUS, fusiform R; lSP, superior parietal L; rPRC, precentral R; lPRC, precentral L; rCMF, caudal middle frontal R; lPREC, precuneus L; lMT, middle temporal L; rIST, isthmuscingulate R; lBKS, bankssts L; rBKS, bankssts R; lST, superior temporal L.
FIGURE 7
Stable (most consistent) neural network that distinguishes NH and HI listeners during noise-degraded speech processing. 16 top selected ROIs, 78.7% group classification. (A) LH; (B) RH; (C) Posterior view; (D) Anterior view. lRMF, rostral middle frontal L; rIT, inferior temporal R; lIP, inferior parietal L; rPARAC, paracentral R; lPHIP, parahippocampal L; rST, superior temporal R; lPERI, pericalcarine L; rIP, inferior parietal R. Otherwise as in Figure 6.
