Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 7:17:1221401.
doi: 10.3389/fnins.2023.1221401. eCollection 2023.

Harnessing acoustic speech parameters to decipher amyloid status in individuals with mild cognitive impairment

Affiliations

Harnessing acoustic speech parameters to decipher amyloid status in individuals with mild cognitive impairment

Fernando García-Gutiérrez et al. Front Neurosci. .

Abstract

Alzheimer's disease (AD) is a neurodegenerative condition characterized by a gradual decline in cognitive functions. Currently, there are no effective treatments for AD, underscoring the importance of identifying individuals in the preclinical stages of mild cognitive impairment (MCI) to enable early interventions. Among the neuropathological events associated with the onset of the disease is the accumulation of amyloid protein in the brain, which correlates with decreased levels of Aβ42 peptide in the cerebrospinal fluid (CSF). Consequently, the development of non-invasive, low-cost, and easy-to-administer proxies for detecting Aβ42 positivity in CSF becomes particularly valuable. A promising approach to achieve this is spontaneous speech analysis, which combined with machine learning (ML) techniques, has proven highly useful in AD. In this study, we examined the relationship between amyloid status in CSF and acoustic features derived from the description of the Cookie Theft picture in MCI patients from a memory clinic. The cohort consisted of fifty-two patients with MCI (mean age 73 years, 65% female, and 57% positive amyloid status). Eighty-eight acoustic parameters were extracted from voice recordings using the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS), and several ML models were used to classify the amyloid status. Furthermore, interpretability techniques were employed to examine the influence of input variables on the determination of amyloid-positive status. The best model, based on acoustic variables, achieved an accuracy of 75% with an area under the curve (AUC) of 0.79 in the prediction of amyloid status evaluated by bootstrapping and Leave-One-Out Cross Validation (LOOCV), outperforming conventional neuropsychological tests (AUC = 0.66). Our results showed that the automated analysis of voice recordings derived from spontaneous speech tests offers valuable insights into AD biomarkers during the preclinical stages. These findings introduce novel possibilities for the use of digital biomarkers to identify subjects at high risk of developing AD.

Keywords: Alzheimer's disease; automated pattern recognition; biomarkers; cerebrospinal fluid; early diagnosis; machine learning; mild cognitive impairment; speech acoustics.

PubMed Disclaimer

Conflict of interest statement

MB has consulted for Araclon, Avid, Grifols, Lilly, Nutricia, Roche, Eisai, and Servier, received fees from lectures and funds for research from Araclon, Biogen, Grifols, Nutricia, Roche, and Servier. She reports grants/research funding from Abbvie, Araclon, Biogen Research Limited, Bioiberica, Grifols, Lilly, S.A., Merck Sharp & Dohme, Kyowa Hakko Kirin, Laboratorios Servier, Nutricia S.R.L., OryzonGenomics, Piramal Imaging Limited, Roche Pharma S.A., and Schwabe Farma Iberica S.L.U., all outside the submitted work, and has not received personal compensation from these organizations. AR is a member of the scientific advisory board of Landsteiner Genmed and Grifols S.A. and has stocks of Landsteiner Genmed. MM has consulted for F. Hoffmann-La Roche Ltd., and has participated in the Spanish Scientific Advisory Board of Biomarkers of Araclon. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Pipeline used to evaluate the goodness-of-fit of all the models used to predict amyloid status. Performance metrics and confidence intervals were calculated from the metric distribution obtained after 5,000 iterations of a nested leave-one-out cross validation (LOOCV) where the training set used for adjusting the models was generated by bootstrapping.
Figure 2
Figure 2
Receiver operating characteristic (ROC) curve for predicting amyloid status based on demographic and neuropsychological, acoustic, and a combination of demographic and acoustic variables. The results correspond to the best models presented in Table 3. For each model, the mean AUC calculated by 5,000 bootstrap iterations as described in Figure 1 is shown. AUC: area under the curve.
Figure 3
Figure 3
Uniform manifold approximation and projection (UMAP) (Sainburg et al., 2020) dimensionality reduction of the most discriminative feature set obtained by the VLPSO feature selection algorithm. (A) Projection highlighting positive and negative amyloid status. (B) Projection of the K-nearest neighbor decision boundary. On the left is the density plot representing the higher presence (red) or absence (blue) of amyloid-positive cases in the data. On the right are shown the KNN predictions of amyloid-positivity, where red indicates that the model assigns a higher probability of amyloid-positivity and blue a lower probability. UMAP hyperparameters: number of neighbors = 8 and minimum distance = 0.1; the rest of the hyperparameters were left as default.
Figure 4
Figure 4
SHapley Additive exPlanations (SHAP) values Lundberg and Lee (2017) of the best model for predicting amyloid status. The SHAP values were calculated on the test set using LOOCV. The best model corresponds to the combination of VLPSO with KNN using acoustic variables (see Table 3). The feature color indicates how it relates to the probability of being amyloid positive or negative. The red color is associated with higher feature values, while blue is associated with lower values. Thus, lower values in the F3-bandwidth (voiced) CoV (blue area) are associated with an increased likelihood of being amyloid positive; and having a lower mean of the spectral Hammarberg index (voiced) (red area) is associated with a lower probability of being amyloid positive. CoV: coefficient of variation; AMean: arithmetic mean; Std: standard deviation.

References

    1. Albert M. S., DeKosky S. T., Dickson D., Dubois B., Feldman H. H., Fox N. C., et al. . (2011). The diagnosis of mild cognitive impairment due to alzheimer's disease: recommendations from the national institute on aging-alzheimer's association workgroups on diagnostic guidelines for alzheimer's disease. Alzheimer's Dement. 7, 270–279. 10.1016/j.jalz.2011.03.008 - DOI - PMC - PubMed
    1. Alegret M., Espinosa A., Valero S., Vinyes-Junque G., Ruiz A., Hernandez I., et al. . (2013). Cut-off scores of a brief neuropsychological battery (nbace) for spanish individual adults older than 44 years old. PLoS ONE 8, e76436. 10.1371/journal.pone.0076436 - DOI - PMC - PubMed
    1. Alegret M., Espinosa A., Vinyes-Junqué G., Valero S., Hernández I., Tárraga L., et al. . (2012). Normative data of a brief neuropsychological battery for spanish individuals older than 49. J. Clin. Exp. Neuropsychol. 34, 209–219. 10.1080/13803395.2011.630652 - DOI - PMC - PubMed
    1. Alzheimer's & Dementia (2023). 2023 Alzheimer's disease facts and figures. Alzheimer's Dement. 19, 1598–1695. 10.1002/alz.13016 - DOI - PubMed
    1. Artiola L., Hermosillo D., Heaton R., Pardee R. (1999). Manual de normas y procedimientos para la baterìa neuropsicológica en español. Tucson: mPress.

LinkOut - more resources