Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 2;9(4):533-566.
doi: 10.1007/s41666-025-00213-9. eCollection 2025 Dec.

Through the Speech and Vocal Signals Hidden Secrets: An Explainable Methodology for Neurological Diseases Early Detection

Affiliations

Through the Speech and Vocal Signals Hidden Secrets: An Explainable Methodology for Neurological Diseases Early Detection

Patrizia Vizza et al. J Healthc Inform Res. .

Abstract

Neurodegenerative diseases progressively damage brain and nervous systems impairing their functionality. Early diagnosis can improve the efficacy of treatments and patient's life quality. Biomarkers extracted from the human voice can be a simple, efficient, and non-invasive methodology to screen neurodegenerative diseases such as Parkinson's (PD) and multiple sclerosis (MS). Nevertheless, there is still a lack of reliable and clinically approved methodologies required in large-scale patient applications. We define a methodology for features extracted from voice signals as non-invasive indices for early diagnosis of neurodegenerative diseases. We combine and analyze vowels and speech using a set of machine learning (ML) algorithms trained on a combined set of signal features such as acoustic, articulation, and cepstral ones. The methodology has been fully implemented and applied to a dataset of normophonic and pathological voice signals. Experimental results proved that methodology is able to distinguish healthy from pathological voices, with reliable performances, such as accuracy of 97.5%, sensitivity of 98.5%, precision of 97.0%, F1-score of 98.0%, the Matthews correlation coefficient of 0.95, and AUC of 0.98. Finally, the proposed methodology provides explainability tasks for neurological biomarkers identification from speech and vocal features, confirming its reliability. A github repository with data sample and code is available at https://github.com/PatriziaVizza/SpeechAndVocalSignalsAnalysis.

Keywords: Explainability; Machine learning; Neurodegenerative diseases; Neurological biomarkers; Speech; Voice analysis.

PubMed Disclaimer

Conflict of interest statement

Conflict of InterestThe authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Pipeline of the proposed methodology to report neurological risk indexes. The implemented system processes both speech and vowel signals by extracting relevant features, performs ML analysis and ML explainability, and finally returns a health risk score index. Results explain the correlation between features and (potential) neurological pathology markers
Fig. 2
Fig. 2
Workflow of the proposed methodology. The major steps are (i) acquisition and pre-processing of vowel and speech signals, (ii) feature extraction from acoustic, articulatory, and cepstral domains, (iii) application of machine learning algorithms for vocal anomaly classification, (iv) utilization of explainable machine learning techniques to interpret model predictions, and finally (v) longitudinal module to study the progression of the disease
Fig. 3
Fig. 3
Receiver operating characteristic (ROC) curves are displayed to visually compare the performance of the five classifiers, i.e., SVM (a), XGBoost (b), DT (c), RF (d), and kNN (e). The area scores are also calculated via stratified three-fold cross-validation to quantify their performance. All models exhibit high area values with SVM and XGBoost demonstrating superior discrimination, while DT, though performing well, presents a slightly lower score, suggesting a potential for overfitting or the need for further hyperparameter optimization
Fig. 4
Fig. 4
Confusion matrices for each of the five classification models are reported, showing the comparison between predicted and actual labels. RF and XGBoost demonstrate comparable performance with the most accurate class predictions, where all records of class 1 (i.e., Parkinson) have been associated to the correct class. DT and kNN have the highest false rate. SVM presents a slightly different behavior to RF and XGBoost
Fig. 5
Fig. 5
Deep-learning architecture of the proposed model. The architecture is composed of a series of layers which, starting from the input one on top (i.e. “dense”) extract information from input audio files and, in the output layer at the bottom (i.e. “dense_4”, having 3 neurons), predict the probability of their 3 class (i.e., 0 healthy, 1 parkinson, 2 multiple sclerosis)
Fig. 6
Fig. 6
Deep model performances in terms of accuracy and loss for both training and test, confusion matrix related to the test set, and overall per-class performance of the model
Fig. 7
Fig. 7
Example of force plots for local explainability in healthy (a) and disease (b) instances. In a, a final prediction of -1.32, below the base value of 1.732, is shown, resulting in a healthy classification. In b, a prediction of 4.48, significantly above the base value, indicates a disease classification. Both force plots highlight MFCCmedian, MFCCmean, and QVSA as key features, demonstrating their influential positive and negative contributions to the classification
Fig. 8
Fig. 8
SHAP summary plot for global explainability. MFCC median and MFCCmean values strongly indicate neurological pathologies, while TVSA and QVSA, though less influential, require local analysis due to their variable SHAP values, with blue (positive) SHAP values signifying pathological predictions and red (negative) values indicating healthy predictions

References

    1. Pathak N, Vimal SK, Tandon I, Agrawal L, Hongyi C, Bhattacharyya S (2021) Neurodegenerative disorders of Alzheimer, Parkinsonism, amyotrophic lateral sclerosis and multiple sclerosis: an early diagnostic approach for precision treatment. Metab Brain Dis 1–38. 10.1007/s11011-021-00800-w - PubMed
    1. Tautan A, Ionescu B, Santarnecchi E (2021) Artificial intelligence in neurodegenerative diseases: a review of available tools with a focus on machine learning techniques. Artif Intell Med 117. 10.1016/J.ARTMED.2021.102081 - PubMed
    1. Palumbo A, Calabrese B, Cocorullo G, Lanuzza M, Veltri P, Vizza P, Gambardella A, Sturniolo M (2009) A novel ICA-based hardware system for reconfigurable and portable BCI. In: 2009 IEEE International workshop on medical measurements and applications, pp 95–98. 10.1109/MEMEA.2009.5167962
    1. Chaki J, Wozniak M (2023) Deep learning for neurodegenerative disorder (2016 to 2022): a systematic review. Biomed Signal Process Control 80. 10.1016/j.bspc.2022.104223
    1. Fagherazzi G, Fischer A, Ismael M, Despotovic V (2021) Voice for health: the use of vocal biomarkers from research to clinical practice. Digit Biomark 5(1):78–88. 10.1159/000515346 - DOI - PMC - PubMed

LinkOut - more resources