Detecting Parkinson Disease Using a Web-Based Speech Task: Observational Study
- PMID: 34665148
- PMCID: PMC8564663
- DOI: 10.2196/26305
Detecting Parkinson Disease Using a Web-Based Speech Task: Observational Study
Abstract
Background: Access to neurological care for Parkinson disease (PD) is a rare privilege for millions of people worldwide, especially in resource-limited countries. In 2013, there were just 1200 neurologists in India for a population of 1.3 billion people; in Africa, the average population per neurologist exceeds 3.3 million people. In contrast, 60,000 people receive a diagnosis of PD every year in the United States alone, and similar patterns of rising PD cases-fueled mostly by environmental pollution and an aging population-can be seen worldwide. The current projection of more than 12 million patients with PD worldwide by 2040 is only part of the picture given that more than 20% of patients with PD remain undiagnosed. Timely diagnosis and frequent assessment are key to ensure timely and appropriate medical intervention, thus improving the quality of life of patients with PD.
Objective: In this paper, we propose a web-based framework that can help anyone anywhere around the world record a short speech task and analyze the recorded data to screen for PD.
Methods: We collected data from 726 unique participants (PD: 262/726, 36.1% were women; non-PD: 464/726, 63.9% were women; average age 61 years) from all over the United States and beyond. A small portion of the data (approximately 54/726, 7.4%) was collected in a laboratory setting to compare the performance of the models trained with noisy home environment data against high-quality laboratory-environment data. The participants were instructed to utter a popular pangram containing all the letters in the English alphabet, "the quick brown fox jumps over the lazy dog." We extracted both standard acoustic features (mel-frequency cepstral coefficients and jitter and shimmer variants) and deep learning-based embedding features from the speech data. Using these features, we trained several machine learning algorithms. We also applied model interpretation techniques such as Shapley additive explanations to ascertain the importance of each feature in determining the model's output.
Results: We achieved an area under the curve of 0.753 for determining the presence of self-reported PD by modeling the standard acoustic features through the XGBoost-a gradient-boosted decision tree model. Further analysis revealed that the widely used mel-frequency cepstral coefficient features and a subset of previously validated dysphonia features designed for detecting PD from a verbal phonation task (pronouncing "ahh") influence the model's decision the most.
Conclusions: Our model performed equally well on data collected in a controlled laboratory environment and in the wild across different gender and age groups. Using this tool, we can collect data from almost anyone anywhere with an audio-enabled device and help the participants screen for PD remotely, contributing to equity and access in neurological care.
Keywords: Parkinson’s disease; improving access and equity in health care; mobile phone; speech analysis.
©Wasifur Rahman, Sangwu Lee, Md Saiful Islam, Victor Nikhil Antony, Harshil Ratnu, Mohammad Rafayet Ali, Abdullah Al Mamun, Ellen Wagner, Stella Jensen-Roberts, Emma Waddell, Taylor Myers, Meghan Pawlik, Julia Soto, Madeleine Coffey, Aayush Sarkar, Ruth Schneider, Christopher Tarolli, Karlo Lizarraga, Jamie Adams, Max A Little, E Ray Dorsey, Ehsan Hoque. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 19.10.2021.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures






Similar articles
-
Deep learning in automatic detection of dysphonia: Comparing acoustic features and developing a generalizable framework.Int J Lang Commun Disord. 2023 Mar;58(2):279-294. doi: 10.1111/1460-6984.12783. Epub 2022 Sep 18. Int J Lang Commun Disord. 2023. PMID: 36117378
-
Non-invasive detection of Parkinson's disease based on speech analysis and interpretable machine learning.Front Aging Neurosci. 2025 Apr 30;17:1586273. doi: 10.3389/fnagi.2025.1586273. eCollection 2025. Front Aging Neurosci. 2025. PMID: 40370753 Free PMC article.
-
Analysis of Voice, Speech, and Language Biomarkers of Parkinson's Disease Collected in a Mixed Reality Setting.Sensors (Basel). 2025 Apr 10;25(8):2405. doi: 10.3390/s25082405. Sensors (Basel). 2025. PMID: 40285095 Free PMC article.
-
Cognitive training interventions for dementia and mild cognitive impairment in Parkinson's disease.Cochrane Database Syst Rev. 2020 Feb 26;2(2):CD011961. doi: 10.1002/14651858.CD011961.pub2. Cochrane Database Syst Rev. 2020. PMID: 32101639 Free PMC article.
-
Application of Deep Learning Models for Automated Identification of Parkinson's Disease: A Review (2011-2021).Sensors (Basel). 2021 Oct 23;21(21):7034. doi: 10.3390/s21217034. Sensors (Basel). 2021. PMID: 34770340 Free PMC article. Review.
Cited by
-
Assessment Tasks and Virtual Exergames for Remote Monitoring of Parkinson's Disease: An Integrated Approach Based on Azure Kinect.Sensors (Basel). 2022 Oct 25;22(21):8173. doi: 10.3390/s22218173. Sensors (Basel). 2022. PMID: 36365870 Free PMC article.
-
Explainable machine learning for predicting conversion to neurological disease: Results from 52,939 medical records.Digit Health. 2024 Apr 28;10:20552076241249286. doi: 10.1177/20552076241249286. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 38686337 Free PMC article.
-
Clinical Decision Support Using Speech Signal Analysis: Systematic Scoping Review of Neurological Disorders.J Med Internet Res. 2025 Jan 13;27:e63004. doi: 10.2196/63004. J Med Internet Res. 2025. PMID: 39804693 Free PMC article.
-
Analyzing Wav2Vec 1.0 Embeddings for Cross-Database Parkinson's Disease Detection and Speech Features Extraction.Sensors (Basel). 2024 Aug 26;24(17):5520. doi: 10.3390/s24175520. Sensors (Basel). 2024. PMID: 39275431 Free PMC article.
-
Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review.J Med Internet Res. 2023 Jul 19;25:e46105. doi: 10.2196/46105. J Med Internet Res. 2023. PMID: 37467031 Free PMC article.
References
-
- Khadilkar S. Neurology in India. Ann Indian Acad Neurol. 2013 Oct;16(4):465–6. doi: 10.4103/0972-2327.120415. http://www.annalsofian.org/article.asp?issn=0972-2327;year=2013;volume=1... AIAN-16-465 - DOI - PMC - PubMed
-
- Goetz CG, Tilley BC, Shaftman SR, Stebbins GT, Fahn S, Martinez-Martin P, Poewe W, Sampaio C, Stern MB, Dodel R, Dubois B, Holloway R, Jankovic J, Kulisevsky J, Lang AE, Lees A, Leurgans S, LeWitt PA, Nyenhuis D, Olanow CW, Rascol O, Schrag A, Teresi JA, van Hilten JJ, LaPelle N, Movement Disorder Society UPDRS Revision Task Force Movement Disorder Society-sponsored revision of the Unified Parkinson's Disease Rating Scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov Disord. 2008 Nov 15;23(15):2129–70. doi: 10.1002/mds.22340. - DOI - PubMed
-
- Ali M, Hernandez J, Dorsey E, Hoque E, McDuff D. Spatio-temporal attention and magnification for classification of Parkinson’s disease from videos collected via the internet. Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020); 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020); November 16-20, 2020; Buenos Aires, Argentina. 2020. - DOI
-
- Lonini L, Dai A, Shawen N, Simuni T, Poon C, Shimanovich L, Daeschler M, Ghaffari R, Rogers JA, Jayaraman A. Wearable sensors for Parkinson's disease: which data are worth collecting for training symptom detection models. NPJ Digit Med. 2018 Nov 23;1(1):64–8. doi: 10.1038/s41746-018-0071-z. doi: 10.1038/s41746-018-0071-z.71 - DOI - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical