A Systematic Review of Natural Language Processing Techniques for Early Detection of Cognitive Impairment
- PMID: 40568612
- PMCID: PMC12190899
- DOI: 10.1016/j.mcpdig.2025.100205
A Systematic Review of Natural Language Processing Techniques for Early Detection of Cognitive Impairment
Abstract
Objective: To systematically evaluate the effectiveness and methodologic approaches of natural language processing (NLP) techniques for early detection of cognitive decline through speech and language analysis.
Methods: We conducted a comprehensive search of 8 databases from inception through August 31, 2024, following Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Studies were included if they used NLP techniques to analyze speech or language data for detecting cognitive impairment and reported diagnostic accuracy metrics. Two independent reviewers (R.S. and A.B.) screened articles and extracted data on study characteristics, NLP methods, and outcomes.
Results: Of 23,562 records identified, 51 studies met inclusion criteria, involving 17,340 participants (mean age, 72.4 years). Combined linguistic and acoustic approaches achieved the highest diagnostic accuracy (average 87%; area under the curve [AUC], 0.89) compared with linguistic-only (83%; AUC, 0.85) or acoustic-only approaches (80%; AUC, 0.82). Lexical diversity, syntactic complexity, and semantic coherence were consistently strong predictors across cognitive conditions. Picture description tasks were most common (n=21), followed by spontaneous speech (n=15) and story recall (n=8). Crosslinguistic applicability was found across 8 languages, although language-specific adaptations were necessary. Longitudinal studies (n=9) reported potential for early detection but were limited by smaller sample sizes (average n=159) compared with cross-sectional studies (n=42; average n=274).
Conclusion: Natural language processing techniques show promising diagnostic accuracy for detecting cognitive impairment across multiple languages and clinical contexts. Although combined linguistic-acoustic approaches appear most effective, methodologic heterogeneity and small sample sizes in existing studies suggest the need for larger, standardized investigations to establish clinical utility.
© 2025 The Authors.
Conflict of interest statement
The authors report no competing interests.
Figures
References
-
- Mirheidari B., Bell S.M., Harkness K., Blackburn D., Christensen H. Spoken language-based automatic cognitive assessment of stroke survivors. Lang Health. 2024;2(1):32–38. doi: 10.1016/j.laheal.2024.01.001. - DOI
Publication types
LinkOut - more resources
Full Text Sources
Miscellaneous
