Natural language processing in at-risk mental states: enhancing the assessment of thought disorders and psychotic traits with semantic dynamics and graph theory
- PMID: 39074334
- PMCID: PMC11773321
- DOI: 10.47626/1516-4446-2023-3419
Natural language processing in at-risk mental states: enhancing the assessment of thought disorders and psychotic traits with semantic dynamics and graph theory
Abstract
Objective: Verbal communication contains key information for mental health assessment. Researchers have linked psychopathology phenomena to certain counterparts in natural language processing. We characterized subtle impairments in the early stages of psychosis, developing new analysis techniques, which led to a comprehensive map associating features of natural language processing with the full range of clinical presentation.
Methods: We used natural language processing to assess spontaneous and elicited speech by 60 individuals with at-risk mental states and 73 controls who were screened from 4,500 quota-sampled Portuguese speaking residents of São Paulo, Brazil. Psychotic symptoms were independently assessed with the Structured Interview for Psychosis-Risk Syndromes. Speech features (e.g., sentiments and semantic coherence), including novel ones, were correlated with psychotic traits (Spearman's-?) and at-risk mental state status (general linear models and machine-learning ensembles).
Results: Natural language processing features were informative for classification, presenting a balanced accuracy of 86%. Features such as semantic laminarity (as perseveration), semantic recurrence time (as circumstantiality), and average centrality in word repetition graphs carried the most information and were directly correlated with psychotic symptoms. Grammatical tagging (e.g., use of adjectives) was the most relevant standard measure.
Conclusion: Subtle speech impairments can be detected by sensitive methods and can be used in at-risk mental states screening. We have outlined a blueprint for speech-based evaluation, pairing features to standard psychometric items for thought disorder.
Keywords: Psychosis; at-risk mental states; machine learning; natural language processing; screening; semantics.
Conflict of interest statement
FA has provided consulting services and developed technology for private companies. NBM works at Mobile Brain, an Education and Health Tech startup, and has been a consultant to Boehringer Ingelheim. JMG works at Mobile Brain and has provided consulting for developing machine learning models for private companies. The other authors report no conflicts of interest.
Figures



References
-
- Chomsky N. Logical syntax and semantics: their linguistic relevance. Language. 1955;31:36.
-
- Croft W, Cruse DA. Cognitive linguistics. Cambridge: Cambridge University Press; 2004.
-
- Pylkkänen L. The neural basis of combinatory syntax and semantics. Sci. 2019;366:62–6. - PubMed
-
- DeLisi LE. Speech disorder in schizophrenia: review of the literature and exploration of its relation to the uniquely human capacity for language. Schizophr Bull. 2001;27:481–96. - PubMed
MeSH terms
Grants and funding
- 88887.625326/2021-00/CAPES/Coordenação de Aperfeiçoamento de Pessoal de Nível Superior/Brazil
- 223139/Z/21/Z/WT_/Wellcome Trust/United Kingdom
- 2014/50873-3/FAPESP/Fundação de Amparo à Pesquisa do Estado de São Paulo/Brazil
- 565412/2014-9/CNPq/Conselho Nacional de Desenvolvimento Científico e Tecnológico/Brazil
- WT_/Wellcome Trust/United Kingdom
LinkOut - more resources
Full Text Sources
Medical