Speech disturbances in schizophrenia: Assessing cross-linguistic generalizability of NLP automated measures of coherence
- PMID: 35927097
- DOI: 10.1016/j.schres.2022.07.002
Abstract
Introduction: Language disorders - disorganized and incoherent speech in particular - are distinctive features of schizophrenia. Natural language processing (NLP) offers automated measures of incoherent speech as promising markers for schizophrenia. However, the scientific and clinical impact of NLP markers depends on their generalizability across contexts, samples, and languages, which we systematically assessed in the present study relying on a large, novel, cross-linguistic corpus.
Methods: We collected a Danish (DK), German (GE), and Chinese (CH) cross-linguistic dataset involving transcripts from 187 participants with schizophrenia (111 DK, 25 GE, 51 CH) and 200 matched controls (129 DK, 29 GE, 42 CH) performing the Animated Triangles Task. Fourteen previously published NLP coherence measures were calculated, and between-group differences and associations with symptoms were tested for cross-linguistic generalizability.
Results: One coherence measure, i.e. second-order coherence, robustly generalized across samples and languages. We found several language-specific effects, some of which partially replicated previous findings (lower coherence in German and Chinese patients), while others did not (higher coherence in Danish patients). We found several associations between symptoms and measures of coherence, but the effects were generally inconsistent across languages and rating scales.
Conclusions: Using a cumulative approach, we have shown that NLP findings of reduced semantic coherence in schizophrenia have limited generalizability across different languages, samples, and measures. We argue that several factors such as sociodemographic and clinical heterogeneity, cross-linguistic variation, and the different NLP measures reflecting different clinical aspects may be responsible for this variability. Future studies should take this variability into account in order to develop effective clinical applications targeting different patient populations.
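To make the notion of second-order coherence concrete, the following is a minimal sketch, assuming the definition commonly used in this literature: the mean cosine similarity between semantic vectors of speech units (words or phrases) separated by one intervening unit, with first-order coherence comparing adjacent units. The abstract does not specify how the fourteen measures were computed, so the function names, the choice of unit, and the toy vectors below are hypothetical illustrations rather than the authors' pipeline.

```python
import math
from statistics import mean


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def k_order_coherence(vectors, k):
    """Mean cosine similarity between units that are k positions apart.

    k=1 compares adjacent units (first-order coherence);
    k=2 skips one intervening unit (second-order coherence).
    """
    sims = [cosine(vectors[i], vectors[i + k]) for i in range(len(vectors) - k)]
    if not sims:
        raise ValueError("need more than k units to compute coherence")
    return mean(sims)


# Hypothetical 2-dimensional "embeddings" for four consecutive speech units.
# Real analyses would use vectors from a trained semantic model instead.
toy_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]

first_order = k_order_coherence(toy_vectors, 1)
second_order = k_order_coherence(toy_vectors, 2)
print(first_order, second_order)
```

In this toy sequence adjacent units are orthogonal while units two positions apart are identical, so the two measures diverge sharply. This illustrates why coherence computed at different lags can capture different aspects of speech.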
Keywords: Biomarker; Communication disorders; Digital phenotyping; Natural language processing; Schizophrenia spectrum disorder; Semantic coherence; Thought disorder.
Copyright © 2022 The Authors. Published by Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of competing interest: Riccardo Fusaroli has been a paid consultant on related but not overlapping topics for Roche. The other authors have no real or potential conflicts of interest that could have influenced the research.