Reduced speech coherence in psychosis-related social media forum posts
- PMID: 38965247
- PMCID: PMC11224262
- DOI: 10.1038/s41537-024-00481-1
Abstract
Extracting linguistic markers that indicate the onset and course of mental disorders from social media posts offers great potential for mental healthcare. In the present study, we extracted over one million posts from the popular social media platform Reddit to analyze speech coherence, which reflects formal thought disorder and is a characteristic feature of schizophrenia and associated psychotic disorders. Natural language processing (NLP) models were used to quantify speech coherence automatically. We found that users who are active on forums geared towards disorders with a higher degree of psychotic symptoms tend to show a lower level of coherence. The lowest coherence scores were found in users of forums on dissociative identity disorder, schizophrenia, and bipolar disorder. In contrast, a relatively high level of coherence was detected in users of forums related to obsessive-compulsive disorder, anxiety, and depression. Users of forums on posttraumatic stress disorder, autism, and attention-deficit hyperactivity disorder exhibited medium-level coherence. Our findings provide promising first evidence that NLP-based coherence analyses of publicly available social media posts may be useful for the early detection and prevention of psychosis. This opens new avenues for large-scale prevention programs aimed at high-risk populations.
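The abstract does not say which NLP models or coherence measure were used. As a rough illustration of the general idea only, the sketch below scores a post by the mean cosine similarity between consecutive sentences. It assumes plain bag-of-words vectors in place of learned embeddings, and the function names (`sentence_vectors`, `cosine`, `coherence`) are hypothetical and not taken from the paper.

```python
import math
import re
from collections import Counter


def sentence_vectors(text):
    """Split text into sentences and return one bag-of-words Counter per sentence."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return [Counter(re.findall(r"[a-z']+", s.lower())) for s in sentences]


def cosine(a, b):
    """Cosine similarity between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def coherence(text):
    """Mean similarity of adjacent sentences; None if the text has fewer than two sentences.

    Assumption: word overlap is only a toy stand-in for the embedding-based
    similarity an actual NLP model would provide.
    """
    vecs = sentence_vectors(text)
    if len(vecs) < 2:
        return None
    sims = [cosine(a, b) for a, b in zip(vecs, vecs[1:])]
    return sum(sims) / len(sims)


if __name__ == "__main__":
    on_topic = "I slept badly last night. I slept badly the night before too."
    off_topic = "I slept badly last night. Purple engines whisper quietly."
    print(coherence(on_topic), coherence(off_topic))
```

Under this toy measure, a post whose adjacent sentences share vocabulary scores higher than one that jumps between unrelated topics. A realistic pipeline would replace the word-count vectors with sentence embeddings and aggregate scores per user or per forum.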
© 2024. The Author(s).
Conflict of interest statement
The authors declare no competing interests.