Dynamic suicide topic modelling: Deriving population-specific, psychosocial and time-sensitive suicide risk variables from Electronic Health Record psychotherapy notes
- PMID: 36797651
- PMCID: PMC11172400
- DOI: 10.1002/cpp.2842
Abstract
In the machine learning subfield of natural language processing, a topic model is an unsupervised method used to uncover abstract topics within a corpus of text. Dynamic topic modelling (DTM) captures change in these topics over time. This study applies DTM to a corpus of electronic health record psychotherapy notes. This retrospective study examines whether DTM helps distinguish closely matched patients who did and did not die by suicide. The cohort consists of United States Department of Veterans Affairs (VA) patients diagnosed with Posttraumatic Stress Disorder (PTSD) between 2004 and 2013. Each case (a patient who died by suicide during the year following diagnosis) was matched with five controls (patients who remained alive) who shared psychotherapists and had similar suicide risk based on VA's suicide prediction algorithm. The cohort was restricted to patients who received psychotherapy for 9+ months after initial PTSD diagnosis (cases = 77; controls = 362). For cases, psychotherapy notes from diagnosis until death were examined. For controls, psychotherapy notes from diagnosis until the matched case's death date were examined. A Python-based DTM algorithm was used. Derived topics identified population-specific themes, including PTSD, psychotherapy, medication, communication and relationships. Control topics changed significantly more over time than case topics. Topic differences highlighted engagement, expressivity and therapeutic alliance. This study strengthens the groundwork for deriving population-specific, psychosocial and time-sensitive suicide risk variables.
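As an illustration of what "topics changing over time" can mean in practice, the following minimal sketch scores how much one topic's word distribution shifts between consecutive time slices. It is not the authors' method: the abstract does not state how topic change was measured, so the use of Jensen-Shannon divergence, the function names `js_divergence` and `topic_drift`, and the toy word probabilities are all assumptions made for illustration.

```python
import math


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two word distributions.

    p and q are dicts mapping word -> probability. The result is 0 for
    identical distributions and 1 for distributions with no shared words.
    """
    vocab = set(p) | set(q)
    # Mixture distribution over the combined vocabulary.
    m = {w: 0.5 * (p.get(w, 0.0) + q.get(w, 0.0)) for w in vocab}

    def kl(a, b):
        # Kullback-Leibler divergence; zero-probability words contribute 0.
        return sum(a[w] * math.log2(a[w] / b[w]) for w in a if a[w] > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def topic_drift(slices):
    """Mean divergence between consecutive time slices of a single topic."""
    steps = [js_divergence(a, b) for a, b in zip(slices, slices[1:])]
    return sum(steps) / len(steps)


# Hypothetical toy data: one topic observed in three time slices.
stable_topic = [{"medication": 0.5, "sleep": 0.5}] * 3
shifting_topic = [
    {"medication": 1.0},
    {"medication": 0.5, "relationship": 0.5},
    {"relationship": 1.0},
]

print("stable drift:  ", topic_drift(stable_topic))
print("shifting drift:", topic_drift(shifting_topic))
```

Under these assumptions, a topic whose vocabulary stays fixed scores zero, and a topic whose vocabulary migrates scores higher. A group-level comparison such as the one reported for controls versus cases would compare scores of this kind across the two groups.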
Keywords: dynamic topic models; electronic medical records; natural language processing; suicide prediction.
Published 2023. This article is a U.S. Government work and is in the public domain in the USA.
Conflict of interest statement
The authors have no conflict of interest.
Similar articles
- Using natural language processing to evaluate temporal patterns in suicide risk variation among high-risk Veterans. Psychiatry Res. 2024 Sep;339:116097. doi: 10.1016/j.psychres.2024.116097. Epub 2024 Jul 27. PMID: 39083961 Free PMC article.
- Natural language processing of clinical mental health notes may add predictive value to existing suicide risk models. Psychol Med. 2021 Jun;51(8):1382-1391. doi: 10.1017/S0033291720000173. Epub 2020 Feb 17. PMID: 32063248 Free PMC article.
- Using Natural Language Processing to develop risk-tier specific suicide prediction models for Veterans Affairs patients. J Psychiatr Res. 2024 Nov;179:322-329. doi: 10.1016/j.jpsychires.2024.09.031. Epub 2024 Sep 24. PMID: 39353293 Free PMC article.
- Review: managing posttraumatic stress disorder in combat veterans with comorbid traumatic brain injury. J Rehabil Res Dev. 2012;49(5):789-812. doi: 10.1682/jrrd.2011.10.0185. PMID: 23015586 Review.
- Psychodynamic Treatment of Combat Veterans with PTSD at Risk for Suicide. Psychodyn Psychiatry. 2017 Summer;45(2):217-235. doi: 10.1521/pdps.2017.45.2.217. PMID: 28590209 Review.
Cited by
- Investigating the Differential Impact of Psychosocial Factors by Patient Characteristics and Demographics on Veteran Suicide Risk Through Machine Learning Extraction of Cross-Modal Interactions. Pac Symp Biocomput. 2025;30:167-184. doi: 10.1142/9789819807024_0013. PMID: 39670369 Free PMC article.
- Using natural language processing to evaluate temporal patterns in suicide risk variation among high-risk Veterans. Psychiatry Res. 2024 Sep;339:116097. doi: 10.1016/j.psychres.2024.116097. Epub 2024 Jul 27. PMID: 39083961 Free PMC article.
- Automatically extracting social determinants of health for suicide: a narrative literature review. Npj Ment Health Res. 2024 Nov 6;3(1):51. doi: 10.1038/s44184-024-00087-6. PMID: 39506139 Free PMC article. Review.
- Characterizing Veteran suicide decedents that were not classified as high-suicide-risk. Psychol Med. 2024 Aug;54(11):3135-3144. doi: 10.1017/S0033291724001296. Epub 2024 Sep 16. PMID: 39282853 Free PMC article.
- Psychotherapist remarks' ML classifier: insights from LLM and topic modeling application. Front Psychiatry. 2025 Jul 25;16:1608163. doi: 10.3389/fpsyt.2025.1608163. eCollection 2025. PMID: 40787679 Free PMC article.