J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705

Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach

Hannah Metzler et al. J Med Internet Res. 2022.

Abstract

Background: Research has repeatedly shown that exposure to suicide-related news media content is associated with suicide rates, with some content characteristics likely having harmful and others potentially protective effects. Although good evidence exists for a few selected characteristics, systematic and large-scale investigations are lacking. Moreover, the growing importance of social media, particularly among young adults, calls for studies on the effects of the content posted on these platforms.

Objective: This study applies natural language processing and machine learning methods to classify large quantities of social media data according to characteristics identified as potentially harmful or beneficial in media effects research on suicide and prevention.

Methods: We manually labeled 3202 English tweets using a novel annotation scheme that classifies suicide-related tweets into 12 categories. Based on these categories, we trained a benchmark of machine learning models for a multiclass and a binary classification task. As models, we included a majority classifier, an approach based on word frequency (term frequency-inverse document frequency with a linear support vector machine) and 2 state-of-the-art deep learning models (Bidirectional Encoder Representations from Transformers [BERT] and XLNet). The first task classified posts into 6 main content categories, which are particularly relevant for suicide prevention based on previous evidence. These included personal stories of either suicidal ideation and attempts or coping and recovery, calls for action intending to spread either problem awareness or prevention-related information, reporting of suicide cases, and other tweets irrelevant to these 5 categories. The second classification task was binary and separated posts in the 11 categories referring to actual suicide from posts in the off-topic category, which use suicide-related terms in another meaning or context.
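To make the word-frequency baseline concrete, the sketch below shows one way such a pipeline could be assembled with scikit-learn: TF-IDF features feeding a linear support vector machine. The file name, column names, and hyperparameters are illustrative assumptions, not the authors' actual setup; the transformer models (BERT and XLNet) would instead be fine-tuned on the same labeled tweets.

```python
# Minimal sketch of the word-frequency baseline described above (TF-IDF + linear SVM).
# "labeled_tweets.csv" and the column names "text"/"category" are hypothetical.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
from sklearn.metrics import classification_report

df = pd.read_csv("labeled_tweets.csv")  # hypothetical labeled data set

X_train, X_test, y_train, y_test = train_test_split(
    df["text"], df["category"], test_size=0.2,
    stratify=df["category"], random_state=42,
)

# TF-IDF turns each tweet into a sparse word-frequency vector;
# LinearSVC learns one-vs-rest hyperplanes over the content categories.
baseline = make_pipeline(
    TfidfVectorizer(lowercase=True, ngram_range=(1, 2), min_df=2),
    LinearSVC(C=1.0),
)
baseline.fit(X_train, y_train)
print(classification_report(y_test, baseline.predict(X_test)))
```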

Results: In both tasks, the performance of the 2 deep learning models was very similar and better than that of the majority or the word frequency classifier. BERT and XLNet reached accuracy scores above 73% on average across the 6 main categories in the test set and F1-scores between 0.69 and 0.85 for all but the suicidal ideation and attempts category (F1=0.55). In the binary classification task, they correctly labeled around 88% of the tweets as about suicide versus off-topic, with BERT achieving F1-scores of 0.93 and 0.74, respectively. These classification performances were similar to human performance in most cases and were comparable with state-of-the-art models on similar tasks.
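As a rough illustration of how such scores are typically computed from model predictions, the snippet below uses scikit-learn's metrics; the category names and label arrays are hypothetical placeholders, not the study's data.

```python
from sklearn.metrics import accuracy_score, f1_score

# Hypothetical gold labels and model predictions for the main categories
y_true = ["coping", "suicidality", "awareness", "prevention", "case_report", "irrelevant", "suicidality", "coping"]
y_pred = ["coping", "suicidality", "awareness", "prevention", "case_report", "irrelevant", "awareness",  "coping"]

labels = sorted(set(y_true))
print("accuracy:", accuracy_score(y_true, y_pred))
# average=None yields one F1 score per category, matching per-class reporting
print("per-class F1:", dict(zip(labels, f1_score(y_true, y_pred, average=None, labels=labels))))
```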

Conclusions: The achieved performance scores highlight machine learning as a useful tool for media effects research on suicide. The clear advantage of BERT and XLNet suggests that the context of words carries crucial information about meaning beyond mere word frequencies in tweets about suicide. By making data labeling more efficient, this work enables large-scale investigations of harmful and protective associations of social media content with suicide rates and help-seeking behavior.

Keywords: Twitter; deep learning; machine learning; social media; suicide prevention.

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Creating the labeled data set and annotation scheme. Each box describes how tweets were selected from the large pool of available tweets, how many tweets were added to the training data set in each step (after removing duplicates), and how many coders labeled each tweet. When we used preliminary model predictions to identify potential candidates for each category, we deleted the model labels before manual coding. After rounds with 2 coders, we checked interrater reliability, adapted the annotation scheme until all disagreements were clarified, and relabeled the respective sample.
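Figure 1 mentions checking interrater reliability between coders. Below is a minimal sketch of one common agreement measure, Cohen's kappa, computed with scikit-learn; the label lists are hypothetical, and the study may have used a different reliability coefficient.

```python
from sklearn.metrics import cohen_kappa_score

# Hypothetical labels assigned by two coders to the same five tweets
coder1 = ["prevention", "suicide_case", "off-topic", "coping", "coping"]
coder2 = ["prevention", "suicide_case", "coping",    "coping", "coping"]

# Kappa corrects raw agreement for the agreement expected by chance
print("Cohen's kappa:", cohen_kappa_score(coder1, coder2))
```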
Figure 2
Overview of characteristics of data sets. Each box describes the purpose of the data set, further details on how it was used or created, and the sample size. Only the predictions data set includes retweets, as it aims to capture the full volume of tweets posted on a given day. BERT: Bidirectional Encoder Representations from Transformers.
Figure 3
Performance scores per category for Bidirectional Encoder Representations from Transformers (BERT) for the 6 main categories (A) and for tweets about actual suicide versus off-topic tweets (B).
Figure 4
Confusion matrix of true and predicted labels in the reliability data set: (A) percentages and (B) counts of tweets per true and predicted category. The diagonal from bottom left to top right represents correct predictions. True labels are those assigned by coder 1; predicted labels are those assigned by Bidirectional Encoder Representations from Transformers (BERT).
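A brief sketch of how the percentage panel of such a matrix can be derived: raw counts are row-normalized so that each true category sums to 100%. The labels and arrays below are hypothetical placeholders.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical coder-1 (true) labels and model (predicted) labels
y_true = ["coping", "coping", "awareness", "suicide_case", "awareness", "coping"]
y_pred = ["coping", "awareness", "awareness", "suicide_case", "awareness", "coping"]
labels = ["awareness", "coping", "suicide_case"]

counts = confusion_matrix(y_true, y_pred, labels=labels)
percent = counts / counts.sum(axis=1, keepdims=True) * 100  # normalize per true category
print(counts)
print(np.round(percent, 1))
```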
Figure 5
Daily percentage of tweets per predicted category in the predictions data set (n=7.15 million). The daily value includes both original tweets and retweets per category. Keywords for event peaks are explained in the main text.
