Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 7;8(1):1.
doi: 10.1038/s41537-021-00197-6.

Identifying schizophrenia stigma on Twitter: a proof of principle model using service user supervised machine learning

Affiliations

Identifying schizophrenia stigma on Twitter: a proof of principle model using service user supervised machine learning

Sagar Jilka et al. Schizophrenia (Heidelb). .

Abstract

Stigma has negative effects on people with mental health problems by making them less likely to seek help. We develop a proof of principle service user supervised machine learning pipeline to identify stigmatising tweets reliably and understand the prevalence of public schizophrenia stigma on Twitter. A service user group advised on the machine learning model evaluation metric (fewest false negatives) and features for machine learning. We collected 13,313 public tweets on schizophrenia between January and May 2018. Two service user researchers manually identified stigma in 746 English tweets; 80% were used to train eight models, and 20% for testing. The two models with fewest false negatives were compared in two service user validation exercises, and the best model used to classify all extracted public English tweets. Tweets classed as stigmatising by service users were more negative in sentiment (t (744) = 12.02, p < 0.001 [95% CI: 0.196-0.273]). Our linear Support Vector Machine was the best performing model with fewest false negatives and higher service user validation. This model identified public stigma in 47% of English tweets (n5,676) which were more negative in sentiment (t (12,143) = 64.38, p < 0.001 [95% CI: 0.29-0.31]). Machine learning can identify stigmatising tweets at large scale, with service user involvement. Given the prevalence of stigma, there is an urgent need for education and online campaigns to reduce it. Machine learning can provide a real time metric on their success.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. An overview of our methodology and data collection.
An outline of the number of tweets used for each section of the methodology and how they were used.

Similar articles

Cited by

References

    1. Berry N, et al. #WhyWeTweetMH: Understanding why people use Twitter to discuss mental health problems. J. Med. Internet Res. 2007;19:e107. doi: 10.2196/jmir.6173. - DOI - PMC - PubMed
    1. Gulliver A, et al. Internet-based interventions to promote mental health help-seeking in elite athletes: An exploratory randomized controlled trial. J. Med. Internet Res. 2012;14:e69. doi: 10.2196/jmir.1864. - DOI - PMC - PubMed
    1. Yap M, Reavley N, Jorm A. Where would young people seek help for mental disorders and what stops them? Findings from an Australian national survey. J. Affect Disord. 2013;147:255–261. doi: 10.1016/j.jad.2012.11.014. - DOI - PubMed
    1. Crawford, M. et al. Survey of review spam detection using machine learning techniques. J. Big Data10.1186/s40537-015-0029-9 (2015).
    1. Peng Z, Hu Q, Dang J. Multi-kernel SVM based depression recognition using social media data. IJMLC. 2017;10:43–57.