Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Mar 22:7:45141.
doi: 10.1038/srep45141.

Characterisation of mental health conditions in social media using Informed Deep Learning

Affiliations

Characterisation of mental health conditions in social media using Informed Deep Learning

George Gkotsis et al. Sci Rep. .

Erratum in

Abstract

The number of people affected by mental illness is on the increase and with it the burden on health and social care use, as well as the loss of both productivity and quality-adjusted life-years. Natural language processing of electronic health records is increasingly used to study mental health conditions and risk behaviours on a large scale. However, narrative notes written by clinicians do not capture first-hand the patients' own experiences, and only record cross-sectional, professional impressions at the point of care. Social media platforms have become a source of 'in the moment' daily exchange, with topics including well-being and mental health. In this study, we analysed posts from the social media platform Reddit and developed classifiers to recognise and classify posts related to mental illness according to 11 disorder themes. Using a neural network and deep learning approach, we could automatically recognise mental illness-related posts in our balenced dataset with an accuracy of 91.08% and select the correct theme with a weighted average accuracy of 71.37%. We believe that these results are a first step in developing methods to characterise large amounts of user-generated content that could support content curation and targeted interventions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Overall workflow of our approach.
Figure 2
Figure 2. Multiclass classification confusion matrix using a Convolutional Neural Network (CNN) classifier.
Figure 3
Figure 3
Architecture for the Feed Forward (a) and Convolutional Neural Network (b) deep learning approaches.

References

    1. Whiteford H. A. et al.. Global burden of disease attributable to mental and substance use disorders: findings from the global burden of disease study 2010. The Lancet 382, 1575–1586 (2013). - PubMed
    1. Perera G. et al.. Cohort profile of the south london and maudsley nhs foundation trust biomedical research centre (slam brc) case register: current status and recent enhancement of an electronic mental health record-derived data resource. BMJ open 6, e008721 (2016). - PMC - PubMed
    1. Barak-Corren Y. et al.. Predicting suicidal behavior from longitudinal electronic health records. American journal of psychiatryappi-ajp (2016). - PubMed
    1. Shapiro J. S. et al.. Document ontology: supporting narrative documents in electronic health records. In AMIA(2005). - PMC - PubMed
    1. Coppersmith G., Dredze M., Harman C. & Hollingshead K. From adhd to sad: Analyzing the language of mental health on twitter through self-reported diagnoses. In Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality 1–10 (2015).

Publication types