Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022;78(4):4709-4744.
doi: 10.1007/s11227-021-04040-8. Epub 2021 Sep 9.

Automatic detection of depression symptoms in twitter using multimodal analysis

Affiliations

Automatic detection of depression symptoms in twitter using multimodal analysis

Ramin Safa et al. J Supercomput. 2022.

Abstract

Depression is the most prevalent mental disorder that can lead to suicide. Due to the tendency of people to share their thoughts on social platforms, social data contain valuable information that can be used to identify user's psychological states. In this paper, we provide an automated approach to collect and evaluate tweets based on self-reported statements and present a novel multimodal framework to predict depression symptoms from user profiles. We used n-gram language models, LIWC dictionaries, automatic image tagging, and bag-of-visual-words. We consider the correlation-based feature selection and nine different classifiers with standard evaluation metrics to assess the effectiveness of the method. Based on the analysis, the tweets and bio-text alone showed 91% and 83% accuracy in predicting depressive symptoms, respectively, which seems to be an acceptable result. We also believe performance improvements can be achieved by limiting the user domain or presence of clinical information.

Keywords: Depression detection; Mental health; Multimodal framework; Social media; Text mining.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The growing number of US youths with a major depressive episode from 2004 to 2019, by gender [6]
Fig. 2
Fig. 2
The high-level architecture of the proposed framework consists of three main modules: 1) data collection and dataset building, 2) cross-analysis, and 3) classification
Fig. 3
Fig. 3
The confirmed range of the polarity score, after initial filtering
Fig. 4
Fig. 4
The structure of the term-document in the case study (T = Term, D = Document, W = Weight)
Fig. 5
Fig. 5
The proportion of distinct LIWC categories
Fig. 5
Fig. 5
The proportion of distinct LIWC categories
Fig. 6
Fig. 6
Unigrams and bigrams word cloud
Fig. 7
Fig. 7
The Pearson correlation heatmap among the LIWC dictionaries for both Dtweets+ and Dtweets-
Fig. 8
Fig. 8
Comparison of the highest accuracy achieved on feature types by nine different classifiers
Fig. 9
Fig. 9
Comparison of the highest accuracy achieved by tweets analyzing methods
Fig. 10
Fig. 10
Comparison of the highest accuracy achieved by bios analyzing methods
Fig. 11
Fig. 11
ROC curves of the target classifiers
Fig. 12
Fig. 12
Comparison of the highest accuracy achieved by tweets analyzing methods via SVD approach
Fig. 13
Fig. 13
ROC curves of the target classifiers for SVD analysis

References

    1. Gao J, et al. Mental health problems and social media exposure during COVID-19 outbreak. PLoS ONE. 2020;15(4):e0231924. doi: 10.1371/journal.pone.0231924. - DOI - PMC - PubMed
    1. Martínez-Castaño R, Pichel JC, Losada DE. A big data platform for real time analysis of signs of depression in social media. Int J Environ Res Public Health. 2020;17(13):4752. doi: 10.3390/ijerph17134752. - DOI - PMC - PubMed
    1. Ríssola EA, Bahrainian SA, Crestani F. (2020) A Dataset for Research on Depression in Social Media. In Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization, pp. 338–342.
    1. James SL, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the global burden of disease study 2017. The Lancet. 2018;392(10159):1789–1858. doi: 10.1016/S0140-6736(18)32279-7. - DOI - PMC - PubMed
    1. Bloom DE et al., (2012) The global economic burden of noncommunicable diseases. Program on the Global Demography of Aging

LinkOut - more resources