Quantifying depression-related language on social media during the COVID-19 pandemic
- PMID: 35516163
- PMCID: PMC9052361
- DOI: 10.23889/ijpds.v5i4.1716
Quantifying depression-related language on social media during the COVID-19 pandemic
Abstract
Introduction: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level.
Objectives: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies.
Methods: We create a word embedding based on the posts in Reddit's /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic's beginning through June 2021.
Results: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases.
Conclusions: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health.
Keywords: COVID-19; Twitter; depression; information retrieval; machine learning; public health surveillance; social media.
Conflict of interest statement
Statement on conflicts of interest: The authors have no conflicts of interest to declare.
Figures



Similar articles
-
Emotional Distress During COVID-19 by Mental Health Conditions and Economic Vulnerability: Retrospective Analysis of Survey-Linked Twitter Data With a Semisupervised Machine Learning Algorithm.J Med Internet Res. 2023 Mar 16;25:e44965. doi: 10.2196/44965. J Med Internet Res. 2023. PMID: 36809798 Free PMC article.
-
State and Federal Legislators' Responses on Social Media to the Mental Health and Burnout of Health Care Workers Throughout the COVID-19 Pandemic: Natural Language Processing and Sentiment Analysis.JMIR Infodemiology. 2023 Feb 24;3:e38676. doi: 10.2196/38676. eCollection 2023. JMIR Infodemiology. 2023. PMID: 37013000 Free PMC article.
-
Dynamics of the Negative Discourse Toward COVID-19 Vaccines: Topic Modeling Study and an Annotated Data Set of Twitter Posts.J Med Internet Res. 2023 Apr 12;25:e41319. doi: 10.2196/41319. J Med Internet Res. 2023. PMID: 36877804 Free PMC article.
-
Machine Learning to Detect Self-Reporting of Symptoms, Testing Access, and Recovery Associated With COVID-19 on Twitter: Retrospective Big Data Infoveillance Study.JMIR Public Health Surveill. 2020 Jun 8;6(2):e19509. doi: 10.2196/19509. JMIR Public Health Surveill. 2020. PMID: 32490846 Free PMC article.
-
Depression and Anxiety on Twitter During the COVID-19 Stay-At-Home Period in 7 Major U.S. Cities.AJPM Focus. 2023 Mar;2(1):100062. doi: 10.1016/j.focus.2022.100062. Epub 2022 Dec 22. AJPM Focus. 2023. PMID: 36573174 Free PMC article.
Cited by
-
Breastmilk-Old but Not Obsolete: from the Safety of Breastfeeding During the Coronavirus Disease 2019 Pandemic to Broad Antiviral Drug Development.Matern Fetal Med. 2023 Mar 24;5(2):69-70. doi: 10.1097/FM9.0000000000000188. eCollection 2023 Apr. Matern Fetal Med. 2023. PMID: 40406396 Free PMC article. No abstract available.
-
When Infodemic Meets Epidemic: Systematic Literature Review.JMIR Public Health Surveill. 2025 Feb 3;11:e55642. doi: 10.2196/55642. JMIR Public Health Surveill. 2025. PMID: 39899850 Free PMC article.
-
Using natural language processing to identify patterns associated with depression, anxiety, and stress symptoms during the COVID-19 pandemic.J Affect Disord. 2025 May 1;376:113-121. doi: 10.1016/j.jad.2025.01.139. Epub 2025 Jan 31. J Affect Disord. 2025. PMID: 39892758
References
-
- Bueno-Notivol J., Gracia-García P., Olaya B., Lasheras I., R. López-Antón and Santabárbara J., “Prevalence of depression during the COVID-19 outbreak: A meta-analysis of community-based studies,” International journal of clinical and health psychology, vol. 21, p. 100196, 2021. 10.1016/j.ijchp.2020.07.007 - DOI - PMC - PubMed
-
- Ebrahimi O. V., Hoffart A. and Johnson S. U., “Physical distancing and mental health during the COVID-19 pandemic: Factors associated with psychological symptoms and adherence to pandemic mitigation strategies,” Clinical Psychological Science, vol. 9, p. 489–506, 2021. 10.1177/2167702621994545 - DOI
Publication types
LinkOut - more resources
Full Text Sources
Miscellaneous