Using Facebook language to predict and describe excessive alcohol use
- PMID: 35575955
- PMCID: PMC9179895
- DOI: 10.1111/acer.14807
Using Facebook language to predict and describe excessive alcohol use
Abstract
Background: Assessing risk for excessive alcohol use is important for applications ranging from recruitment into research studies to targeted public health messaging. Social media language provides an ecologically embedded source of information for assessing individuals who may be at risk for harmful drinking.
Methods: Using data collected on 3664 respondents from the general population, we examine how accurately language used on social media classifies individuals as at-risk for alcohol problems based on Alcohol Use Disorder Identification Test-Consumption score benchmarks.
Results: We find that social media language is moderately accurate (area under the curve = 0.75) at identifying individuals at risk for alcohol problems (i.e., hazardous drinking/alcohol use disorders) when used with models based on contextual word embeddings. High-risk alcohol use was predicted by individuals' usage of words related to alcohol, partying, informal expressions, swearing, and anger. Low-risk alcohol use was predicted by individuals' usage of social, affiliative, and faith-based words.
Conclusions: The use of social media data to study drinking behavior in the general public is promising and could eventually support primary and secondary prevention efforts among Americans whose at-risk drinking may have otherwise gone "under the radar."
Keywords: excessive alcohol use; natural language processing; social media; subclinical drinking.
© 2022 by the Research Society on Alcoholism.
Figures
References
-
- Akbik A, Bergmann T, Vollgraf R (2019) Pooled contextualized embeddings for named entity recognition, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. pp. 724–728.
-
- Baan R, Straif K, Grosse Y, Secretan B, El Ghissassi F, Bouvard V, Altieri A, Cogliano V (2007) Carcinogenicity of alcoholic beverages. Lancet Oncol 8:292–293. - PubMed
-
- Blank G, Lutz C (2017) Representativeness of social media in Great Britain: Investigating Facebook, LinkedIn, Twitter, Pinterest, Google+, and Instagram. Am Behav Sci 61:741–756.
-
- Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet Allocation. J Mach Learn Res 3:993–1022.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
