Epidemiology from Tweets: Estimating Misuse of Prescription Opioids in the USA from Social Media
- PMID: 28831738
- PMCID: PMC5711756
- DOI: 10.1007/s13181-017-0625-5
Epidemiology from Tweets: Estimating Misuse of Prescription Opioids in the USA from Social Media
Abstract
Background: The misuse of prescription opioids (MUPO) is a leading public health concern. Social media are playing an expanded role in public health research, but there are few methods for estimating established epidemiological metrics from social media. The purpose of this study was to demonstrate that the geographic variation of social media posts mentioning prescription opioid misuse strongly correlates with government estimates of MUPO in the last month.
Methods: We wrote software to acquire publicly available tweets from Twitter from 2012 to 2014 that contained at least one keyword related to prescription opioid use (n = 3,611,528). A medical toxicologist and emergency physician curated the list of keywords. We used the semantic distance (SemD) to automatically quantify the similarity of meaning between tweets and identify tweets that mentioned MUPO. We defined the SemD between two words as the shortest distance between the two corresponding word-centroids. Each word-centroid represented all recognized meanings of a word. We validated this automatic identification with manual curation. We used Twitter metadata to estimate the location of each tweet. We compared our estimated geographic distribution with the 2013-2015 National Surveys on Drug Usage and Health (NSDUH).
Results: Tweets that mentioned MUPO formed a distinct cluster far away from semantically unrelated tweets. The state-by-state correlation between Twitter and NSDUH was highly significant across all NSDUH survey years. The correlation was strongest between Twitter and NSDUH data from those aged 18-25 (r = 0.94, p < 0.01 for 2012; r = 0.94, p < 0.01 for 2013; r = 0.71, p = 0.02 for 2014). The correlation was driven by discussions of opioid use, even after controlling for geographic variation in Twitter usage.
Conclusions: Mentions of MUPO on Twitter correlate strongly with state-by-state NSDUH estimates of MUPO. We have also demonstrated that a natural language processing can be used to analyze social media to provide insights for syndromic toxicosurveillance.
Keywords: Computational linguistics; Epidemiology; Misuse; Natural language processing; Opioids; Social media.
Conflict of interest statement
Conflict of Interest
The authors declare that they have no conflicts of interest.
Sources of Funding
None
Figures






Similar articles
-
Machine Learning and Natural Language Processing for Geolocation-Centric Monitoring and Characterization of Opioid-Related Social Media Chatter.JAMA Netw Open. 2019 Nov 1;2(11):e1914672. doi: 10.1001/jamanetworkopen.2019.14672. JAMA Netw Open. 2019. PMID: 31693125 Free PMC article.
-
Social Media Mining for Toxicovigilance: Automatic Monitoring of Prescription Medication Abuse from Twitter.Drug Saf. 2016 Mar;39(3):231-40. doi: 10.1007/s40264-015-0379-4. Drug Saf. 2016. PMID: 26748505 Free PMC article.
-
Using Twitter to Surveil the Opioid Epidemic in North Carolina: An Exploratory Study.JMIR Public Health Surveill. 2020 Jun 24;6(2):e17574. doi: 10.2196/17574. JMIR Public Health Surveill. 2020. PMID: 32469322 Free PMC article.
-
Mining social media for prescription medication abuse monitoring: a review and proposal for a data-centric framework.J Am Med Inform Assoc. 2020 Feb 1;27(2):315-329. doi: 10.1093/jamia/ocz162. J Am Med Inform Assoc. 2020. PMID: 31584645 Free PMC article. Review.
-
How Can Geographical Information Systems and Spatial Analysis Inform a Response to Prescription Opioid Misuse? A Discussion in the Context of Existing Literature.Curr Drug Abuse Rev. 2015;8(2):104-10. doi: 10.2174/187447370802150928185302. Curr Drug Abuse Rev. 2015. PMID: 26452451 Review.
Cited by
-
Text classification models for the automatic detection of nonmedical prescription medication use from social media.BMC Med Inform Decis Mak. 2021 Jan 26;21(1):27. doi: 10.1186/s12911-021-01394-0. BMC Med Inform Decis Mak. 2021. PMID: 33499852 Free PMC article.
-
Detecting illicit opioid content on Twitter.Drug Alcohol Rev. 2020 Mar;39(3):205-208. doi: 10.1111/dar.13048. Drug Alcohol Rev. 2020. PMID: 32202005 Free PMC article.
-
Conversational topics of social media messages associated with state-level mental distress rates.J Ment Health. 2020 Apr;29(2):234-241. doi: 10.1080/09638237.2020.1739251. Epub 2020 Mar 30. J Ment Health. 2020. PMID: 32223489 Free PMC article.
-
Can accurate demographic information about people who use prescription medications nonmedically be derived from Twitter?Proc Natl Acad Sci U S A. 2023 Feb 21;120(8):e2207391120. doi: 10.1073/pnas.2207391120. Epub 2023 Feb 14. Proc Natl Acad Sci U S A. 2023. PMID: 36787355 Free PMC article.
-
Thematic Analysis of Reddit Content About Buprenorphine-naloxone Using Manual Annotation and Natural Language Processing Techniques.J Addict Med. 2022 Jul-Aug 01;16(4):454-460. doi: 10.1097/ADM.0000000000000940. Epub 2021 Dec 23. J Addict Med. 2022. PMID: 34864788 Free PMC article.
References
-
- Abuse S. Results from the 2010 National Survey on Drug Use and Health: Summary Of National Findings 2011.
-
- Manchikanti L, Singh A. Therapeutic opioids: a ten-year perspective on the complexities and complications of the escalating use, abuse, and nonmedical use of opioids. Pain physician. 2008;11(2 Suppl):S63–S88. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical