Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 21;120(8):e2207391120.
doi: 10.1073/pnas.2207391120. Epub 2023 Feb 14.

Can accurate demographic information about people who use prescription medications nonmedically be derived from Twitter?

Affiliations

Can accurate demographic information about people who use prescription medications nonmedically be derived from Twitter?

Yuan-Chi Yang et al. Proc Natl Acad Sci U S A. .

Abstract

Traditional substance use (SU) surveillance methods, such as surveys, incur substantial lags. Due to the continuously evolving trends in SU, insights obtained via such methods are often outdated. Social media-based sources have been proposed for obtaining timely insights, but methods leveraging such data cannot typically provide fine-grained statistics about subpopulations, unlike traditional approaches. We address this gap by developing methods for automatically characterizing a large Twitter nonmedical prescription medication use (NPMU) cohort (n = 288,562) in terms of age-group, race, and gender. Our natural language processing and machine learning methods for automated cohort characterization achieved 0.88 precision (95% CI:0.84 to 0.92) for age-group, 0.90 (95% CI: 0.85 to 0.95) for race, and 94% accuracy (95% CI: 92 to 97) for gender, when evaluated against manually annotated gold-standard data. We compared automatically derived statistics for NPMU of tranquilizers, stimulants, and opioids from Twitter with statistics reported in the National Survey on Drug Use and Health (NSDUH) and the National Emergency Department Sample (NEDS). Distributions automatically estimated from Twitter were mostly consistent with the NSDUH [Spearman r: race: 0.98 (P < 0.005); age-group: 0.67 (P < 0.005); gender: 0.66 (P = 0.27)] and NEDS, with 34/65 (52.3%) of the Twitter-based estimates lying within 95% CIs of estimates from the traditional sources. Explainable differences (e.g., overrepresentation of younger people) were found for age-group-related statistics. Our study demonstrates that accurate subpopulation-specific estimates about SU, particularly NPMU, may be automatically derived from Twitter to obtain earlier insights about targeted subpopulations compared to traditional surveillance approaches.

Keywords: Twitter; machine learning; natural language processing; substance use; toxicovigilance.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Gender, age, and race proportions estimated from Twitter and those reported in US census.
Fig. 2.
Fig. 2.
Gender distributions for NPMU estimated from Twitter and those reported in the NSDUH. For opioid pain relievers, the gender distribution of overdose-related emergency medicine visits is also provided. 95% CIs are provided for each bar.
Fig. 3.
Fig. 3.
Race distributions for NPMU estimated from Twitter and those reported in the NSDUH. 95% CIs are provided for each bar.
Fig. 4.
Fig. 4.
Age-group distributions for NPMU estimated from Twitter and those reported in the NSDUH. 95% CIs are provided for each bar.

References

    1. Jalal H., et al. , Changing dynamics of the drug overdose epidemic in the United States from 1979 through 2016. Science 361, eaau1184 (2018). - PMC - PubMed
    1. Mattson C. L., et al. , Trends and geographic patterns in drug and synthetic opioid overdose deaths—United States, 2013–2019. Morb. Mortal. Wkly. Rep. 70, 202 (2021). - PMC - PubMed
    1. Ahmad F. B., Cisewski J. A., Rossen L. M., Sutton P., “Provisional drug overdose death counts” (National Center for Health Statistics, Centers for Disease Control and Prevention, Hyattsville, MD, 2022).
    1. Office of National Drug Control Policy, The White House, “Biden-Harris Administration Calls for Historic Levels of Funding to Prevent and Treat Addiction and Overdose” (The White House, 2021).
    1. Cook B. L., Alegría M., Racial-ethnic disparities in substance abuse treatment: the role of criminal history and socioeconomic status. Psychiatr. Serv. 62, 1273–1281 (2011). - PMC - PubMed

Publication types

Substances