Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Dec 20;3(4):e98.
doi: 10.2196/publichealth.8641.

E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends

Affiliations

E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends

Jon-Patrick Allem et al. JMIR Public Health Surveill. .

Abstract

Background: As e-cigarette use rapidly increases in popularity, data from online social systems (Twitter, Instagram, Google Web Search) can be used to capture and describe the social and environmental context in which individuals use, perceive, and are marketed this tobacco product. Social media data may serve as a massive focus group where people organically discuss e-cigarettes unprimed by a researcher, without instrument bias, captured in near real time and at low costs.

Objective: This study documents e-cigarette-related discussions on Twitter, describing themes of conversations and locations where Twitter users often discuss e-cigarettes, to identify priority areas for e-cigarette education campaigns. Additionally, this study demonstrates the importance of distinguishing between social bots and human users when attempting to understand public health-related behaviors and attitudes.

Methods: E-cigarette-related posts on Twitter (N=6,185,153) were collected from December 24, 2016, to April 21, 2017. Techniques drawn from network science were used to determine discussions of e-cigarettes by describing which hashtags co-occur (concept clusters) in a Twitter network. Posts and metadata were used to describe where geographically e-cigarette-related discussions in the United States occurred. Machine learning models were used to distinguish between Twitter posts reflecting attitudes and behaviors of genuine human users from those of social bots. Odds ratios were computed from 2x2 contingency tables to detect if hashtags varied by source (social bot vs human user) using the Fisher exact test to determine statistical significance.

Results: Clusters found in the corpus of hashtags from human users included behaviors (eg, #vaping), vaping identity (eg, #vapelife), and vaping community (eg, #vapenation). Additional clusters included products (eg, #eliquids), dual tobacco use (eg, #hookah), and polysubstance use (eg, #marijuana). Clusters found in the corpus of hashtags from social bots included health (eg, #health), smoking cessation (eg, #quitsmoking), and new products (eg, #ismog). Social bots were significantly more likely to post hashtags that referenced smoking cessation and new products compared to human users. The volume of tweets was highest in the Mid-Atlantic (eg, Pennsylvania, New Jersey, Maryland, and New York), followed by the West Coast and Southwest (eg, California, Arizona and Nevada).

Conclusions: Social media data may be used to complement and extend the surveillance of health behaviors including tobacco product use. Public health researchers could harness these data and methods to identify new products or devices. Furthermore, findings from this study demonstrate the importance of distinguishing between Twitter posts from social bots and humans when attempting to understand attitudes and behaviors. Social bots may be used to perpetuate the idea that e-cigarettes are helpful in cessation and to promote new products as they enter the marketplace.

Keywords: Twitter; electronic cigarettes; electronic nicotine delivery system; infoveillance; social bots; social media; vaping.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Figure 1
Heat map from December 25, 2016, to April 21,2017, for 26,565 tweets.

Similar articles

Cited by

References

    1. Arrazola RA, Singh T, Corey CG, Husten CG, Neff LJ, Apelberg BJ, Bunnell RE, Choiniere CJ, King BA, Cox S, McAfee T, Caraballo RS. Tobacco use among middle and high school students—United States, 2011-2014. MMWR Morb Mortal Wkly Rep. 2015 Apr 17;64(14):381–385. http://www.statisticbrain.com/electronic-cigarette-statistics/ - PMC - PubMed
    1. Schoenborn CA, Gindi RM. Electronic cigarette use among adults: United States, 2014. NCHS Data Brief. 2015 Oct;(217):1–8. https://www.cdc.gov/nchs/data/databriefs/db217.pdf - PubMed
    1. Leventhal AM, Strong DR, Kirkpatrick MG, Unger JB, Sussman S, Riggs NR, Stone MD, Khoddam R, Samet JM, Audrain-McGovern J. Association of electronic cigarette use with initiation of combustible tobacco product smoking in early adolescence. JAMA. 2015 Aug 18;314(7):700–707. doi: 10.1001/jama.2015.8950. http://europepmc.org/abstract/MED/26284721 - DOI - PMC - PubMed
    1. Wills TA, Knight R, Sargent JD, Gibbons FX, Pagano I, Williams RJ. Longitudinal study of e-cigarette use and onset of cigarette smoking among high school students in Hawaii. Tob Control. 2017 Jan;26(1):34–39. doi: 10.1136/tobaccocontrol-2015-052705. - DOI - PMC - PubMed
    1. Unger JB, Soto DW, Leventhal A. E-cigarette use and subsequent cigarette and marijuana use among Hispanic young adults. Drug Alcohol Depend. 2016 Jun 01;163:261–264. doi: 10.1016/j.drugalcdep.2016.04.027. - DOI - PMC - PubMed