Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 5;17(Suppl 2):69.
doi: 10.1186/s12911-017-0469-6.

Leveraging machine learning-based approaches to assess human papillomavirus vaccination sentiment trends with Twitter data

Affiliations

Leveraging machine learning-based approaches to assess human papillomavirus vaccination sentiment trends with Twitter data

Jingcheng Du et al. BMC Med Inform Decis Mak. .

Abstract

Background: As one of the serious public health issues, vaccination refusal has been attracting more and more attention, especially for newly approved human papillomavirus (HPV) vaccines. Understanding public opinion towards HPV vaccines, especially concerns on social media, is of significant importance for HPV vaccination promotion.

Methods: In this study, we leveraged a hierarchical machine learning based sentiment analysis system to extract public opinions towards HPV vaccines from Twitter. English tweets containing HPV vaccines-related keywords were collected from November 2, 2015 to March 28, 2016. Manual annotation was done to evaluate the performance of the system on the unannotated tweets corpus. Followed time series analysis was applied to this corpus to track the trends of machine-deduced sentiments and their associations with different days of the week.

Results: The evaluation of the unannotated tweets corpus showed that the micro-averaging F scores have reached 0.786. The learning system deduced the sentiment labels for 184,214 tweets in the collected unannotated tweets corpus. Time series analysis identified a coincidence between mainstream outcome and Twitter contents. A weak trend was found for "Negative" tweets that decreased firstly and began to increase later; an opposite trend was identified for "Positive" tweets. Tweets that contain the worries on efficacy for HPV vaccines showed a relative significant decreasing trend. Strong associations were found between some sentiments ("Positive", "Negative", "Negative-Safety" and "Negative-Others") with different days of the week.

Conclusions: Our efforts on sentiment analysis for newly approved HPV vaccines provide us an automatic and instant way to extract public opinion and understand the concerns on Twitter. Our approaches can provide a feedback to public health professionals to monitor online public response, examine the effectiveness of their HPV vaccination promotion strategies and adjust their promotion plans.

Keywords: Hierarchical classification; Human papillomavirus vaccines; Machine learning; Sentiment analysis; Twitter.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Sentiment classification scheme for HPV vaccine related tweets [12]
Fig. 2
Fig. 2
Overview of the machine learning based system for tweets sentiment analysis
Fig. 3
Fig. 3
Sentiments distribution in large scale unannotated HPV vaccines related tweets corpus. (Neg: Negative)
Fig. 4
Fig. 4
Stacked line chart for the number of tweets containing different sentiments from November 2, 2015 to March 28, 2016
Fig. 5
Fig. 5
The relative proportions of tweets containing Negative, Neutral and Positive opinions
Fig. 6
Fig. 6
The relative proportions of “NegSafety”, “NegEfficacy” and “NegOthers” tweets to “Negative” tweets
Fig. 7
Fig. 7
The association of different days of the week with the relative proportions of tweets containing Negative, Neutral and Positive opinions
Fig. 8
Fig. 8
The association of different days of the week with the relative proportions of “NegSafety”, “NegEfficacy” and “NegOthers” tweets

References

    1. Chunara R, Andrews JR, Brownstein JS. Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. Am J Trop Med Hyg. 2012;86:39–45. doi: 10.4269/ajtmh.2012.11-0597. - DOI - PMC - PubMed
    1. Culotta A. Detecting influenza outbreaks by analyzing Twitter messages. arXiv Prepr. arXiv1007.4748. 2010;
    1. Chew C, Eysenbach G. Pandemics in the age of Twitter: content analysis of Tweets during the 2009 H1N1 outbreak. PLoS One. 2010;5(Public Library of Science):e14118. doi: 10.1371/journal.pone.0014118. - DOI - PMC - PubMed
    1. Freifeld CC, Brownstein JS, Menone CM, Bao W, Filice R, Kass-Hout T, et al. Digital drug safety surveillance: Monitoring pharmaceutical products in Twitter. Drug Saf. 2014;37:343–350. doi: 10.1007/s40264-014-0155-x. - DOI - PMC - PubMed
    1. Zhou X, Coiera E, Tsafnat G, Arachi D, Ong M-S, Dunn AG. Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter. Stud Health Technol Inform. 2015;216:761–765. - PubMed

Substances