Using Deep Learning to Identify Linguistic Features that Facilitate or Inhibit the Propagation of Anti- and Pro-Vaccine Content on Social Media
- PMID: 37975063
- PMCID: PMC10652839
- DOI: 10.1109/icdh55609.2022.00025
Using Deep Learning to Identify Linguistic Features that Facilitate or Inhibit the Propagation of Anti- and Pro-Vaccine Content on Social Media
Abstract
Anti-vaccine content is rapidly propagated via social media, fostering vaccine hesitancy, while pro-vaccine content has not replicated the opponent's successes. Despite this disparity in the dissemination of anti- and pro-vaccine posts, linguistic features that facilitate or inhibit the propagation of vaccine-related content remain less known. Moreover, most prior machine-learning algorithms classified social-media posts into binary categories (e.g., misinformation or not) and have rarely tackled a higher-order classification task based on divergent perspectives about vaccines (e.g., anti-vaccine, pro-vaccine, and neutral). Our objectives are (1) to identify sets of linguistic features that facilitate and inhibit the propagation of vaccine-related content and (2) to compare whether anti-vaccine, provaccine, and neutral tweets contain either set more frequently than the others. To achieve these goals, we collected a large set of social media posts (over 120 million tweets) between Nov. 15 and Dec. 15, 2021, coinciding with the Omicron variant surge. A two-stage framework was developed using a fine-tuned BERT classifier, demonstrating over 99 and 80 percent accuracy for binary and ternary classification. Finally, the Linguistic Inquiry Word Count text analysis tool was used to count linguistic features in each classified tweet. Our regression results show that anti-vaccine tweets are propagated (i.e., retweeted), while pro-vaccine tweets garner passive endorsements (i.e., favorited). Our results also yielded the two sets of linguistic features as facilitators and inhibitors of the propagation of vaccine-related tweets. Finally, our regression results show that anti-vaccine tweets tend to use the facilitators, while pro-vaccine counterparts employ the inhibitors. These findings and algorithms from this study will aid public health officials' efforts to counteract vaccine misinformation, thereby facilitating the delivery of preventive measures during pandemics and epidemics.
Keywords: deep-learning; diffusion of information; health informatics; regression analyses; social media; vaccine misinformation.
Figures
Similar articles
-
Using Machine Learning to Compare Provaccine and Antivaccine Discourse Among the Public on Social Media: Algorithm Development Study.JMIR Public Health Surveill. 2021 Jun 24;7(6):e23105. doi: 10.2196/23105. JMIR Public Health Surveill. 2021. PMID: 34185004 Free PMC article.
-
Public Officials' Engagement on Social Media During the Rollout of the COVID-19 Vaccine: Content Analysis of Tweets.JMIR Infodemiology. 2023 Jul 20;3:e41582. doi: 10.2196/41582. JMIR Infodemiology. 2023. PMID: 37315194 Free PMC article.
-
Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches.J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8. Online ahead of print. J Supercomput. 2023. PMID: 37359330 Free PMC article.
-
Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705. J Med Internet Res. 2022. PMID: 35976193 Free PMC article.
-
Emotions and Incivility in Vaccine Mandate Discourse: Natural Language Processing Insights.JMIR Infodemiology. 2022 Sep 13;2(2):e37635. doi: 10.2196/37635. eCollection 2022 Jul-Dec. JMIR Infodemiology. 2022. PMID: 36188420 Free PMC article.
Cited by
-
When Infodemic Meets Epidemic: Systematic Literature Review.JMIR Public Health Surveill. 2025 Feb 3;11:e55642. doi: 10.2196/55642. JMIR Public Health Surveill. 2025. PMID: 39899850 Free PMC article.
-
Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance.Soc Sci Med. 2024 May;348:116775. doi: 10.1016/j.socscimed.2024.116775. Epub 2024 Mar 15. Soc Sci Med. 2024. PMID: 38579627 Free PMC article.
-
Identifying Misinformation About Unproven Cancer Treatments on Social Media Using User-Friendly Linguistic Characteristics: Content Analysis.JMIR Infodemiology. 2025 Feb 12;5:e62703. doi: 10.2196/62703. JMIR Infodemiology. 2025. PMID: 39938078 Free PMC article.
References
-
- Ortiz RR, Smith A, and Coyne-Beasley T, “A systematic literature review to examine the potential for social media to impact HPV vaccine uptake and awareness, knowledge, and attitudes about HPV and HPV vaccination,” vol. 15, no. 7, pp. 1465–1475, 2019. [Online]. Available: https://www.tandfonline.com/doi/abs/10.1080/21645515.2019.1581543?journa... - DOI - PMC - PubMed
-
- Singh L, Bansal S, Bode L, Budak C, Chi G, Kawintiranon K, Padden C, Vanarsdall R, Vraga E, and Wang Y, “A first look at COVID-19 information and misinformation sharing on twitter,” 2020. [Online]. Available: https://arxiv.org/abs/2003.13907v1
-
- Argyris YA, Kim Y, Roscizewski A, and Song W, “The mediating role of vaccine hesitancy between maternal engagement with anti- and pro-vaccine social media posts and adolescent HPV-vaccine uptake rates in the US: The perspective of loss aversion in emotion-laden decision circumstances,” vol. 282, p. 114043, 2021. [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0277953621003750 - PubMed
-
- “See how vaccinations are going in your county and state,” 2020. [Online]. Available: https://www.nytimes.com/interactive/2020/us/covid-19-vaccine-doses.html
-
- Johnson NF, Velasquez N, Restrepo NJ, Leahy R, Gabriel N, El Oud S, Zheng M, Manrique P, Wuchty S, and Lupu Y, “The online competition between pro- and anti-vaccination views,” vol. 582, pp. 230–233, 2020. [Online]. Available: http://www.nature.com/articles/s41586-020-2281-1 - PubMed
Associated data
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials