Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2024 Jun 6;19(6):e0301488.
doi: 10.1371/journal.pone.0301488. eCollection 2024.

Understanding the determinants of vaccine hesitancy in the United States: A comparison of social surveys and social media

Affiliations
Comparative Study

Understanding the determinants of vaccine hesitancy in the United States: A comparison of social surveys and social media

Kuleen Sasse et al. PLoS One. .

Abstract

The COVID-19 pandemic prompted governments worldwide to implement a range of containment measures, including mass gathering restrictions, social distancing, and school closures. Despite these efforts, vaccines continue to be the safest and most effective means of combating such viruses. Yet, vaccine hesitancy persists, posing a significant public health concern, particularly with the emergence of new COVID-19 variants. To effectively address this issue, timely data is crucial for understanding the various factors contributing to vaccine hesitancy. While previous research has largely relied on traditional surveys for this information, recent sources of data, such as social media, have gained attention. However, the potential of social media data as a reliable proxy for information on population hesitancy, especially when compared with survey data, remains underexplored. This paper aims to bridge this gap. Our approach uses social, demographic, and economic data to predict vaccine hesitancy levels in the ten most populous US metropolitan areas. We employ machine learning algorithms to compare a set of baseline models that contain only these variables with models that incorporate survey data and social media data separately. Our results show that XGBoost algorithm consistently outperforms Random Forest and Linear Regression, with marginal differences between Random Forest and XGBoost. This was especially the case with models that incorporate survey or social media data, thus highlighting the promise of the latter data as a complementary information source. Results also reveal variations in influential variables across the five hesitancy classes, such as age, ethnicity, occupation, and political inclination. Further, the application of models to different MSAs yields mixed results, emphasizing the uniqueness of communities and the need for complementary data approaches. In summary, this study underscores social media data's potential for understanding vaccine hesitancy, emphasizes the importance of tailoring interventions to specific communities, and suggests the value of combining different data sources.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Similar articles

Cited by

References

    1. Dinleyici EC, Borrow R, Safadi MAP, van Damme P, Munoz FM. Vaccines and routine immunization strategies during the COVID-19 pandemic. Human vaccines & immunotherapeutics. 2021;17(2):400–407. doi: 10.1080/21645515.2020.1804776 - DOI - PMC - PubMed
    1. Gao Q, Bao L, Mao H, Wang L, Xu K, Yang M, et al.. Development of an inactivated vaccine candidate for SARS-CoV-2. Science. 2020;369(6499):77–81. doi: 10.1126/science.abc1932 - DOI - PMC - PubMed
    1. Mohamed K, Rzymski P, Islam MS, Makuku R, Mushtaq A, Khan A, et al.. COVID-19 vaccinations: The unknowns, challenges, and hopes. Journal of medical virology. 2022;94(4):1336–1349. doi: 10.1002/jmv.27487 - DOI - PMC - PubMed
    1. Mathieu E, Ritchie H, Rodés-Guirao L, Appel C, Gavrilov D, Giattino C, Hasell J, Macdonald B, Dattani S, Beltekian D, Ortiz-Ospina E, Roser M. Coronavirus (COVID-19) vaccinations Our World in Data. https://ourworldindata.org/covid-vaccinations.
    1. Duca LM, Xu SF Likangand Price, McLean CA. Covid-19 stats: Covid-19 incidence, by age group—United States, March 1–November 14, 2020; 2020. Available from: https://www.cdc.gov/mmwr/volumes/69/wr/mm695152a8.htm.

Publication types

MeSH terms

Substances