Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 27;10(6):994.
doi: 10.3390/healthcare10060994.

Text Mining and Determinants of Sentiments towards the COVID-19 Vaccine Booster of Twitter Users in Malaysia

Affiliations

Text Mining and Determinants of Sentiments towards the COVID-19 Vaccine Booster of Twitter Users in Malaysia

Song-Quan Ong et al. Healthcare (Basel). .

Abstract

Vaccination is the primary preventive measure against the COVID-19 infection, and an additional vaccine dosage is crucial to increase the immunity level of the community. However, public bias, as reflected on social media, may have a significant impact on the vaccination program. We aim to investigate the attitudes to the COVID-19 vaccination booster in Malaysia by using sentiment analysis. We retrieved 788 tweets containing COVID-19 vaccine booster keywords and identified the common topics discussed in tweets that related to the booster by using latent Dirichlet allocation (LDA) and performed sentiment analysis to understand the determinants for the sentiments to receiving the vaccination booster in Malaysia. We identified three important LDA topics: (1) type of vaccination booster; (2) effects of vaccination booster; (3) vaccination program operation. The type of vaccination further transformed into attributes of "az", "pfizer", "sinovac", and "mix" for determinants' assessments. Effect and type of vaccine booster associated stronger than program operation topic for the sentiments, and "pfizer" and "mix" were the strongest determinants of the tweet's sentiments after the Boruta feature selection and validated from the performance of regression analysis. This study provided a comprehensive workflow to retrieve and identify important healthcare topic from social media.

Keywords: Boruta; Pfizer-BioNTech; RFE; Twitter; astrazeneca; sinovac; vaccination booster.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Process flow of dataset construction.
Figure 2
Figure 2
Process of feature assessment.
Figure 3
Figure 3
The process of multinomial regression model build-up with two different groups of features as input data.
Figure 4
Figure 4
Sentiment polarity distribution of the tweets.
Figure 5
Figure 5
Topic sentiments’ probabilities.
Figure 6
Figure 6
Word cloud.
Figure 7
Figure 7
RFE result—ranking of variables based on the model Root Mean Square Error (RMSE). X-axes represent the subset sizes of variables; Y-axes represent the RMSE of the residuals, which indicate the prediction strength of the variables.
Figure 8
Figure 8
Boruta result plot for six possible determinants. Blue boxplots represent the minimal, average, and maximum Z-score of a shadow attribute. Red and green boxplots represent Z-scores of respectively rejected and confirmed attributes.
Figure 9
Figure 9
Venn diagram illustrating all, RFE, and Boruta determinants.
Figure 10
Figure 10
Feature assessment process and numbers of features obtained from each method.
Figure 11
Figure 11
Multinomial regression model performance using all and representative attributes.
Figure 12
Figure 12
Confusion matrix.

Similar articles

Cited by

References

    1. World Health Organization (WHO) Coronavirus Disease (COVID-19) Dashboard. World Health Organization; Geneva, Switzerland: 2020. [(accessed on 2 January 2022)]. Available online: https://covid19.who.int/
    1. Debata B., Patnaik P., Mishra A. COVID-19 pandemic. It’s impact on people, economy, and environment. J. Public Aff. 2020;20:e2372. doi: 10.1002/pa.2372. - DOI
    1. Song L., Zhou Y. The COVID-19 pandemic and its impact on the global economy: What does it take toturn crisis into opportunity? China World Econ. 2020;28:1–25. doi: 10.1111/cwe.12349. - DOI
    1. Reported Cases and Deaths by Country or Territory. [(accessed on 2 January 2022)]. Available online: https://www.worldometers.info/coronavirus/
    1. The Economist. The Omicron Variant Advances at an Incredible Rate. [(accessed on 2 January 2022)]. Available online: https://www.economist.com/international/2021/12/18/the-omicron-variant-a....

LinkOut - more resources