Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 19;14(9):e29323.
doi: 10.7759/cureus.29323. eCollection 2022 Sep.

Emotional Analysis of Tweets About Clinically Extremely Vulnerable COVID-19 Groups

Affiliations

Emotional Analysis of Tweets About Clinically Extremely Vulnerable COVID-19 Groups

Toluwalase Awoyemi et al. Cureus. .

Abstract

Background Clinically extremely vulnerable (CEV) individuals have a significantly higher risk of morbidity and mortality from coronavirus disease 2019 (COVID-19). This high risk is due to predispositions such as chronic obstructive pulmonary disease (COPD), diabetes mellitus, hypertension, smoking, or extreme age (≥75). The initial COVID-19 preventive measures (use of face masks, social distancing, social bubbles) and vaccine allocation prioritized this group of vulnerable individuals to ensure their continued protection. However, as countries start relaxing the lockdown measures to help prevent socio-economic collapse, the impact of this relaxation on CEVs is once again brought to light. In this study, we set out to understand the impact of policy changes on the lives of CEVs by analyzing Twitter data with the hashtag #highriskcovid used by many high-risk individuals to tweet about and express their opinions and feelings. Methodology Tweets were extracted from the Twitter API between March 01, 2022, and April 21, 2022, using the Twarc2 tool. Extracted tweets were in English and included the hashtag #highriskcovid. We evaluated the most frequently used words and hashtags by calculating term frequency-inverse document frequency, and the location of tweets using the tidygeocoder package (method = osm). We also evaluated the sentiments and emotions depicted by these tweets using the National Research Council sentiment lexicon of the Syuzhet package. Finally, we used the latent Dirichlet allocation algorithm to determine relevant high-risk COVID-19 themes. Results The vast majority of the tweets originated from the United States (64%), Canada (22%), and the United Kingdom (4%). The most common hashtags were #highriskcovid (25.5%), #covid (6.82%), #immunocompromised (4.93%), #covidisnotover (4.0%), and #Maskup (1.40%), and the most frequently used words were immunocompromised (1.64%), people (1.4%), disabled (0.97%), maskup (0.85%), and eugenics (0.85%). The tweets were more negative (19.27%) than positive, and the most expressed negative emotions were fear (13.62%) and sadness (12.47%). At the same time, trust was the most expressed positive emotion and was used in relation to belief in masks, policies, and health workers to help. Finally, we detected frequently co-tweeted words such asmass and disaster, deadly and disabling, high and risk, public and health, immunocompromised and people, mass and disaster, and deadly and disabling. Conclusions The study provides evidence regarding the concerns and fears of high-risk COVID-19 groups as expressed via social media. It is imperative that further policies be implemented to specifically protect the health and mental wellness of high-risk individuals (for example, incorporating sentiment analyses of high-risk COVID-19 individuals such as this paper to inform the evaluation of already implemented preventive measures and policies). In addition, considerable work needs to be done to educate the public on high-risk individuals.

Keywords: cevs; covid-19; face masks; healthcare policy; highriskcovid; sentiment analysis; twitter.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The analytic workflow used in this paper.
Figure 2
Figure 2. Top 10 hashtags used in association with #highriskcovid tweets.
The Y axis is the percentage of each hashtag relative to all analyzed tweets, while the X axis contains the top 10 hashtags associated with #highriskcovid tweets.
Figure 3
Figure 3. Top 10 words used in #highriskcovid tweets.
The Y axis is the percentage of each word relative to all analyzed tweets, while the X axis contains the top 10 words used in #highriskcovid tweets.
Figure 4
Figure 4. Word cloud of #highriskcovid words with a frequency of occurrence greater than 25.
Figure 5
Figure 5. Countries of origin of tweets analyzed in this study.
The Y axis contains the top 25 countries with the most #highriskcovid tweets, while the X axis is the percentage of each country relative to the whole tweet.
Figure 6
Figure 6. Sentiment analysis of #highriskcovid tweets.
The Y axis is the percentage of sentiments of all analyzed tweets, while the X axis refers to the sentiments (negative or positive) in #highriskcovid tweets.
Figure 7
Figure 7. Word cloud of the top 100 negative #highriskcovid words.
Figure 8
Figure 8. Word cloud of the top 100 positive #highriskcovid words.
Figure 9
Figure 9. Emotional quotient analysis of #highriskcovid tweets.
The Y axis is the percentage of sentiments of all analyzed tweets, while the X axis refers to the emotion in #highriskcovid tweets.
Figure 10
Figure 10. Comparative word cloud of the #highriskcovid words and the emotions they portray.
Figure 11
Figure 11. Network analysis of common co-occurring #highriskcovid words.
The nodes represent words, while the edges (lines) are the weight of the connection between the nodes.
Figure 12
Figure 12. Modeled topics and their associated frequent terms among the analyzed #highriskcovid tweets.
Figure 13
Figure 13. Five-fold cross-validation of topic modeling.
The Y axis contains the perplexity score, while the X axis contains the candidate number of topics.  We identified a topic number between 10 to 17 to be the optimal range based on the perplexity score.
Figure 14
Figure 14. Result of LDAtuning using Griffiths2004 and Arun2010 metrics.
The Y axis contains the metric unit, while the X axis contains the candidate number of topics. We identified a topic number between 10 to 26 (Griffith) and 10-25 (Arun) to be the optimal range based on the perplexity score.

Similar articles

Cited by

References

    1. Effective public health measures to mitigate the spread of COVID-19: a systematic review. Ayouni I, Maatoug J, Dhouib W, Zammit N, Fredj SB, Ghammam R, Ghannem H. BMC Public Health. 2021;21:1015. - PMC - PubMed
    1. When can we stop wearing face masks? [ May; 2022 ]. 2022. https://www.thesun.co.uk/news/13221751/face-mask-rules-ending/ https://www.thesun.co.uk/news/13221751/face-mask-rules-ending/
    1. Boris Johnson SCRAPS work from home from TODAY and face masks & vaccine passports from next week as he rips up Plan B. [ May; 2022 ]. 2022. https://www.thesun.co.uk/health/17373030/boris-johnson-plan-b-restrictio... https://www.thesun.co.uk/health/17373030/boris-johnson-plan-b-restrictio...
    1. More than 90% of U.S. population can ditch facemasks under CDC Covid guidance. [ May; 2022 ]. 2022. https://www.cnbc.com/2022/03/03/more-than-90percent-of-us-population-can... https://www.cnbc.com/2022/03/03/more-than-90percent-of-us-population-can...
    1. Wearing of face masks no more mandatory in Ghana - Nana Addo. [ May; 2022 ];https://www.msn.com/en-xl/africa/other/wearing-of-face-masks-no-more-man... 2022 2022:23.

LinkOut - more resources