Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 5;23(3):e26482.
doi: 10.2196/26482.

Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study

Affiliations

Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study

Chunyan Zhang et al. J Med Internet Res. .

Abstract

Background: Since the beginning of the COVID-19 pandemic in late 2019, its far-reaching impacts have been witnessed globally across all aspects of human life, such as health, economy, politics, and education. Such widely penetrating impacts cast significant and profound burdens on all population groups, incurring varied concerns and sentiments among them.

Objective: This study aims to identify the concerns, sentiments, and disparities of various population groups during the COVID-19 pandemic through a cross-sectional study conducted via large-scale Twitter data mining infoveillance.

Methods: This study consisted of three steps: first, tweets posted during the pandemic were collected and preprocessed on a large scale; second, the key population attributes, concerns, sentiments, and emotions were extracted via a collection of natural language processing procedures; third, multiple analyses were conducted to reveal concerns, sentiments, and disparities among population groups during the pandemic. Overall, this study implemented a quick, effective, and economical approach for analyzing population-level disparities during a public health event. The source code developed in this study was released for free public use at GitHub.

Results: A total of 1,015,655 original English tweets posted from August 7 to 12, 2020, were acquired and analyzed to obtain the following results. Organizations were significantly more concerned about COVID-19 (odds ratio [OR] 3.48, 95% CI 3.39-3.58) and expressed more fear and depression emotions than individuals. Females were less concerned about COVID-19 (OR 0.73, 95% CI 0.71-0.75) and expressed less fear and depression emotions than males. Among all age groups (ie, ≤18, 19-29, 30-39, and ≥40 years of age), the attention ORs of COVID-19 fear and depression increased significantly with age. It is worth noting that not all females paid less attention to COVID-19 than males. In the age group of 40 years or older, females were more concerned than males, especially regarding the economic and education topics. In addition, males 40 years or older and 18 years or younger were the least positive. Lastly, in all sentiment analyses, the sentiment polarities regarding political topics were always the lowest among the five topics of concern across all population groups.

Conclusions: Through large-scale Twitter data mining, this study revealed that meaningful differences regarding concerns and sentiments about COVID-19-related topics existed among population groups during the study period. Therefore, specialized and varied attention and support are needed for different population groups. In addition, the efficient analysis method implemented by our publicly released code can be utilized to dynamically track the evolution of each population group during the pandemic or any other major event for better informed public health research and interventions.

Keywords: COVID-19; Twitter mining; concerns; disparities; infodemiology; infoveillance; pandemic; population groups; sentiments.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Figure 1
The structure of our cross-sectional method. API: application programming interface; POMS: Profile of Mood States.
Figure 2
Figure 2
The language distribution of tweets. ar: Arabic; en: English; es: Spanish; hi: Hindi; others: other languages; pt: Portuguese.
Figure 3
Figure 3
The structure of the M3 (multimodal, multilingual, and multi-attribute) model for inferring user type, gender, and age from profile information. DenseNet: dense convolutional network; ReLU: rectified linear unit.
Figure 4
Figure 4
Univariate analysis of COVID-19 concerns and sentiment polarities among different population groups. OR: odds ratio.
Figure 5
Figure 5
Bivariate analysis of COVID-19 concerns and sentiment polarities among different population groups. OR: odds ratio; Org: organization.
Figure 6
Figure 6
Trivariate analysis of COVID-19 concerns and sentiment polarities among different population groups. OR: odds ratio.
Figure 7
Figure 7
The mean intensity scores for the three emotion models. Scores range from 0 to 1 for each emotion. POMS: Profile of Mood States.
Figure 8
Figure 8
The population distributions of the three emotion models. POMS: Profile of Mood States.
Figure 9
Figure 9
Plutchik emotion analysis on four population characteristics. Scores range from 0 to 1 for each emotion.
Figure 10
Figure 10
Profile of Mood States (POMS) emotion analysis on four population characteristics. Scores range from 0 to 1 for each emotion.
Figure 11
Figure 11
Statistical analysis of emotions related to COVID-19. OR: odds ratio.

Similar articles

Cited by

References

    1. Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, Zhao X, Huang B, Shi W, Lu R, Niu P, Zhan F, Ma X, Wang D, Xu W, Wu G, Gao GF, Tan W, China Novel Coronavirus Investigating and Research Team A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med. 2020 Feb 20;382(8):727–733. doi: 10.1056/NEJMoa2001017. http://europepmc.org/abstract/MED/31978945 - DOI - PMC - PubMed
    1. WHO coronavirus disease (COVID-19) dashboard. World Health Organization. [2020-08-12]. https://covid19.who.int/
    1. Moreland A, Herlihy C, Tynan MA, Sunshine G, McCord RF, Hilton C, Poovey J, Werner AK, Jones CD, Fulmer EB, Gundlapalli AV, Strosnider H, Potvien A, García MC, Honeycutt S, Baldwin G, CDC Public Health Law Program. CDC COVID-19 Response Team‚ Mitigation Policy Analysis Unit Timing of state and territorial COVID-19 stay-at-home orders and changes in population movement - United States, March 1-May 31, 2020. MMWR Morb Mortal Wkly Rep. 2020 Sep 04;69(35):1198–1203. doi: 10.15585/mmwr.mm6935a2. doi: 10.15585/mmwr.mm6935a2. - DOI - DOI - PMC - PubMed
    1. Mayo Clinic Staff COVID-19 quarantine, self-isolation and social distancing. Mayo Clinic. 2020. [2020-08-12]. https://www.mayoclinic.org/diseases-conditions/coronavirus/in-depth/coro....
    1. Impact of COVID-19 on people's livelihoods, their health and our food systems. World Health Organization. 2020. Oct 13, [2020-12-01]. https://www.who.int/news/item/13-10-2020-impact-of-covid-19-on-people's-....