Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study
- PMID: 33617460
- PMCID: PMC7939057
- DOI: 10.2196/26482
Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study
Abstract
Background: Since the beginning of the COVID-19 pandemic in late 2019, its far-reaching impacts have been witnessed globally across all aspects of human life, such as health, economy, politics, and education. Such widely penetrating impacts cast significant and profound burdens on all population groups, incurring varied concerns and sentiments among them.
Objective: This study aims to identify the concerns, sentiments, and disparities of various population groups during the COVID-19 pandemic through a cross-sectional study conducted via large-scale Twitter data mining infoveillance.
Methods: This study consisted of three steps: first, tweets posted during the pandemic were collected and preprocessed on a large scale; second, the key population attributes, concerns, sentiments, and emotions were extracted via a collection of natural language processing procedures; third, multiple analyses were conducted to reveal concerns, sentiments, and disparities among population groups during the pandemic. Overall, this study implemented a quick, effective, and economical approach for analyzing population-level disparities during a public health event. The source code developed in this study was released for free public use at GitHub.
Results: A total of 1,015,655 original English tweets posted from August 7 to 12, 2020, were acquired and analyzed to obtain the following results. Organizations were significantly more concerned about COVID-19 (odds ratio [OR] 3.48, 95% CI 3.39-3.58) and expressed more fear and depression emotions than individuals. Females were less concerned about COVID-19 (OR 0.73, 95% CI 0.71-0.75) and expressed less fear and depression emotions than males. Among all age groups (ie, ≤18, 19-29, 30-39, and ≥40 years of age), the attention ORs of COVID-19 fear and depression increased significantly with age. It is worth noting that not all females paid less attention to COVID-19 than males. In the age group of 40 years or older, females were more concerned than males, especially regarding the economic and education topics. In addition, males 40 years or older and 18 years or younger were the least positive. Lastly, in all sentiment analyses, the sentiment polarities regarding political topics were always the lowest among the five topics of concern across all population groups.
Conclusions: Through large-scale Twitter data mining, this study revealed that meaningful differences regarding concerns and sentiments about COVID-19-related topics existed among population groups during the study period. Therefore, specialized and varied attention and support are needed for different population groups. In addition, the efficient analysis method implemented by our publicly released code can be utilized to dynamically track the evolution of each population group during the pandemic or any other major event for better informed public health research and interventions.
Keywords: COVID-19; Twitter mining; concerns; disparities; infodemiology; infoveillance; pandemic; population groups; sentiments.
©Chunyan Zhang, Songhua Xu, Zongfang Li, Shunxu Hu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 05.03.2021.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures











Similar articles
-
Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach.J Med Internet Res. 2020 Nov 25;22(11):e20550. doi: 10.2196/20550. J Med Internet Res. 2020. PMID: 33119535 Free PMC article.
-
Topics, Trends, and Sentiments of Tweets About the COVID-19 Pandemic: Temporal Infoveillance Study.J Med Internet Res. 2020 Oct 23;22(10):e22624. doi: 10.2196/22624. J Med Internet Res. 2020. PMID: 33006937 Free PMC article.
-
Emotions and Topics Expressed on Twitter During the COVID-19 Pandemic in the United Kingdom: Comparative Geolocation and Text Mining Analysis.J Med Internet Res. 2022 Oct 5;24(10):e40323. doi: 10.2196/40323. J Med Internet Res. 2022. PMID: 36150046 Free PMC article.
-
Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing.Annu Rev Biomed Data Sci. 2021 Jul 20;4:313-339. doi: 10.1146/annurev-biodatasci-021821-061045. Epub 2021 May 14. Annu Rev Biomed Data Sci. 2021. PMID: 34465169 Review.
-
Evolution of Select Epidemiological Modeling and the Rise of Population Sentiment Analysis: A Literature Review and COVID-19 Sentiment Illustration.Int J Environ Res Public Health. 2022 Mar 9;19(6):3230. doi: 10.3390/ijerph19063230. Int J Environ Res Public Health. 2022. PMID: 35328916 Free PMC article. Review.
Cited by
-
Emotional Distress During COVID-19 by Mental Health Conditions and Economic Vulnerability: Retrospective Analysis of Survey-Linked Twitter Data With a Semisupervised Machine Learning Algorithm.J Med Internet Res. 2023 Mar 16;25:e44965. doi: 10.2196/44965. J Med Internet Res. 2023. PMID: 36809798 Free PMC article.
-
Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia.JMIR Infodemiology. 2022 Sep 27;2(2):e36941. doi: 10.2196/36941. eCollection 2022 Jul-Dec. JMIR Infodemiology. 2022. PMID: 36196144 Free PMC article.
-
The Evolution of Rumors on a Closed Social Networking Platform During COVID-19: Algorithm Development and Content Study.JMIR Med Inform. 2021 Nov 23;9(11):e30467. doi: 10.2196/30467. JMIR Med Inform. 2021. PMID: 34623954 Free PMC article.
-
COVID-19 information received by the Peruvian population, during the first phase of the pandemic, and its association with developing psychological distress: Information about COVID-19 and distress in Peru.Medicine (Baltimore). 2022 Feb 4;101(5):e28625. doi: 10.1097/MD.0000000000028625. Medicine (Baltimore). 2022. PMID: 35119007 Free PMC article.
-
The Evolution and Disparities of Online Attitudes Toward COVID-19 Vaccines: Year-long Longitudinal and Cross-sectional Study.J Med Internet Res. 2022 Jan 21;24(1):e32394. doi: 10.2196/32394. J Med Internet Res. 2022. PMID: 34878410 Free PMC article.
References
-
- Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, Zhao X, Huang B, Shi W, Lu R, Niu P, Zhan F, Ma X, Wang D, Xu W, Wu G, Gao GF, Tan W, China Novel Coronavirus Investigating and Research Team A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med. 2020 Feb 20;382(8):727–733. doi: 10.1056/NEJMoa2001017. http://europepmc.org/abstract/MED/31978945 - DOI - PMC - PubMed
-
- WHO coronavirus disease (COVID-19) dashboard. World Health Organization. [2020-08-12]. https://covid19.who.int/
-
- Moreland A, Herlihy C, Tynan MA, Sunshine G, McCord RF, Hilton C, Poovey J, Werner AK, Jones CD, Fulmer EB, Gundlapalli AV, Strosnider H, Potvien A, García MC, Honeycutt S, Baldwin G, CDC Public Health Law Program. CDC COVID-19 Response Team‚ Mitigation Policy Analysis Unit Timing of state and territorial COVID-19 stay-at-home orders and changes in population movement - United States, March 1-May 31, 2020. MMWR Morb Mortal Wkly Rep. 2020 Sep 04;69(35):1198–1203. doi: 10.15585/mmwr.mm6935a2. doi: 10.15585/mmwr.mm6935a2. - DOI - DOI - PMC - PubMed
-
- Mayo Clinic Staff COVID-19 quarantine, self-isolation and social distancing. Mayo Clinic. 2020. [2020-08-12]. https://www.mayoclinic.org/diseases-conditions/coronavirus/in-depth/coro....
-
- Impact of COVID-19 on people's livelihoods, their health and our food systems. World Health Organization. 2020. Oct 13, [2020-12-01]. https://www.who.int/news/item/13-10-2020-impact-of-covid-19-on-people's-....
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous