Investigating COVID-19 News Across Four Nations: A Topic Modeling and Sentiment Analysis Approach
- PMID: 34786310
- PMCID: PMC8545217
- DOI: 10.1109/ACCESS.2021.3062875
Abstract
Newspapers play an important role in society: they inform citizens about events around them and how those events may affect their lives. That role becomes even more critical during a health crisis such as the current COVID-19 pandemic. Since the start of the pandemic, newspapers have provided the public with rich information on issues such as the discovery of new coronavirus strains, lockdowns and other restrictions, government policies, and vaccine development. In this setting, analyzing the emergent and widely reported topics and their associated sentiments across countries can help us better understand the COVID-19 pandemic. In our research, a database of more than 100,000 COVID-19 news headlines and articles was analyzed using Top2Vec (for topic modeling) and RoBERTa (for sentiment classification and analysis). Our topic modeling results highlighted that education, economy, US, and sports are among the most common and widely reported themes across the UK, India, Japan, and South Korea. Further, our sentiment classification model achieved 90% validation accuracy, and the analysis showed that the worst-affected country in our dataset, the UK, also has the highest percentage of negative sentiment.
Keywords: COVID-19; RoBERTa; Top2Vec; machine learning; natural language processing; newspaper; sentiment analysis; topic modeling.
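The cross-country comparison described in the abstract (percentage of negative sentiment per country) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `negative_share_by_country`, the label set, the placeholder country names, and the toy records are all assumptions made for illustration, and the labels would in practice come from a sentiment classifier such as the RoBERTa model mentioned above.

```python
from collections import Counter, defaultdict


def negative_share_by_country(records):
    """Return {country: percent of items labeled 'negative'}.

    `records` is an iterable of (country, label) pairs. The label names
    used here are an assumption, not the paper's exact label scheme.
    """
    counts = defaultdict(Counter)
    for country, label in records:
        counts[country][label] += 1
    return {
        country: 100.0 * c["negative"] / sum(c.values())
        for country, c in counts.items()
    }


# Made-up toy labels with placeholder country names (not data from the study).
toy_records = [
    ("country_a", "negative"), ("country_a", "negative"),
    ("country_a", "positive"), ("country_a", "neutral"),
    ("country_b", "negative"), ("country_b", "positive"),
    ("country_b", "positive"), ("country_b", "neutral"),
]

shares = negative_share_by_country(toy_records)
most_negative = max(shares, key=shares.get)  # country with the largest negative share
print(shares, most_negative)
```

Normalizing by each country's total item count, rather than comparing raw counts, keeps the comparison fair when countries contribute different numbers of headlines.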
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
Similar articles
- Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches. J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8. Online ahead of print. PMID: 37359330 Free PMC article.
- Using the COVID-19 Pandemic to Assess the Influence of News Affect on Online Mental Health-Related Search Behavior Across the United States: Integrated Sentiment Analysis and the Circumplex Model of Affect. J Med Internet Res. 2022 Jan 27;24(1):e32731. doi: 10.2196/32731. PMID: 34932494 Free PMC article.
- Topics, Trends, and Sentiments of Tweets About the COVID-19 Pandemic: Temporal Infoveillance Study. J Med Internet Res. 2020 Oct 23;22(10):e22624. doi: 10.2196/22624. PMID: 33006937 Free PMC article.
- Public Opinion and Sentiment Before and at the Beginning of COVID-19 Vaccinations in Japan: Twitter Analysis. JMIR Infodemiology. 2022 May 9;2(1):e32335. doi: 10.2196/32335. eCollection 2022 Jan-Jun. PMID: 35578643 Free PMC article.
- Sentiment analysis of epidemiological surveillance reports on COVID-19 in Greece using machine learning models. Front Public Health. 2023 Jul 18;11:1191730. doi: 10.3389/fpubh.2023.1191730. eCollection 2023. PMID: 37533519 Free PMC article. Review.
Cited by
- Hate Speech in a Telegram Conspiracy Channel During the First Year of the COVID-19 Pandemic. Soc Media Soc. 2022 Nov 21;8(4):20563051221138758. doi: 10.1177/20563051221138758. eCollection 2022 Oct-Dec. PMID: 36447996 Free PMC article.
- Beyond fear and anger: A global analysis of emotional response to Covid-19 news on Twitter using deep learning. Online Soc Netw Media. 2023 Jun 14:100253. doi: 10.1016/j.osnem.2023.100253. Online ahead of print. PMID: 37360968 Free PMC article.
- Agenda-Setting for COVID-19: A Study of Large-Scale Economic News Coverage Using Natural Language Processing. Int J Data Sci Anal. 2023;15(3):291-312. doi: 10.1007/s41060-022-00364-7. Epub 2022 Oct 6. PMID: 36217352 Free PMC article.
- Public Opinion About COVID-19 on a Microblog Platform in China: Topic Modeling and Multidimensional Sentiment Analysis of Social Media. J Med Internet Res. 2024 Jan 31;26:e47508. doi: 10.2196/47508. PMID: 38294856 Free PMC article.
- Using 'infodemics' to understand public awareness and perception of SARS-CoV-2: A longitudinal analysis of online information about COVID-19 incidence and mortality during a major outbreak in Vietnam, July-September 2020. PLoS One. 2022 Apr 7;17(4):e0266299. doi: 10.1371/journal.pone.0266299. eCollection 2022. PMID: 35390078 Free PMC article.