Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 11;6(2):174-207.
doi: 10.1007/s41666-021-00111-w. eCollection 2022 Jun.

COVID-19 Pandemic: Identifying Key Issues Using Social Media and Natural Language Processing

Affiliations

COVID-19 Pandemic: Identifying Key Issues Using Social Media and Natural Language Processing

Oladapo Oyebode et al. J Healthc Inform Res. .

Abstract

The COVID-19 pandemic has affected people's lives in many ways. Social media data can reveal public perceptions and experience with respect to the pandemic, and also reveal factors that hamper or support efforts to curb global spread of the disease. In this paper, we analyzed COVID-19-related comments collected from six social media platforms using natural language processing (NLP) techniques. We identified relevant opinionated keyphrases and their respective sentiment polarity (negative or positive) from over 1 million randomly selected comments, and then categorized them into broader themes using thematic analysis. Our results uncover 34 negative themes out of which 17 are economic, socio-political, educational, and political issues. Twenty (20) positive themes were also identified. We discuss the negative issues and suggest interventions to tackle them based on the positive themes and research evidence.

Keywords: COVID-19; Coronavirus; Health informatics; Keyphrase extraction; Natural language processing; Social media; Text mining; Thematic analysis.

PubMed Disclaimer

Conflict of interest statement

Conflict of InterestThe authors declare no competing interests.

Figures

Fig. 1
Fig. 1
NLP pipeline for extracting opinionated keyphrases from COVID-19-related comments
Fig. 2
Fig. 2
The KeyphraseExtractor algorithm based on the context-aware NLP approach
Fig. 3
Fig. 3
A sample parse tree illustrating the output of the chunker
Fig. 4
Fig. 4
Sample negative keyphrases and their frequency of occurrence (a larger bubble size illustrates higher dominance)
Fig. 5
Fig. 5
Sample positive keyphrases and their frequency of occurrence (a larger bubble size illustrates higher dominance)
Fig. 6
Fig. 6
The chart shows negative themes and the corresponding number of subthemes
Fig. 7
Fig. 7
The chart shows the total number of user comments associated with each negative theme
Fig. 8
Fig. 8
The chart shows positive themes and the corresponding number of subthemes
Fig. 9
Fig. 9
The chart shows the total number of user comments associated with each positive theme

References

    1. Morens DM, Folkers GK, Fauci AS. The challenge of emerging and re-emerging infectious diseases. Nature. 2004;430:242–249. doi: 10.1038/nature02759. - DOI - PMC - PubMed
    1. Jilani TN, Jamil RT, Siddiqui AH (2021) H1N1 Influenza. In: StatPearls. StatPearls Publishing, Treasure Island (FL)
    1. Dawood FS, Iuliano AD, Reed C, Meltzer MI, Shay DK, Cheng PY, Bandaranayake D, Breiman RF, Brooks WA, Buchy P, Feikin DR, Fowler KB, Gordon A, Hien NT, Horby P, Huang QS, Katz MA, Krishnan A, Lal R, Montgomery JM, Mølbak K, Pebody R, Presanis AM, Razuri H, Steens A, Tinoco YO, Wallinga J, Yu H, Vong S, Bresee J, Widdowson MA. Estimated global mortality associated with the first 12 months of 2009 pandemic influenza A H1N1 virus circulation: a modelling study. Lancet Infect Dis. 2012;12:687–695. doi: 10.1016/S1473-3099(12)70121-4. - DOI - PubMed
    1. World Health Organization (2019) HIV/AIDS. https://www.who.int/news-room/fact-sheets/detail/hiv-aids. Accessed 16 May 2020
    1. Tian H, Liu Y, Li Y, Wu C-H, Chen B, Kraemer MUG, Li B, Cai J, Xu B, Yang Q, Wang B, Yang P, Cui Y, Song Y, Zheng P, Wang Q, Bjornstad ON, Yang R, Grenfell BT, Pybus OG, Dye C. An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China. Science (80-) 2020;368:eabb6105. doi: 10.1126/science.abb6105. - DOI - PMC - PubMed

LinkOut - more resources