Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022;5(1):687-729.
doi: 10.1007/s42001-021-00150-8. Epub 2021 Oct 20.

A high-resolution temporal and geospatial content analysis of Twitter posts related to the COVID-19 pandemic

Affiliations

A high-resolution temporal and geospatial content analysis of Twitter posts related to the COVID-19 pandemic

Charalampos Ntompras et al. J Comput Soc Sci. 2022.

Abstract

The COVID-19 pandemic has deeply impacted all aspects of social, professional, and financial life, with concerns and responses being readily published in online social media worldwide. This study employs probabilistic text mining techniques for a large-scale, high-resolution, temporal, and geospatial content analysis of Twitter related discussions. Analysis considered 20,230,833 English language original COVID-19-related tweets with global origin retrieved between January 25, 2020 and April 30, 2020. Fine grain topic analysis identified 91 meaningful topics. Most of the topics showed a temporal evolution with local maxima, underlining the short-lived character of discussions in Twitter. When compared to real-world events, temporal popularity curves showed a good correlation with and quick response to real-world triggers. Geospatial analysis of topics showed that approximately 30% of original English language tweets were contributed by USA-based users, while overall more than 60% of the English language tweets were contributed by users from countries with an official language other than English. High-resolution temporal and geospatial analysis of Twitter content shows potential for political, economic, and social monitoring on a global and national level.

Keywords: COVID-19; Geospatial analysis; Latent Dirichlet Allocation; Social media analysis; Topic modeling; Twitter.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Number of COVID-19-related English tweets per day from January 25 to April 30, 2020
Fig. 2
Fig. 2
Relative overall popularity of the topics categories
Fig. 3
Fig. 3
The top eight popular topics for the entire time span
Fig. 4
Fig. 4
Weekly variation of popularity for topics related to Life during the Pandemic category: a Most popular topics, with a peak of more than 2% for at least 1 week; b and c less popular topics
Fig. 5
Fig. 5
Weekly variation of popularity for topics related to Pandemic Management category. a Most popular topics, with a peak of more than 2% for at least 1 week; b and c less popular topics
Fig. 6
Fig. 6
Weekly variation of popularity for topics related to Medical category: a most popular topics, with a peak of more than 2% for at least 1 week; b and c less popular topics
Fig. 7
Fig. 7
Weekly variation of popularity for topics related to Outbreak category. a Most popular topics, with a peak of more than 2% for at least 1 week; and b less popular topics
Fig. 8
Fig. 8
Weekly variation of popularity for topics related to Lockdown category: a most popular topics, with a peak of more than 2% for at least 1 week; and b less popular topics
Fig. 9
Fig. 9
Weekly variation of popularity for topics related to Economy category: a most popular topics, with a peak of more than 2% for at least 1 week, and b less popular topics
Fig. 10
Fig. 10
Weekly variation of popularity for topics related to Cases and Deaths category: a most popular topics, with a peak of more than 2% for at least 1 week, and b less popular topics
Fig. 11
Fig. 11
Weekly variation of popularity for topics related to News and Fake News category
Fig. 12
Fig. 12
Weekly variation of popularity for topics related to Preventive Measures category
Fig. 13
Fig. 13
Geographical origin of the tweets in the final corpus for the top 20 contributing countries
Fig. 14
Fig. 14
Top 10 most popular tweets for the four countries with the highest number of tweets in the corpus
Fig. 15
Fig. 15
Representative events related to lockdown
Fig. 16
Fig. 16
Representative USA-related topics and events. a Most popular topics, with a peak of more than 2% for at least 1 week; and b less popular topics
Fig. 17
Fig. 17
Representative events related to disease outbreak
Fig. 18
Fig. 18
The topic 'toilet paper and other supplies panic-buying' and related events in the five countries where this topic was most popular

Similar articles

Cited by

References

    1. Iqbal, M. (2020). Twitter revenue and usage statistics (2020). Business of Apps. Retrieved January 5, 2021, from https://www.businessofapps.com/data/twitter-statistics/
    1. Fung IC-H, Duke CH, Finch KC, Snook KR, Tseng P-L, Hernandez AC, Tse ZTH. Ebola virus disease and social media: a systematic review. American Journal of Infection Control. 2016;44(12):1660–1671. doi: 10.1016/j.ajic.2016.05.011. - DOI - PubMed
    1. Tang L, Bie B, Park S-E, Zhi D. Social media and outbreaks of emerging infectious diseases: a systematic review of literature. American Journal of Infection Control. 2018;46(9):962–972. doi: 10.1016/j.ajic.2018.02.010. - DOI - PMC - PubMed
    1. Sinnenberg L, Buttenheim AM, Padrez K, Mancheno C, Ungar L, Merchant RM. Twitter as a tool for health research: a systematic review. American Journal of Public Health. 2017;107(1):e1–e8. doi: 10.2105/AJPH.2016.303512. - DOI - PMC - PubMed
    1. Xu P, Dredze M, Broniatowski DA. The twitter social mobility index: measuring social distancing practices with geolocated tweets. Journal of Medical Internet Research. 2020;22(12):e21499. doi: 10.2196/21499. - DOI - PMC - PubMed

LinkOut - more resources