Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov:129:109603.
doi: 10.1016/j.asoc.2022.109603. Epub 2022 Sep 5.

Twitter conversations predict the daily confirmed COVID-19 cases

Affiliations

Twitter conversations predict the daily confirmed COVID-19 cases

Rabindra Lamsal et al. Appl Soft Comput. 2022 Nov.

Abstract

As of writing this paper, COVID-19 (Coronavirus disease 2019) has spread to more than 220 countries and territories. Following the outbreak, the pandemic's seriousness has made people more active on social media, especially on the microblogging platforms such as Twitter and Weibo. The pandemic-specific discourse has remained on-trend on these platforms for months now. Previous studies have confirmed the contributions of such socially generated conversations towards situational awareness of crisis events. The early forecasts of cases are essential to authorities to estimate the requirements of resources needed to cope with the outgrowths of the virus. Therefore, this study attempts to incorporate the public discourse in the design of forecasting models particularly targeted for the steep-hill region of an ongoing wave. We propose a sentiment-involved topic-based latent variables search methodology for designing forecasting models from publicly available Twitter conversations. As a use case, we implement the proposed methodology on Australian COVID-19 daily cases and Twitter conversations generated within the country. Experimental results: (i) show the presence of latent social media variables that Granger-cause the daily COVID-19 confirmed cases, and (ii) confirm that those variables offer additional prediction capability to forecasting models. Further, the results show that the inclusion of social media variables introduces 48.83%-51.38% improvements on RMSE over the baseline models. We also release the large-scale COVID-19 specific geotagged global tweets dataset, MegaGeoCOV, to the public anticipating that the geotagged data of this scale would aid in understanding the conversational dynamics of the pandemic through other spatial and temporal contexts.

Keywords: ARIMAX models; Granger causality; Pandemic forecast; Social media analytics; Time series analysis; Twitter analytics; VAR models.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Daily (new) and total (cumulative) COVID-19 cases reported in Australia between January 25, 2020 (first COVID-19 case reported), and September 9, 2020.
Fig. 2
Fig. 2
The overall view of the Twitter-based COVID-19 cases forecast methodology.
Fig. 3
Fig. 3
Daily distribution of COVID-19 specific tweets between January 1, 2020, and September 9, 2020.
Fig. 4
Fig. 4
Plots of the variables (listed in Table 7) in Dts that Granger-cause y at most lags (10). For each subplot, the vertical axis represents the count of tweets, and the horizontal axis represents the date.
Fig. 5
Fig. 5
COVID-19 confirmed cases versus the cases predicted by the baseline and social media models at 1% and 5% significance levels.
Fig. 6
Fig. 6
Forecast of COVID-19 cases for the next 7 days with VAR(15) model. MAPE=9.08% (overall); MAPE=6.74% (excluding the 9/10/2021’s sudden rise).
Fig. 7
Fig. 7
Search interests data retrieved from Google Trends for the period January 1, 2020, and September 9, 2021.
None

Similar articles

Cited by

References

    1. Zhang L., Zhu F., Xie L., Wang C., Wang J., Chen R., Jia P., Guan H., Peng L., Chen Y., et al. Clinical characteristics of COVID-19-infected cancer patients: a retrospective case study in three hospitals within Wuhan, China. Ann. Oncol. 2020;31(7):894–901. - PMC - PubMed
    1. Worldometers L. 2021. COVID-19 coronavirUS PANDEMIC. URL https://www.worldometers.info/coronavirus/
    1. Smartraveller.gov.au L. 2021. Covid-19 vaccinations. URL https://www.smartraveller.gov.au/COVID-19/COVID-19-vaccinations.
    1. Health Ministry A. 2020. First confirmed case of novel coronavirus in Australia. URL https://www.health.gov.au/ministers/the-hon-greg-hunt-mp/media/first-con....
    1. Lamsal R., Harwood A., Read M.R. Socially enhanced situation awareness from microblogs using artificial intelligence: A survey. ACM Comput. Surv. 2022

LinkOut - more resources