Retrospective analysis of the accuracy of predicting the alert level of COVID-19 in 202 countries using Google Trends and machine learning
- PMID: 33110594
- PMCID: PMC7567446
- DOI: 10.7189/jogh.10.020511
Retrospective analysis of the accuracy of predicting the alert level of COVID-19 in 202 countries using Google Trends and machine learning
Abstract
Background: Internet search engine data, such as Google Trends, was shown to be correlated with the incidence of COVID-19, but only in several countries. We aim to develop a model from a small number of countries to predict the epidemic alert level in all the countries worldwide.
Methods: The "interest over time" and "interest by region" Google Trends data of Coronavirus, pneumonia, and six COVID symptom-related terms were searched. The daily incidence of COVID-19 from 10 January to 23 April 2020 of 202 countries was retrieved from the World Health Organization. Three alert levels were defined. Ten weeks' data from 20 countries were used for training with machine learning algorithms. The features were selected according to the correlation and importance. The model was then tested on 2830 samples of 202 countries.
Results: Our model performed well in 154 (76.2%) countries, of which each had no more than four misclassified samples. In these 154 countries, the accuracy was 0.8133, and the kappa coefficient was 0.6828. While in all 202 countries, the accuracy was 0.7527, and the kappa coefficient was 0.5841. The proposed algorithm based on Random Forest Classification and nine features performed better compared to other machine learning methods and the models with different numbers of features.
Conclusions: Our result suggested that the model developed from 20 countries with Google Trends data and Random Forest Classification can be applied to predict the epidemic alert levels of most countries worldwide.
Copyright © 2020 by the Journal of Global Health. All rights reserved.
Conflict of interest statement
Competing interests: The authors completed the ICMJE Unified Competing Interest form (available upon request from the corresponding author) and declare that they have no competing interests.
Figures
Similar articles
-
Increased Internet Searches for Insomnia as an Indicator of Global Mental Health During the COVID-19 Pandemic: Multinational Longitudinal Study.J Med Internet Res. 2020 Sep 21;22(9):e22181. doi: 10.2196/22181. J Med Internet Res. 2020. PMID: 32924951 Free PMC article.
-
Association of the COVID-19 pandemic with Internet Search Volumes: A Google TrendsTM Analysis.Int J Infect Dis. 2020 Jun;95:192-197. doi: 10.1016/j.ijid.2020.04.033. Epub 2020 Apr 17. Int J Infect Dis. 2020. PMID: 32305520 Free PMC article.
-
Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study.JMIR Public Health Surveill. 2020 Apr 14;6(2):e18828. doi: 10.2196/18828. JMIR Public Health Surveill. 2020. PMID: 32234709 Free PMC article.
-
COVID-19 Pandemic: Experiences in China and Implications for its Prevention and Treatment Worldwide.Curr Cancer Drug Targets. 2020;20(6):410-416. doi: 10.2174/1568009620666200414151419. Curr Cancer Drug Targets. 2020. PMID: 32286947 Review.
-
Usefulness of machine learning in COVID-19 for the detection and prognosis of cardiovascular complications.Rev Cardiovasc Med. 2020 Sep 30;21(3):345-352. doi: 10.31083/j.rcm.2020.03.120. Rev Cardiovasc Med. 2020. PMID: 33070540 Review.
Cited by
-
'Let communities do their work': the role of mutual aid and self-help groups in the Covid-19 pandemic response.Disasters. 2021 Dec;45 Suppl 1(Suppl 1):S146-S173. doi: 10.1111/disa.12515. Epub 2021 Dec 7. Disasters. 2021. PMID: 34562282 Free PMC article.
-
Impact of the World Inflammatory Bowel Disease Day and Crohn's and Colitis Awareness Week on Population Interest Between 2016 and 2020: Google Trends Analysis.JMIR Infodemiology. 2021 Oct 28;1(1):e32856. doi: 10.2196/32856. eCollection 2021 Jan-Dec. JMIR Infodemiology. 2021. PMID: 37114197 Free PMC article.
-
Impact of the cervical cancer awareness months on public interest in Japan: A Google Trends analysis, 2012-2021.Sci Rep. 2022 Sep 13;12(1):15391. doi: 10.1038/s41598-022-19798-x. Sci Rep. 2022. PMID: 36100649 Free PMC article.
-
A Survey on COVID-19 Data Analysis Using AI, IoT, and Social Media.Sensors (Basel). 2023 Jun 13;23(12):5543. doi: 10.3390/s23125543. Sensors (Basel). 2023. PMID: 37420714 Free PMC article. Review.
-
Machine learning and applications in microbiology.FEMS Microbiol Rev. 2021 Sep 8;45(5):fuab015. doi: 10.1093/femsre/fuab015. FEMS Microbiol Rev. 2021. PMID: 33724378 Free PMC article. Review.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous