Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 21;21(1):100.
doi: 10.1186/s12889-020-10106-8.

Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe

Affiliations

Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe

Loukas Samaras et al. BMC Public Health. .

Abstract

Background: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, METHODS: This research has been conducted with official data on measles for 5 years (2013-2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries.

Results: Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results.

Conclusions: The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here.

Keywords: Computational science; Forecasting; Linear regression; Measles; Programming languages; Syndromic surveillance.

PubMed Disclaimer

Conflict of interest statement

We declare that we have no conflict of interest.

Figures

Fig. 1
Fig. 1
Combined results graph for Italy, Germany and France
Fig. 2
Fig. 2
Combined results graph for all countries
Fig. 3
Fig. 3
Real cases, Predictions and normal P-P Plots for each country
Fig. 4
Fig. 4
Scatterplots for residuals against predicted values
Fig. 5
Fig. 5
Histograms for residuals against normal distribution
Fig. 6
Fig. 6
Normal Q-Q Plots for residuals
Fig. 7
Fig. 7
Standard residuals versus order
Fig. 8
Fig. 8
ARIMA model for the UK and Spain

References

    1. Johnson HA, Wagner MM, Hogan WR, Chapman W, Olszewski RT, Dowling J, Barnas G. Analysis of web access logs for surveillance of influenza. Medinfo. 2004;11(Pt 2):1202–1226. - PubMed
    1. Rees EE, Ng V, Gachon P, Mawudeku D, McKenney D, Pedlar J, Yemshanov D, Parmely J, Knox J. Early detection and prediction of infectious disease outbreaks. CCDR. 2019;45:5. doi: 10.14745/ccdr.v45i05a02. - DOI - PMC - PubMed
    1. Christaki E. New technologies in predicting, preventing and controlling emerging infectious diseases. Virulence. 2015;6:558–565. doi: 10.1080/21505594.2015.1040975. - DOI - PMC - PubMed
    1. Google . Google Trends. 2018.
    1. Google. Google Flu Trends. 2019. https://www.google.org/flutrends/about/ (Accessed 04 May 2019).

Publication types