What can digital disease detection learn from (an external revision to) Google Flu Trends?

Mauricio Santillana¹, D Wendong Zhang¹, Benjamin M Althouse², John W Ayers³

Affiliations

¹ School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts.
² Santa Fe Institute, Santa Fe, New Mexico.
³ Graduate School of Public Health, San Diego State University, San Diego, California. Electronic address: ayers.john.w@gmail.com.

PMID: 24997572
DOI: 10.1016/j.amepre.2014.05.020

What can digital disease detection learn from (an external revision to) Google Flu Trends?

Mauricio Santillana et al. Am J Prev Med. 2014 Sep.

. 2014 Sep;47(3):341-7.

doi: 10.1016/j.amepre.2014.05.020. Epub 2014 Jul 2.

Authors

Mauricio Santillana¹, D Wendong Zhang¹, Benjamin M Althouse², John W Ayers³

Affiliations

¹ School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts.
² Santa Fe Institute, Santa Fe, New Mexico.
³ Graduate School of Public Health, San Diego State University, San Diego, California. Electronic address: ayers.john.w@gmail.com.

PMID: 24997572
DOI: 10.1016/j.amepre.2014.05.020

Abstract

Background: Google Flu Trends (GFT) claimed to generate real-time, valid predictions of population influenza-like illness (ILI) using search queries, heralding acclaim and replication across public health. However, recent studies have questioned the validity of GFT.

Purpose: To propose an alternative methodology that better realizes the potential of GFT, with collateral value for digital disease detection broadly.

Methods: Our alternative method automatically selects specific queries to monitor and autonomously updates the model each week as new information about CDC-reported ILI becomes available, as developed in 2013. Root mean squared errors (RMSEs) and Pearson correlations comparing predicted ILI (proportion of patient visits indicative of ILI) with subsequently observed ILI were used to judge model performance.

Results: During the height of the H1N1 pandemic (August 2 to December 22, 2009) and the 2012-2013 season (September 30, 2012, to April 12, 2013), GFT's predictions had RMSEs of 0.023 and 0.022 (i.e., hypothetically, if GFT predicted 0.061 ILI one week, it is expected to err by 0.023) and correlations of r=0.916 and 0.927. Our alternative method had RMSEs of 0.006 and 0.009, and correlations of r=0.961 and 0.919 for the same periods. Critically, during these important periods, the alternative method yielded more accurate ILI predictions every week, and was typically more accurate during other influenza seasons.

Conclusions: GFT may be inaccurate, but improved methodologic underpinnings can yield accurate predictions. Applying similar methods elsewhere can improve digital disease detection, with broader transparency, improved accuracy, and real-world public health impacts.

PubMed Disclaimer

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

What can digital disease detection learn from (an external revision to) Google Flu Trends?

Affiliations

What can digital disease detection learn from (an external revision to) Google Flu Trends?

Authors

Affiliations

Abstract

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Miscellaneous