HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports
- PMID: 18096908
- PMCID: PMC2274789
- DOI: 10.1197/jamia.M2544
Abstract
Objective: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks.
Design: This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans.
Measurements: We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage.
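The classification step described above can be illustrated with a minimal sketch. It assumes a simple dictionary lookup over report headlines; the abstract does not specify the text-processing algorithms HealthMap uses, so the term lists, the function names `find_first` and `classify`, and the longest-match rule are all hypothetical choices made for illustration.

```python
import re

# Hypothetical, tiny term dictionaries for illustration only.
# Keys are lowercase phrases to look for; values are canonical labels.
DISEASE_TERMS = {
    "avian influenza": "Avian Influenza",
    "bird flu": "Avian Influenza",
    "cholera": "Cholera",
    "dengue": "Dengue",
}
LOCATION_TERMS = {
    "indonesia": "Indonesia",
    "haiti": "Haiti",
    "brazil": "Brazil",
}


def find_first(text, terms):
    """Return the label of the longest dictionary term found as a whole phrase, or None."""
    lowered = text.lower()
    # Try longer terms first so multi-word phrases win over shorter ones.
    for term in sorted(terms, key=len, reverse=True):
        if re.search(r"\b" + re.escape(term) + r"\b", lowered):
            return terms[term]
    return None


def classify(headline):
    """Tag a headline with a disease and a location (either may be None)."""
    return {
        "disease": find_first(headline, DISEASE_TERMS),
        "location": find_first(headline, LOCATION_TERMS),
    }


if __name__ == "__main__":
    print(classify("Bird flu kills two in Indonesia"))
```

A report tagged this way carries the two fields needed to place it on a map; a headline with no dictionary match returns `None` for that field and would be a candidate for human curation.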
Results: As part of the system evaluation, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier achieved 84% accuracy, demonstrating its usefulness in managing the large volume of information processed by the system. Accuracy was 91% for ProMED alerts and 81% for Google News reports, as ProMED messages follow a more regular structure.
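As a sketch of how accuracy can be derived from curation effort, the snippet below treats a report as correctly classified when a curator did not need to correct it, and reports accuracy per source and overall. The function name `accuracy_by_source` and the input layout are assumptions, and the sample records are invented toy data, not the study's data.

```python
def accuracy_by_source(reports):
    """Compute per-source and overall accuracy.

    `reports` is an iterable of (source, needed_correction) pairs, where
    needed_correction is True if a human curator had to fix the classification.
    """
    totals, correct = {}, {}
    for source, needed_correction in reports:
        totals[source] = totals.get(source, 0) + 1
        if not needed_correction:
            correct[source] = correct.get(source, 0) + 1
    if not totals:
        raise ValueError("no reports to evaluate")
    per_source = {s: correct.get(s, 0) / totals[s] for s in totals}
    overall = sum(correct.values()) / sum(totals.values())
    return per_source, overall


if __name__ == "__main__":
    # Invented toy records for illustration only.
    sample = [
        ("ProMED", False),
        ("ProMED", False),
        ("Google News", True),
        ("Google News", False),
    ]
    print(accuracy_by_source(sample))
```

Because the overall figure is a count-weighted average of the per-source figures, it always lies between them and sits closer to the accuracy of whichever source contributes more reports.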
Conclusion: HealthMap is a useful, free and open resource that employs text-processing algorithms to identify important disease outbreak information and presents it through a user-friendly interface.