Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Mar-Apr;15(2):150-7.
doi: 10.1197/jamia.M2544. Epub 2007 Dec 20.

HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports

Affiliations

HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports

Clark C Freifeld et al. J Am Med Inform Assoc. 2008 Mar-Apr.

Abstract

Objective: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks.

Design: This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans.

Measurements: We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage.

Results: As part of the evaluation of the system, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier performed with 84% accuracy, demonstrating significant usefulness in managing the large volume of information processed by the system. Accuracy for ProMED alerts is 91% compared to Google News reports at 81%, as ProMED messages follow a more regular structure.

Conclusion: HealthMap is a useful free and open resource employing text-processing algorithms to identify important disease outbreak information through a user-friendly interface.

PubMed Disclaimer

Figures

Figure 1
Figure 1
HealthMap System Architecture.
Figure 2
Figure 2
Lookup Tree.
Figure 3
Figure 3
User Interface.
Figure 4
Figure 4
Geographic coverage of the HealthMap system.

Comment in

Similar articles

Cited by

References

    1. Grein TW, Kamara KB, Rodier G, Plant AJ, Bovier P, Ryan MJ, et al. Rumors of disease in the global village: outbreak verification Emerg Infect Dis 2000;6(2):97-102Mar-Apr. - PMC - PubMed
    1. Heymann DL, Rodier GR. Hot spots in a wired world: WHO surveillance of emerging and re-emerging infectious diseases Lancet Infect Dis 2001;1(5):345-353Dec. - PubMed
    1. Hiltz SR, Murray T. Structuring computer-mediated communication systems to avoid information overload Communications of the ACM 1985;28(7):680-689.
    1. Berghel H. Cyberspace 2000: Dealing with information overload Communications of the ACM 1997;40(2):19-24.
    1. Brownstein JS, Freifeld CC, Reis BY, Mandl KD. HealthMap: Internet-based emerging infectious disease intelligenceIn: Institute of Medicine, editor Infectious Disease Surveillance and Detection: Assessing the Challenges—Finding Solutions. 2007. pp. 183-204Washington, DC.

Publication types

MeSH terms