Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Sep 17;10 Suppl 9(Suppl 9):S13.
doi: 10.1186/1471-2105-10-S9-S13.

Characterizing environmental and phenotypic associations using information theory and electronic health records

Affiliations

Characterizing environmental and phenotypic associations using information theory and electronic health records

Xiaoyan Wang et al. BMC Bioinformatics. .

Abstract

Background: The availability of up-to-date, executable, evidence-based medical knowledge is essential for many clinical applications, such as pharmacovigilance, but executable knowledge is costly to obtain and update. Automated acquisition of environmental and phenotypic associations in biomedical and clinical documents using text mining has showed some success. The usefulness of the association knowledge is limited, however, due to the fact that the specific relationships between clinical entities remain unknown. In particular, some associations are indirect relations due to interdependencies among the data.

Results: In this work, we develop methods using mutual information (MI) and its property, the data processing inequality (DPI), to help characterize associations that were generated based on use of natural language processing to encode clinical information in narrative patient records followed by statistical methods. Evaluation based on a random sample consisting of two drugs and two diseases indicates an overall precision of 81%.

Conclusion: This preliminary study demonstrates that the proposed method is effective for helping to characterize phenotypic and environmental associations obtained from clinical reports.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic diagram of the effect of DPI. (a) and (c): Among two paths between ML and SOB (ML-SOB, ML-HD-SOB), ML-SOB was eliminated by applying DPI (shown as 1 in table 1). (b) and (d): Similarly, among two paths between HF and SOB (HF-SOB, HF-HD-SOB), HF-HD-SOB was eliminated by applying DPI (shown as 6 in table 1). (e) Combining evidence from (c) and (d), the path between ML-SOB would more likely to be ML-HD-HF-SOB. ML: Metolazone; HD: Hypertensive Disease; HF: Hear Failure; SOB: Shortness of Breath.
Figure 2
Figure 2
Schematic diagram of methodological framework. A: Acquiring Associations; B: Characterizing Associations; EHR: Electric Health Records; MI: Mutual Information; DPI: Data Processing Inequality; Dx: disease; Rx: drug; Sx: symptom.

Similar articles

Cited by

References

    1. Berner ES, Webster GD, Shugerman AA, Jackson JR, Algina J, Baker AL, Ball EV, Cobbs CG, Dennis VW, Frenkel EP, et al. Performance of four computer-based diagnostic systems. N Engl J Med. 1994;330:1792–1796. doi: 10.1056/NEJM199406233302506. - DOI - PubMed
    1. Clauson KA, Marsh WA, Polen HH, Seamon MJ, Ortiz BI. Clinical decision support tools: analysis of online drug information databases. BMC Med Inform Decis Mak. 2007;7:7. doi: 10.1186/1472-6947-7-7. - DOI - PMC - PubMed
    1. Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp. 2001:17–21. - PMC - PubMed
    1. Weeber M, Klein H, Aronson AR, Mork JG, de Jong-van den Berg LT, Vos R. Text-based discovery in biomedicine: the architecture of the DAD-system. Proc AMIA Symp. 2000:903–907. - PMC - PubMed
    1. Pakhomov S, Weston SA, Jacobsen SJ, Chute CG, Meverden R, Roger VL. Electronic medical records for clinical research: application to the identification of heart failure. Am J Manag Care. 2007;13:281–288. - PubMed

Publication types

MeSH terms

LinkOut - more resources