J Am Med Inform Assoc. 2014 Sep-Oct;21(5):815-23.

doi: 10.1136/amiajnl-2013-001934. Epub 2014 Jan 9.

Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers

Ye Ye¹, Fuchiang Rich Tsui¹, Michael Wagner¹, Jeremy U Espino², Qi Li³

Affiliations

¹ Real-time Outbreak and Disease Surveillance Laboratory (RODS), Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA; Intelligent Systems Program, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.
² Real-time Outbreak and Disease Surveillance Laboratory (RODS), Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.
³ Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, USA.

PMID: 24406261
PMCID: PMC4147621
DOI: 10.1136/amiajnl-2013-001934

Ye Ye et al. J Am Med Inform Assoc. 2014 Sep-Oct.

Erratum in

Correction.
[No authors listed]. J Am Med Inform Assoc. 2023 Dec 22;31(1):281. doi: 10.1093/jamia/ocad155. PMID: 37757460. Free PMC article. No abstract available.

Abstract

Objectives: To evaluate factors affecting performance of influenza detection, including accuracy of natural language processing (NLP), discriminative ability of Bayesian network (BN) classifiers, and feature selection.

Methods: We derived a testing dataset of 124 influenza patients and 87 non-influenza (shigellosis) patients. To assess NLP finding-extraction performance, we measured the overall accuracy, recall, and precision of Topaz and MedLEE parsers for 31 influenza-related findings against a reference standard established by three physician reviewers. To elucidate the relative contribution of NLP and BN classifier to classification performance, we compared the discriminative ability of nine combinations of finding-extraction methods (expert, Topaz, and MedLEE) and classifiers (one human-parameterized BN and two machine-parameterized BNs). To assess the effects of feature selection, we conducted secondary analyses of discriminative ability using the most influential findings defined by their likelihood ratios.
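The finding-extraction comparison described above can be illustrated with a minimal Python sketch. The abstract does not say how the metrics were computed, so everything here is an assumption made for illustration: the binary present/absent encoding of each finding, the `(report_id, finding)` keys, the rule that a finding the parser never mentions counts as absent, and the function name `extraction_metrics`. None of it is the authors' implementation.

```python
from typing import Dict, Tuple

# Hypothetical encoding: each (report_id, finding) pair maps to True when the
# finding is marked present and False when it is marked absent.
Labels = Dict[Tuple[str, str], bool]


def extraction_metrics(reference: Labels, extracted: Labels) -> Dict[str, float]:
    """Compare parser output with a reference standard, pair by pair."""
    tp = fp = fn = tn = 0
    for key, truth in reference.items():
        # Assumption: a finding the parser never mentions is treated as absent.
        predicted = extracted.get(key, False)
        if predicted and truth:
            tp += 1
        elif predicted and not truth:
            fp += 1
        elif truth:
            fn += 1
        else:
            tn += 1
    total = tp + fp + fn + tn
    nan = float("nan")
    return {
        "accuracy": (tp + tn) / total if total else nan,
        "recall": tp / (tp + fn) if (tp + fn) else nan,
        "precision": tp / (tp + fp) if (tp + fp) else nan,
    }


# Toy data, invented for this example only.
reference = {
    ("r1", "fever"): True,
    ("r1", "cough"): True,
    ("r2", "fever"): False,
    ("r2", "cough"): True,
}
extracted = {
    ("r1", "fever"): True,
    ("r1", "cough"): False,
    ("r2", "fever"): True,
    ("r2", "cough"): True,
}
metrics = extraction_metrics(reference, extracted)
print(metrics)
```

On the toy data, two of the four pairs agree with the reference, one present finding is missed, and one absent finding is wrongly extracted. A study like this one would more likely score a richer set of values per finding (for example present, absent, and not mentioned), which this binary sketch deliberately ignores.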

Results: The overall accuracy of Topaz was significantly better than that of MedLEE (with post-processing) (0.78 vs 0.71, p<0.0001). Classifiers using human-annotated findings were superior to classifiers using Topaz/MedLEE-extracted findings (average area under the receiver operating characteristic curve (AUROC): 0.75 vs 0.68, p=0.0113), and machine-parameterized classifiers were superior to the human-parameterized classifier (average AUROC: 0.73 vs 0.66, p=0.0059). The classifiers using the 17 'most influential' findings were more accurate than classifiers using all 31 subject-matter expert-identified findings (average AUROC: 0.76 vs 0.70, p<0.05).

Conclusions: Using a three-component evaluation method, we demonstrated how one could elucidate the relative contributions of components under an integrated framework. To improve classification performance, this study encourages researchers to improve NLP accuracy, use a machine-parameterized classifier, and apply feature selection methods.

Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.


Figures

**Figure 1**
Bayesian network for influenza detection (GeNIe visualization).

**Figure 2**
Percentages of influenza cases and shigellosis cases with targeted influenza-related findings.

**Figure 3**
Log₁₀ LR⁺ (likelihood ratios) of features in expert-defined BN, BN-EM-Topaz, and BN-EM-MedLEE.

**Figure 4**
Log₁₀ LR⁻ (likelihood ratios) of features in expert-defined BN, BN-EM-Topaz, and BN-EM-MedLEE.
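For readers unfamiliar with the quantities plotted in Figures 3 and 4, the textbook definitions of the positive and negative likelihood ratios for a binary finding are given below. The symbols F (finding) and D (influenza) are notation introduced here, and the authors' exact computation from the BN parameters may differ from this standard form.

```latex
\[
\mathrm{LR}^{+} = \frac{P(F = \text{present} \mid D)}{P(F = \text{present} \mid \neg D)},
\qquad
\mathrm{LR}^{-} = \frac{P(F = \text{absent} \mid D)}{P(F = \text{absent} \mid \neg D)}
\]
```

On the log₁₀ scale, a value above 0 for LR⁺ means that the presence of the finding favors influenza, and a value below 0 for LR⁻ means that its absence argues against influenza. A value near 0 on either measure indicates a finding that discriminates little.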


