Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr;26(4):378-385.
doi: 10.1002/pds.4149. Epub 2017 Jan 3.

Natural language processing to ascertain two key variables from operative reports in ophthalmology

Affiliations

Natural language processing to ascertain two key variables from operative reports in ophthalmology

Liyan Liu et al. Pharmacoepidemiol Drug Saf. 2017 Apr.

Abstract

Purpose: Antibiotic prophylaxis is critical to ophthalmology and other surgical specialties. We performed natural language processing (NLP) of 743 838 operative notes recorded for 315 246 surgeries to ascertain two variables needed to study the comparative effectiveness of antibiotic prophylaxis in cataract surgery. The first key variable was an exposure variable, intracameral antibiotic injection. The second was an intraoperative complication, posterior capsular rupture (PCR), which functioned as a potential confounder. To help other researchers use NLP in their settings, we describe our NLP protocol and lessons learned.

Methods: For each of the two variables, we used SAS Text Miner and other SAS text-processing modules with a training set of 10 000 (1.3%) operative notes to develop a lexicon. The lexica identified misspellings, abbreviations, and negations, and linked words into concepts (e.g. "antibiotic" linked with "injection"). We confirmed the NLP tools by iteratively obtaining random samples of 2000 (0.3%) notes, with replacement.

Results: The NLP tools identified approximately 60 000 intracameral antibiotic injections and 3500 cases of PCR. The positive and negative predictive values for intracameral antibiotic injection exceeded 99%. For the intraoperative complication, they exceeded 94%.

Conclusion: NLP was a valid and feasible method for obtaining critical variables needed for a research study of surgical safety. These NLP tools were intended for use in the study sample. Use with external datasets or future datasets in our own setting would require further testing. Copyright © 2017 John Wiley & Sons, Ltd.

Keywords: comparative effectiveness research; electronic health record; natural language processing; pharmacoepidemiology; practice variation; prophylaxis; surgical-site infection.

PubMed Disclaimer

Figures

Figure 1
Figure 1
NLP Flowchart
Figure 2
Figure 2
Text Miner
Figure 3
Figure 3
Concept Linking* *Concept linking is the processing finding and displaying terms that are spatially associated with a term (e.g., “capsule”) that is part of the lexicon.
Figure 4
Figure 4
Development of NLP to Code Posterior Capsular Rupture (PCR) from an Operative Report

References

    1. Boan S, Conway M, Phuong TM, Ohno-Machado L. Natural language processing in biomedicine: a unified system architecture overview. Methods Mol Biol. 2014;1168:275–94. - PubMed
    1. Hou JK, Imler TD, Imperiale TF. Current and future applications of natural language processing in the field of digestive disease. Clin Gastroenterol Hepatol. 2014;12:1257–61. - PubMed
    1. Murff HJ, FitzHenry F, Matheny ME. Automated Identification of Postoperative Complications Within an Electronic Medical Record Using Natural Language Processing. JAMA. 2011;306(8):848–855. - PubMed
    1. Doan S, Maehara CK, Chaparro JD, Lu S, et al. Building a Natural Language Processing Tool to Identify Patients with High Clinical Suspicion for Kawasaki Disease from Emergency Department Notes. Acad Emerg Med. 2016;23:628–367. - PMC - PubMed
    1. Cheng LT, Zheng J, Savova JK, et al. Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing. J Digital Imaging. 2010;23:119–132. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources