Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep 1;24(5):986-991.
doi: 10.1093/jamia/ocx039.

Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings

Affiliations

Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings

David S Carrell et al. J Am Med Inform Assoc. .

Abstract

Objective: Widespread application of clinical natural language processing (NLP) systems requires taking existing NLP systems and adapting them to diverse and heterogeneous settings. We describe the challenges faced and lessons learned in adapting an existing NLP system for measuring colonoscopy quality.

Materials and methods: Colonoscopy and pathology reports from 4 settings during 2013-2015, varying by geographic location, practice type, compensation structure, and electronic health record.

Results: Though successful, adaptation required considerably more time and effort than anticipated. Typical NLP challenges in assembling corpora, diverse report structures, and idiosyncratic linguistic content were greatly magnified.

Discussion: Strategies for addressing adaptation challenges include assessing site-specific diversity, setting realistic timelines, leveraging local electronic health record expertise, and undertaking extensive iterative development. More research is needed on how to make it easier to adapt NLP systems to new clinical settings.

Conclusions: A key challenge in widespread application of NLP is adapting existing systems to new clinical settings.

Keywords: cancer screening; data collection; electronic health records; information dissemination; natural language processing.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
NLP adaptation challenges (vertical bars) and potential mitigation strategies (horizontal arrows) for 3 major categories of challenges (corpus assembly, document structure, and linguistic complexity), and the influence of local environmental factors (EHR systems, local policies and practices, and practitioner customs).

References

    1. Jha AK. The promise of electronic records: around the corner or down the road? JAMA. 2011;306:880–81. - PubMed
    1. Wang SV, Rogers JR, Jin Y, Bates DW, Fischer MA. Use of electronic healthcare records to identify complex patients with atrial fibrillation for targeted intervention. J Am Med Inform Assoc. 2017;242:339–44. - PMC - PubMed
    1. Rochefort CM, Verma AD, Eguale T, Lee TC, Buckeridge DL. A novel method of adverse event detection can accurately identify venous thromboembolisms (VTEs) from narrative electronic health record data. J Am Med Inform Assoc. 2015;22:155–65. - PMC - PubMed
    1. Lin C, Karlson EW, Dligach D et al. Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record. J Am Med Inform Assoc. 2015;22:e151–61. - PMC - PubMed
    1. Teixeira PL, Wei WQ, Cronin RM et al. Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals. J Am Med Inform Assoc. 2017;241:162–71. - PMC - PubMed