Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings

Affiliations

¹ Kaiser Permanente of Washington Health Research Institute (formerly Group Health Research Institute), Seattle, WA, USA.
² Division of Gastroenterology, Hepatology, and Nutrition, Department of Medicine and Epidemiology, University of Pittsburgh, Pittsburgh, PA, USA.
³ Division of Gastroenterology, Beth Israel Deaconess Medical Center, Boston, MA, USA.
⁴ Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA.
⁵ Department of Health Care Policy, Harvard Medical School, Boston, MA, USA.
⁶ Division of Gastroenterology and Hepatology, University of North Carolina School of Medicine, Chapel Hill, NC, USA.
⁷ Division of General Internal Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.

PMID: 28419261
PMCID: PMC6080843
DOI: 10.1093/jamia/ocx039

David S Carrell et al. J Am Med Inform Assoc. 2017 Sep 1;24(5):986-991. doi: 10.1093/jamia/ocx039.

Abstract

Objective: Widespread application of clinical natural language processing (NLP) systems requires taking existing NLP systems and adapting them to diverse and heterogeneous settings. We describe the challenges faced and lessons learned in adapting an existing NLP system for measuring colonoscopy quality.

Materials and methods: We used colonoscopy and pathology reports from 4 settings during 2013-2015; the settings varied by geographic location, practice type, compensation structure, and electronic health record.

Results: Though successful, adaptation required considerably more time and effort than anticipated. Typical NLP challenges in assembling corpora, diverse report structures, and idiosyncratic linguistic content were greatly magnified.

Discussion: Strategies for addressing adaptation challenges include assessing site-specific diversity, setting realistic timelines, leveraging local electronic health record expertise, and undertaking extensive iterative development. More research is needed on how to make it easier to adapt NLP systems to new clinical settings.

Conclusions: A key challenge in widespread application of NLP is adapting existing systems to new clinical settings.

Keywords: cancer screening; data collection; electronic health records; information dissemination; natural language processing.

Figures

**Figure 1.**
NLP adaptation challenges (vertical bars) and potential mitigation strategies (horizontal arrows) for 3 major categories of challenges (corpus assembly, document structure, and linguistic complexity), and the influence of local environmental factors (EHR systems, local policies and practices, and practitioner customs).
