Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar;25(2):E245-E254.

Automated Extraction of Pain Symptoms: A Natural Language Approach using Electronic Health Records

Affiliations
  • PMID: 35322976

Automated Extraction of Pain Symptoms: A Natural Language Approach using Electronic Health Records

Amisha D Dave et al. Pain Physician. 2022 Mar.

Abstract

Background: Pain costs more than $600 billion annually and affects more than 100 million Americans, but is still a poorly understood problem and one for which there is very often limited effective treatment. Electronic health records (EHRs) are the only databases with a high volume of granular pain information that allows for documentation of detailed clinical notes on a patient's subjective experience.

Objectives: This study applied natural language processing (NLP) technology to an EHR dataset as part of a pilot study to capture pain information from clinical notes and prove its feasibility as an efficient method.

Study design: Retrospective study.

Setting: All data were from UConn Health John Dempsey Hospital (JDH) in Farmington, CT.

Methods: The JDH EHR dataset contains 611,355 clinical narratives from 359,854 patients from diverse demographic backgrounds from 2010 through 2019. These data were processed through a customized NLP pipeline. A training set of 100 notes was annotated based on focus group-generated ontology and used to generate and evaluate an NLP model that was later tested on the remaining notes. Validation of the model was evaluated externally and performance was analyzed.

Results: The model identified back pain as the most common location of experienced pain with 40,369 term frequencies. Patients most commonly experienced decreased mobility with their pain with 7,375 term frequencies. Pain was most commonly found to be radiating with 26,967 term frequencies and patients most commonly rated their pain as 8/10 with 2,375 term frequencies. All parameters studied had statistical F-scores greater than 0.85.

Limitations: A single-center, pilot study subject to reporting bias, recording bias, and missing patient data.

Conclusions: Our customized NLP model demonstrated good and successful performance in extracting granular pain information from clinical notes in electronic health records.

Keywords: automation; electronic health records; natural language processing; pain location; pain quality; pain quantity; pain symptoms; Pain ontology.

PubMed Disclaimer