Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 22:316:909-913.
doi: 10.3233/SHTI240559.

A Comprehensive Natural Language Processing Pipeline for the Chronic Lupus Disease

Affiliations

A Comprehensive Natural Language Processing Pipeline for the Chronic Lupus Disease

Livia Lilli et al. Stud Health Technol Inform. .

Abstract

Electronic Health Records (EHRs) contain a wealth of unstructured patient data, making it challenging for physicians to do informed decisions. In this paper, we introduce a Natural Language Processing (NLP) approach for the extraction of therapies, diagnosis, and symptoms from ambulatory EHRs of patients with chronic Lupus disease. We aim to demonstrate the effort of a comprehensive pipeline where a rule-based system is combined with text segmentation, transformer-based topic analysis and clinical ontology, in order to enhance text preprocessing and automate rules' identification. Our approach is applied on a sub-cohort of 56 patients, with a total of 750 EHRs written in Italian language, achieving an Accuracy and an F-score over 97% and 90% respectively, in the three extracted domains. This work has the potential to be integrated with EHR systems to automate information extraction, minimizing the human intervention, and providing personalized digital solutions in the chronic Lupus disease domain.

Keywords: Artificial Intelligence (AI); Electronic Health Record (EHR); Information Extraction (IE); Natural Language Processing (NLP); Systemic Lupus Erythematosus (SLE).

PubMed Disclaimer

LinkOut - more resources