Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 18:302:831-832.
doi: 10.3233/SHTI230281.

Information Extraction from Medical Texts with BERT Using Human-in-the-Loop Labeling

Affiliations

Information Extraction from Medical Texts with BERT Using Human-in-the-Loop Labeling

Hendrik Šuvalov et al. Stud Health Technol Inform. .

Abstract

Neural network language models, such as BERT, can be used for information extraction from medical texts with unstructured free text. These models can be pre-trained on a large corpus to learn the language and characteristics of the relevant domain and then fine-tuned with labeled data for a specific task. We propose a pipeline using human-in-the-loop labeling to create annotated data for Estonian healthcare information extraction. This method is particularly useful for low-resource languages and is more accessible to those in the medical field than rule-based methods like regular expressions.

Keywords: BERT; information extraction; medical texts; named entity recognition; natural language processing.

PubMed Disclaimer

LinkOut - more resources