Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 28:6:1282043.
doi: 10.3389/fdgth.2024.1282043. eCollection 2024.

Word sense disambiguation of acronyms in clinical narratives

Affiliations

Word sense disambiguation of acronyms in clinical narratives

Daphné Chopard et al. Front Digit Health. .

Abstract

Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories.

Keywords: acronym disambiguation; deep learning; machine learning; natural language processing; silver standard; word sense disambiguation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
An acronym disambiguation framework.
Figure 2
Figure 2
BERT-based representation of the acronym disambiguation problem.
Figure 3
Figure 3
Ground truth with the distribution of long form candidates.

References

    1. Fandrych I. Submorphemic elements in the formation of acronyms, blends, clippings. Lexis J Engl Lexicol (2008) 2. 10.4000/lexis.713 - DOI
    1. Laszlo S, Federmeier KD. The acronym superiority effect. Psychon Bull Rev (2007) 14:1158–63. 10.3758/BF03193106 - DOI - PMC - PubMed
    1. Moon S, Pakhomov S, Liu N, Ryan JO, Melton GB. A sense inventory for clinical abbreviations, acronyms created using clinical notes, medical dictionary resources. J Am Med Inform Assoc (2014) 21:299–307. 10.1136/amiajnl-2012-001506 - DOI - PMC - PubMed
    1. Spasić I, Krzemiński D, Corcoran P, Balinsky A. Cohort selection for clinical trials from longitudinal patient records: text mining approach. JMIR Med Inform (2019) 7:e15980. 10.2196/15980 - DOI - PMC - PubMed
    1. Holper S, Barmanray R, Colman B, Yates CJ, Liew D, Smallwood D. Ambiguous medical abbreviation study: challenges and opportunities. Intern Med J (2020) 50:1073–8. 10.1111/imj.14442 - DOI - PubMed

LinkOut - more resources