Contextualized medication information extraction using Transformer-based deep learning architectures
- PMID: 37100106
- PMCID: PMC10980542
- DOI: 10.1016/j.jbi.2023.104370
Abstract
Objective: To develop a natural language processing (NLP) system that extracts medications and the contextual information needed to understand drug changes. This project is part of the 2022 n2c2 challenge.
Materials and methods: We developed NLP systems for three subtasks: medication mention extraction, event classification (indicating whether a medication change is discussed), and context classification, which categorizes the context of a medication change along 5 orthogonal dimensions related to drug changes. We explored 6 state-of-the-art pretrained transformer models for the three subtasks, including GatorTron, a large language model pretrained using >90 billion words of text (including >80 billion words from >290 million clinical notes identified at the University of Florida Health). We evaluated our NLP systems using annotated data and evaluation scripts provided by the 2022 n2c2 organizers.
Results: Our GatorTron models achieved the best F1-scores of 0.9828 for medication extraction (ranked 3rd), 0.9379 for event classification (ranked 2nd), and the best micro-average accuracy of 0.9126 for context classification. GatorTron outperformed existing transformer models pretrained using smaller general English text and clinical text corpora, indicating the advantage of large language models.
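For readers unfamiliar with the two kinds of metric reported above, the following is a minimal sketch of how an exact-match F1-score and a micro-average accuracy can be computed. It is not the n2c2 evaluation script: the function names, the span representation, and the dimension keys (`dim_1`, `dim_2`) are hypothetical, and the micro-average here assumes that every (mention, dimension) decision is pooled with equal weight.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical representation: a medication mention as (start, end) character offsets.
Span = Tuple[int, int]


def strict_f1(gold: Set[Span], pred: Set[Span]) -> float:
    """F1 over exact-match spans: harmonic mean of precision and recall."""
    tp = len(gold & pred)  # spans predicted with exactly the gold offsets
    if tp == 0:
        return 0.0
    precision = tp / len(pred)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)


def micro_accuracy(gold: List[Dict[str, str]], pred: List[Dict[str, str]]) -> float:
    """Pool every (mention, dimension) decision and return the fraction correct."""
    correct = 0
    total = 0
    for g, p in zip(gold, pred):
        for dim, label in g.items():
            total += 1
            correct += int(p.get(dim) == label)
    return correct / total if total else 0.0


# Toy data, invented purely for illustration.
gold_spans = {(0, 7), (20, 29), (40, 48)}
pred_spans = {(0, 7), (20, 29), (50, 55)}
f1 = strict_f1(gold_spans, pred_spans)

gold_ctx = [{"dim_1": "a", "dim_2": "x"}, {"dim_1": "b", "dim_2": "y"}]
pred_ctx = [{"dim_1": "a", "dim_2": "x"}, {"dim_1": "b", "dim_2": "z"}]
acc = micro_accuracy(gold_ctx, pred_ctx)
```

In the toy data, two of three predicted spans match the gold spans, so precision and recall are both 2/3, and three of the four pooled context decisions are correct.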
Conclusion: This study demonstrated the advantage of using large transformer models for contextual medication information extraction from clinical narratives.
Keywords: Clinical natural language processing; Deep learning; Medication information extraction; Named entity recognition; Text classification.
Copyright © 2023. Published by Elsevier Inc.
Conflict of interest statement
Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.