Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 1;27(1):47-55.
doi: 10.1093/jamia/ocz120.

Adverse drug event and medication extraction in electronic health records via a cascading architecture with different sequence labeling models and word embeddings

Affiliations

Adverse drug event and medication extraction in electronic health records via a cascading architecture with different sequence labeling models and word embeddings

Hong-Jie Dai et al. J Am Med Inform Assoc. .

Abstract

Objective: An adverse drug event (ADE) refers to an injury resulting from medical intervention related to a drug including harm caused by drugs or from the usage of drugs. Extracting ADEs from clinical records can help physicians associate adverse events to targeted drugs.

Materials and methods: We proposed a cascading architecture to recognize medical concepts including ADEs, drug names, and entities related to drugs. The architecture includes a preprocessing method and an ensemble of conditional random fields (CRFs) and neural network-based models to respectively address the challenges of surrogate string and overlapping annotation boundaries observed in the employed ADEs and medication extraction (ADME) corpus. The effectiveness of applying different pretrained and postprocessed word embeddings for the ADME task was also studied.

Results: The empirical results showed that both CRFs and neural network-based models provide promising solution for the ADME task. The neural network-based models particularly outperformed CRFs in concept types involving narrative descriptions. Our best run achieved an overall micro F-score of 0.919 on the employed corpus. Our results also suggested that the Global Vectors for word representation embedding in general domain provides a very strong baseline, which can be further improved by applying the principal component analysis to generate more isotropic vectors.

Conclusions: We have demonstrated that the proposed cascading architecture can handle the problem of overlapped annotations and further improve the overall recall and F-scores because the architecture enables the developed models to exploit more context information and forms an ensemble for creating a stronger recognizer.

Keywords: adverse drug event; electronic health record; information extraction; named entity recognition; word embedding.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An example of the overlapping annotations.
Figure 2.
Figure 2.
The general workflow applied by top-ranked systems.
Figure 3.
Figure 3.
A cascading architecture developed for the ADME task.
Figure 4.
Figure 4.
Entity type performance comparison for submitted runs using Run 1 as the baseline.
Figure 5.
Figure 5.
Performance comparison for models with different word embeddings. Here the y-axis is the difference between the F-scores of each model for different entity types and that of the model with the nlplab embedding.

References

    1. Nebeker JR, Barach P, Samore MH.. Clarifying adverse drug events: a clinician's guide to terminology, documentation, and reporting. Ann Intern Med 2004; 140 (10): 795–801. - PubMed
    1. Bates DW, Cullen DJ, Laird N, et al. Incidence of adverse drug events and potential adverse drug events: implications for prevention. JAMA 1995; 274 (1): 29–34. - PubMed
    1. Classen DC, Pestotnik SL, Evans RS, Lloyd JF, Burke JP.. Adverse drug events in hospitalized patients: excess length of stay, extra costs, and attributable mortality. JAMA 1997; 277 (4): 301–6. - PubMed
    1. Leape LL, Bates DW, Cullen DJ, et al. Systems analysis of adverse drug events. JAMA 1995; 274 (1): 35–43. - PubMed
    1. Harpaz R, Callahan A, Tamang S, et al. Text mining for adverse drug events: the promise, challenges, and state of the art. Drug Saf 2014; 37 (10): 777–90. - PMC - PubMed

Publication types