Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep-Oct;17(5):514-8.
doi: 10.1136/jamia.2010.003947.

Extracting medication information from clinical text

Affiliations

Extracting medication information from clinical text

Ozlem Uzuner et al. J Am Med Inform Assoc. 2010 Sep-Oct.

Abstract

The Third i2b2 Workshop on Natural Language Processing Challenges for Clinical Records focused on the identification of medications, their dosages, modes (routes) of administration, frequencies, durations, and reasons for administration in discharge summaries. This challenge is referred to as the medication challenge. For the medication challenge, i2b2 released detailed annotation guidelines along with a set of annotated discharge summaries. Twenty teams representing 23 organizations and nine countries participated in the medication challenge. The teams produced rule-based, machine learning, and hybrid systems targeted to the task. Although rule-based systems dominated the top 10, the best performing system was a hybrid. Of all medication-related fields, durations and reasons were the most difficult for all systems to detect. While medications themselves were identified with better than 0.75 F-measure by all of the top 10 systems, the best F-measure for durations and reasons were 0.525 and 0.459, respectively. State-of-the-art natural language processing systems go a long way toward extracting medication names, dosages, modes, and frequencies. However, they are limited in recognizing duration and reason fields and would benefit from future research.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sample discharge summary narrative and the output for the italicized portion. m, do, mo, f, du, r, and ln stand for medication, dosage, mode, frequency, duration, reason, and list/narrative, respectively.

References

    1. Breydo EM, Chu JT, Turchin A. Identification of inactive medications in narrative medical text. AMIA Annu Symp Proc 2008:66–70 - PMC - PubMed
    1. Turchin A, Shubina M, Breydo E, et al. Comparison of information content of structured and narrative text data sources on the example of medication intensification. J Am Med Inform Assoc 2009;16:362–70 - PMC - PubMed
    1. Grishman R, Sundheim B. Message understanding conference-6: A brief history. 16th Conference on Computational linguistics (COLING). 1996:466–71
    1. Sparck Jones K. Reflections on TREC. Information Processing Management 1995;31:291–314
    1. Uzuner Ö, Solti I, Xia F, et al. Community Annotation Experiment for Ground Truth Generation for the i2b2 Medication Challenge J Am Med Inform Assoc. In current issue. - PMC - PubMed

Publication types

Substances