Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar 4:15:64.
doi: 10.1186/1471-2105-15-64.

Knowledge-based extraction of adverse drug events from biomedical text

Affiliations

Knowledge-based extraction of adverse drug events from biomedical text

Ning Kang et al. BMC Bioinformatics. .

Abstract

Background: Many biomedical relation extraction systems are machine-learning based and have to be trained on large annotated corpora that are expensive and cumbersome to construct. We developed a knowledge-based relation extraction system that requires minimal training data, and applied the system for the extraction of adverse drug events from biomedical text. The system consists of a concept recognition module that identifies drugs and adverse effects in sentences, and a knowledge-base module that establishes whether a relation exists between the recognized concepts. The knowledge base was filled with information from the Unified Medical Language System. The performance of the system was evaluated on the ADE corpus, consisting of 1644 abstracts with manually annotated adverse drug events. Fifty abstracts were used for training, the remaining abstracts were used for testing.

Results: The knowledge-based system obtained an F-score of 50.5%, which was 34.4 percentage points better than the co-occurrence baseline. Increasing the training set to 400 abstracts improved the F-score to 54.3%. When the system was compared with a machine-learning system, jSRE, on a subset of the sentences in the ADE corpus, our knowledge-based system achieved an F-score that is 7 percentage points higher than the F-score of jSRE trained on 50 abstracts, and still 2 percentage points higher than jSRE trained on 90% of the corpus.

Conclusion: A knowledge-based approach can be successfully used to extract adverse drug events from biomedical text without need for a large training set. Whether use of a knowledge base is equally advantageous for other biomedical relation-extraction tasks remains to be investigated.

PubMed Disclaimer

References

    1. Jensen LJ, Saric J, Bork P. Literature mining for the biologist: from information retrieval to biological discovery. Nat Rev Genet. 2006;7:119–129. doi: 10.1038/nrg1768. - DOI - PubMed
    1. Zweigenbaum P, Demner-Fushman D, Yu H, Cohen KB. Frontiers of biomedical text mining: current progress. Brief Bioinform. 2007;8:358–375. doi: 10.1093/bib/bbm045. - DOI - PMC - PubMed
    1. Simpson MS, Demner-Fushman D. In: Mining Text Data. Aggarwal CC, Zhai C, editor. New York: Springer; 2012. Biomedical text mining: a survey of recent progress; pp. 465–517.
    1. Revere D, Fuller S. Characterizing biomedical concept relationships. Med Inform (Lond) 2005;8:183–210. doi: 10.1007/0-387-25739-X_7. - DOI
    1. Dai HJ, Chang YC, Tzong-Han Tsai R, Hsu WL. New challenges for biological text-mining in the next decade. J Comput Sci Tech. 2010;25:169–179. doi: 10.1007/s11390-010-9313-5. - DOI

Publication types