Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jun:55:23-30.
doi: 10.1016/j.jbi.2015.03.002. Epub 2015 Mar 19.

Extracting drug-drug interactions from literature using a rich feature-based linear kernel approach

Affiliations

Extracting drug-drug interactions from literature using a rich feature-based linear kernel approach

Sun Kim et al. J Biomed Inform. 2015 Jun.

Abstract

Identifying unknown drug interactions is of great benefit in the early detection of adverse drug reactions. Despite existence of several resources for drug-drug interaction (DDI) information, the wealth of such information is buried in a body of unstructured medical text which is growing exponentially. This calls for developing text mining techniques for identifying DDIs. The state-of-the-art DDI extraction methods use Support Vector Machines (SVMs) with non-linear composite kernels to explore diverse contexts in literature. While computationally less expensive, linear kernel-based systems have not achieved a comparable performance in DDI extraction tasks. In this work, we propose an efficient and scalable system using a linear kernel to identify DDI information. The proposed approach consists of two steps: identifying DDIs and assigning one of four different DDI types to the predicted drug pairs. We demonstrate that when equipped with a rich set of lexical and syntactic features, a linear SVM classifier is able to achieve a competitive performance in detecting DDIs. In addition, the one-against-one strategy proves vital for addressing an imbalance issue in DDI type classification. Applied to the DDIExtraction 2013 corpus, our system achieves an F1 score of 0.670, as compared to 0.651 and 0.609 reported by the top two participating teams in the DDIExtraction 2013 challenge, both based on non-linear kernel methods.

Keywords: Biomedical literature; Drug–drug interaction; Linear kernel approach.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Two-phase DDI extraction framework. DDI detection (①) decides whether a drug pair interacts. DDI type classification (②) assigns DDI types to interacting pairs.
Figure 2
Figure 2
An example of preprocessing and feature extraction. The underlined drug pair, clidinium and phenothiazines, is the candidate DDI. ‘NPC’ means noun phrase-constrained coordination and ‘BE’ denotes between candidate drugs.
Figure 3
Figure 3
A solution for presenting drug pairs with significant word and word pair features. Highly weighted words are highlighted in the sentence and emphasized according to all the weights they receive in the feature list. ‘BF’, ‘BE’ and ‘AF’ mean before, between and after, respectively. DRUG indicates a target drug.
Figure 4
Figure 4
Example sentences which start with “[drug name]:”. “[drug name]:” is a section title which is concatenated with the next sentence in the DDIExtraction set.
Figure 5
Figure 5
Example sentences, where some drug names are not annotated. “calcium” in the first sentence and the second “corticosteroids” in the second sentence are not annotated as drug names.

References

    1. Baxter K, Preston CL, editors. Stockley’s Drug Interactions. London, UK: Pharmaceutical Press; 2013.
    1. Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS. DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs. Nucleic Acids Research. 2011;39(Suppl 1):D1035–D1041. - PMC - PubMed
    1. Duda S, Aliferis C, Miller R, Statnikov A, Johnson K. Extracting drug-drug interaction articles from MEDLINE to improve the content of drug databases; AMIA Annual Symposium Proceedings; 2005. pp. 216–220. - PMC - PubMed
    1. Rubin DL, Thorn CF, Klein TE, Altman RB. A statistical approach to scanning the biomedical literature for pharmacogenetics knowledge. Journal of the American Medical Informatics Association. 2005;12(2):121–129. - PMC - PubMed
    1. Segura-Bedmar I, Martínez P, de Pablo-Sánchez C. A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents. BMC Bioinformatics. 2011;12(Suppl 2):S1. - PMC - PubMed

Publication types