Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar 29;12 Suppl 2(Suppl 2):S1.
doi: 10.1186/1471-2105-12-S2-S1.

A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents

Affiliations

A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents

Isabel Segura-Bedmar et al. BMC Bioinformatics. .

Abstract

Background: A drug-drug interaction (DDI) occurs when one drug influences the level or activity of another drug. The increasing volume of the scientific literature overwhelms health care professionals trying to be kept up-to-date with all published studies on DDI.

Methods: This paper describes a hybrid linguistic approach to DDI extraction that combines shallow parsing and syntactic simplification with pattern matching. Appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. A pharmacist defined a set of domain-specific lexical patterns to capture the most common expressions of DDI in texts. These lexical patterns are matched with the generated sentences in order to extract DDIs.

Results: We have performed different experiments to analyze the performance of the different processes. The lexical patterns achieve a reasonable precision (67.30%), but very low recall (14.07%). The inclusion of appositions and coordinate structures helps to improve the recall (25.70%), however, precision is lower (48.69%). The detection of clauses does not improve the performance.

Conclusions: Information Extraction (IE) techniques can provide an interesting way of reducing the time spent by health care professionals on reviewing the literature. Nevertheless, no approach has been carried out to extract DDI from texts. To the best of our knowledge, this work proposes the first integral solution for the automatic extraction of DDI from biomedical texts.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Architecture for drug-drug interactions extraction. This figure shows the pipeline architecture of our drug-drug interaction prototype. Firstly, texts are processed by the MMTx program. This tool performs sentence splitting, tokenization, POS-tagging, chunking, and linking of phrases with UMLS concepts. Then, appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. Finally, the lexical patterns are matched with the generated sentences in order to extract DDIs.

References

    1. Rodríguez-Terol A, Camacho C. et al.Calidad estructural de las bases de datos de interacciones. Farmacia Hospitalaria. 2009;33(03):134. doi: 10.1016/S1130-6343(09)71155-9. - DOI - PubMed
    1. Hansten P. Drug interaction management. Pharmacy World & Science. 2003;25(3):94–97. doi: 10.1023/A:1024077018902. - DOI - PubMed
    1. Zhou D, He Y. Extracting interactions between proteins from the literature. Journal of Biomedical Informatics. 2007;41(2):393–407. doi: 10.1016/j.jbi.2007.11.008. - DOI - PubMed
    1. Siddharthan A. Syntactic simplification and text cohesion. Research on Language & Computation. 2006;4:77–109. doi: 10.1007/s11168-006-9011-1. - DOI
    1. Krallinger M, Leitner F, Valencia A. The BioCreative II.5 challenge overview. Proceedings of the BioCreative II. 5 Workshop 2009 on Digital Annotations. 2009. p. 19.

Publication types