Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Mar 25:2016:baw036.
doi: 10.1093/database/baw036. Print 2016.

CD-REST: a system for extracting chemical-induced disease relation in literature

Affiliations

CD-REST: a system for extracting chemical-induced disease relation in literature

Jun Xu et al. Database (Oxford). .

Abstract

Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In this article, we present our participation system Chemical Disease Relation Extraction SysTem (CD-REST), an end-to-end system for extracting chemical-induced disease relations in biomedical literature. CD-REST consists of two main components: (1) a chemical and disease named entity recognition and normalization module, which employs the Conditional Random Fields algorithm for entity recognition and a Vector Space Model-based approach for normalization; and (2) a relation extraction module that classifies both sentence-level and document-level candidate drug-disease pairs by support vector machines. Our system achieved the best performance on the chemical-induced disease relation extraction subtask in the BioCreative V CDR Track, demonstrating the effectiveness of our proposed machine learning-based approaches for automatic extraction of chemical-induced disease relations in biomedical literature. The CD-REST system provides web services using HTTP POST request. The web services can be accessed fromhttp://clinicalnlptool.com/cdr The online CD-REST demonstration system is available athttp://clinicalnlptool.com/cdr/cdr.html. Database URL:http://clinicalnlptool.com/cdr;http://clinicalnlptool.com/cdr/cdr.html.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A sample from the CDR corpus with the annotations of mentions, corresponding normalized MeSH IDs for both chemical and disease entities and normalized chemical-induced disease relation conveyed in the abstract.
Figure 2.
Figure 2.
An overview of CD-REST.

References

    1. Davis A.P., Grondin C.J., Lennon-Hopkins K. et al. (2015) The Comparative Toxicogenomics Database’s 10th year anniversary: update 2015. Nucleic Acids Res., 43, D914–D920. - PMC - PubMed
    1. Arighi C.N., Roberts P.M., Agarwal S. et al. (2011) BioCreative III interactive task: an overview. BMC Bioinformatics, 12, S4.. - PMC - PubMed
    1. Wiegers T.C., Davis A.P., Mattingly C.J. (2012) Collaborative biocuration—text-mining development task for document prioritization for curation. Database, 2012, bas037. - PMC - PubMed
    1. Wiegers T.C., Davis A.P., Mattingly C.J. (2014) Web services-based text-mining demonstrates broad impacts for interoperability and process simplification. Database, 2014, bau050. - PMC - PubMed
    1. Savova G.K., Masanz J.J., Ogren P.V. et al. (2010) Mayo clinical text analysis and knowledge extraction system (cTAKES): architecture, component evaluation and applications. J. Am. Med. Inf. Assoc., 17, 507–513. - PMC - PubMed

Publication types

LinkOut - more resources