Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015:2015:910423.
doi: 10.1155/2015/910423. Epub 2015 Aug 10.

A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set

Affiliations

A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set

Abdul Wahab Muzaffar et al. Comput Math Methods Med. 2015.

Abstract

The information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools and techniques for information extraction in biomedical text mining. Relation extraction is a significant area under biomedical information extraction that has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction focusing on rule-based and machine learning techniques. In the last decade, the focus has changed to hybrid approaches showing better results. This research presents a hybrid feature set for classification of relations between biomedical entities. The main contribution of this research is done in the semantic feature set where verb phrases are ranked using Unified Medical Language System (UMLS) and a ranking algorithm. Support Vector Machine and Naïve Bayes, the two effective machine learning techniques, are used to classify these relations. Our approach has been validated on the standard biomedical text corpus obtained from MEDLINE 2001. Conclusively, it can be articulated that our framework outperforms all state-of-the-art approaches used for relation extraction on the same corpus.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Hybrid feature set based relation extraction framework.

Similar articles

Cited by

References

    1. Jensen L. J., Saric J., Bork P. Literature mining for the biologist: from information retrieval to biological discovery. Nature Reviews Genetics. 2006;7(2):119–129. doi: 10.1038/nrg1768. - DOI - PubMed
    1. Ananiadou S., Kell D. B., Tsujii J.-I. Text mining and its potential applications in systems biology. Trends in Biotechnology. 2006;24(12):571–579. doi: 10.1016/j.tibtech.2006.10.002. - DOI - PubMed
    1. Chapman W. W., Cohen K. B. Current issues in biomedical text mining and natural language processing. Journal of Biomedical Informatics. 2009;42(5):757–759. doi: 10.1016/j.jbi.2009.09.001. - DOI - PubMed
    1. Zweigenbaum P., Demner-Fushman D., Yu H., Cohen K. B. Frontiers of biomedical text mining: current progress. Briefings in Bioinformatics. 2007;8(5):358–375. doi: 10.1093/bib/bbm045. - DOI - PMC - PubMed
    1. Cusick M. E., Yu H., Smolyar A., et al. Literature-curated protein interaction datasets perspective. Nature Methods. 2009;6:39–46. doi: 10.1038/nmeth.1284. - DOI - PMC - PubMed