Using name-internal and contextual features to classify biological terms
- PMID: 15542022
- DOI: 10.1016/j.jbi.2004.08.007
Abstract
Considerable work has been done recently on recognizing named entities in biomedical text. In this paper, we investigate the named entity classification task, an integral part of the named entity extraction task. We focus on the different sources of information that can be used for classification and note the extent to which each is effective. To classify a name, we consider features that appear within the name as well as in nearby phrases. We also develop a new strategy based on the context of occurrence and show that it improves the performance of the classification system. We show how our work relates to previous work on named entity classification in the biological domain as well as in generic domains. The experiments were conducted on the GENIA corpus Ver. 3.0, developed at the University of Tokyo. We achieve an F-value of 86 in 10-fold cross-validation on this corpus.
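As a rough illustration of the two feature sources the abstract names, the sketch below extracts a few name-internal features (from the tokens of the name itself) and a few contextual features (from nearby words), and computes a balanced F-value from precision and recall. It is not the authors' implementation: the specific features, the window size, the function names, and the choice of the balanced F-measure are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Dict, List


def name_internal_features(name_tokens: List[str]) -> Dict[str, object]:
    """Hypothetical features drawn only from the tokens of the name itself."""
    last = name_tokens[-1].lower()
    return {
        "last_word": last,      # rightmost word of the name
        "suffix3": last[-3:],   # final three characters of that word
        "has_digit": any(ch.isdigit() for tok in name_tokens for ch in tok),
        "has_upper": any(ch.isupper() for tok in name_tokens for ch in tok),
    }


def contextual_features(
    sentence: List[str], start: int, end: int, window: int = 2
) -> Dict[str, bool]:
    """Hypothetical features from words near the name at sentence[start:end].

    The window size of 2 is an assumption, not a value from the paper.
    """
    feats: Dict[str, bool] = {}
    for word in sentence[max(0, start - window):start]:
        feats["left=" + word.lower()] = True
    for word in sentence[end:end + window]:
        feats["right=" + word.lower()] = True
    return feats


def f_value(precision: float, recall: float) -> float:
    """Balanced F-measure: the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Made-up example sentence; the name occupies tokens 3..6.
    sentence = ["Activation", "of", "the", "IL-2", "receptor", "alpha",
                "gene", "requires", "NF-kappa", "B"]
    start, end = 3, 7
    features = {}
    features.update(name_internal_features(sentence[start:end]))
    features.update(contextual_features(sentence, start, end))
    print(features)
```

A feature dictionary of this kind could be passed to any standard classifier; which learner the paper uses is not stated in the abstract.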
Similar articles
- Recognizing names in biomedical texts: a machine learning approach. Bioinformatics. 2004 May 1;20(7):1178-90. doi: 10.1093/bioinformatics/bth060. Epub 2004 Feb 10. PMID: 14871877
- Comparison of character-level and part of speech features for name recognition in biomedical texts. J Biomed Inform. 2004 Dec;37(6):423-35. doi: 10.1016/j.jbi.2004.08.008. PMID: 15542016
- Enhancing HMM-based biomedical named entity recognition by studying special phenomena. J Biomed Inform. 2004 Dec;37(6):411-22. doi: 10.1016/j.jbi.2004.08.005. PMID: 15542015
- Status of text-mining techniques applied to biomedical text. Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011. PMID: 16580973 Review.
- Hairpins in bookstacks: information retrieval from biomedical text. Brief Bioinform. 2005 Sep;6(3):222-38. doi: 10.1093/bib/6.3.222. PMID: 16212771 Review.
Cited by
- Using contextual and lexical features to restructure and validate the classification of biomedical concepts. BMC Bioinformatics. 2007 Jul 24;8:264. doi: 10.1186/1471-2105-8-264. PMID: 17650333 Free PMC article.
- Enhancing acronym/abbreviation knowledge bases with semantic information. AMIA Annu Symp Proc. 2007 Oct 11;2007:731-5. PMID: 18693933 Free PMC article.
- Combining contextual and lexical features to classify UMLS concepts. AMIA Annu Symp Proc. 2007 Oct 11;2007:231-5. PMID: 18693832 Free PMC article.
- Automated recognition of malignancy mentions in biomedical literature. BMC Bioinformatics. 2006 Nov 7;7:492. doi: 10.1186/1471-2105-7-492. PMID: 17090325 Free PMC article.