Enhancing HMM-based biomedical named entity recognition by studying special phenomena
- PMID: 15542015
- DOI: 10.1016/j.jbi.2004.08.005
Enhancing HMM-based biomedical named entity recognition by studying special phenomena
Abstract
The purpose of this research is to enhance an HMM-based named entity recognizer in the biomedical domain. First, we analyze the characteristics of biomedical named entities. Then, we propose a rich set of features, including orthographic, morphological, part-of-speech, and semantic trigger features. All these features are integrated via a Hidden Markov Model with back-off modeling. Furthermore, we propose a method for biomedical abbreviation recognition and two methods for cascaded named entity recognition. Evaluation on the GENIA V3.02 and V1.1 shows that our system achieves 66.5 and 62.5 F-measure, respectively, and outperforms the previous best published system by 8.1 F-measure on the same experimental setting. The major contribution of this paper lies in its rich feature set specially designed for biomedical domain and the effective methods for abbreviation and cascaded named entity recognition. To our best knowledge, our system is the first one that copes with the cascaded phenomena.
Similar articles
-
Recognizing names in biomedical texts: a machine learning approach.Bioinformatics. 2004 May 1;20(7):1178-90. doi: 10.1093/bioinformatics/bth060. Epub 2004 Feb 10. Bioinformatics. 2004. PMID: 14871877
-
Comparison of character-level and part of speech features for name recognition in biomedical texts.J Biomed Inform. 2004 Dec;37(6):423-35. doi: 10.1016/j.jbi.2004.08.008. J Biomed Inform. 2004. PMID: 15542016
-
Recognizing names in biomedical texts using mutual information independence model and SVM plus sigmoid.Int J Med Inform. 2006 Jun;75(6):456-67. doi: 10.1016/j.ijmedinf.2005.06.012. Epub 2005 Aug 19. Int J Med Inform. 2006. PMID: 16112894
-
Information retrieval and knowledge discovery utilising a biomedical Semantic Web.Brief Bioinform. 2005 Sep;6(3):252-62. doi: 10.1093/bib/6.3.252. Brief Bioinform. 2005. PMID: 16212773 Review.
-
Status of text-mining techniques applied to biomedical text.Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011. Drug Discov Today. 2006. PMID: 16580973 Review.
Cited by
-
Processing of Short-Form Content in Clinical Narratives: Systematic Scoping Review.J Med Internet Res. 2024 Sep 26;26:e57852. doi: 10.2196/57852. J Med Internet Res. 2024. PMID: 39325515 Free PMC article.
-
Multifaceted Natural Language Processing Task-Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation.JMIR Med Inform. 2024 Oct 30;12:e52897. doi: 10.2196/52897. JMIR Med Inform. 2024. PMID: 39475725 Free PMC article.
-
Using text mining techniques to extract phenotypic information from the PhenoCHF corpus.BMC Med Inform Decis Mak. 2015;15 Suppl 2(Suppl 2):S3. doi: 10.1186/1472-6947-15-S2-S3. Epub 2015 Jun 15. BMC Med Inform Decis Mak. 2015. PMID: 26099853 Free PMC article.
-
Biomedical named entity recognition with the combined feature attention and fully-shared multi-task learning.BMC Bioinformatics. 2022 Nov 3;23(1):458. doi: 10.1186/s12859-022-04994-3. BMC Bioinformatics. 2022. PMID: 36329384 Free PMC article.
-
Unregistered biological words recognition by Q-learning with transfer learning.ScientificWorldJournal. 2014 Feb 19;2014:173290. doi: 10.1155/2014/173290. eCollection 2014. ScientificWorldJournal. 2014. PMID: 24701139 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources