RysannMD: A biomedical semantic annotator balancing speed and accuracy
- PMID: 28552401
- DOI: 10.1016/j.jbi.2017.05.016
RysannMD: A biomedical semantic annotator balancing speed and accuracy
Abstract
Recently, both researchers and practitioners have explored the possibility of semantically annotating large and continuously evolving collections of biomedical texts such as research papers, medical reports, and physician notes in order to enable their efficient and effective management and use in clinical practice or research laboratories. Such annotations can be automatically generated by biomedical semantic annotators - tools that are specifically designed for detecting and disambiguating biomedical concepts mentioned in text. The biomedical community has already presented several solid automated semantic annotators. However, the existing tools are either strong in their disambiguation capacity, i.e., the ability to identify the correct biomedical concept for a given piece of text among several candidate concepts, or they excel in their processing time, i.e., work very efficiently, but none of the semantic annotation tools reported in the literature has both of these qualities. In this paper, we present RysannMD (Ryerson Semantic Annotator for Medical Domain), a biomedical semantic annotation tool that strikes a balance between processing time and performance while disambiguating biomedical terms. In other words, RysannMD provides reasonable disambiguation performance when choosing the right sense for a biomedical term in a given context, and does that in a reasonable time. To examine how RysannMD stands with respect to the state of the art biomedical semantic annotators, we have conducted a series of experiments using standard benchmarking corpora, including both gold and silver standards, and four modern biomedical semantic annotators, namely cTAKES, MetaMap, NOBLE Coder, and Neji. The annotators were compared with respect to the quality of the produced annotations measured against gold and silver standards using precision, recall, and F1 measure and speed, i.e., processing time. In the experiments, RysannMD achieved the best median F1 measure across the benchmarking corpora, independent of the standard used (silver/gold), biomedical subdomain, and document size. In terms of the annotation speed, RysannMD scored the second best median processing time across all the experiments. The obtained results indicate that RysannMD offers the best performance among the examined semantic annotators when both quality of annotation and speed are considered simultaneously.
Keywords: Automated semantic annotation; Biomedical ontologies; Entity linking; Medical terminology; Natural language processing; UMLS metathesaurus.
Copyright © 2017 Elsevier Inc. All rights reserved.
Similar articles
-
Semantic annotation in biomedicine: the current landscape.J Biomed Semantics. 2017 Sep 22;8(1):44. doi: 10.1186/s13326-017-0153-x. J Biomed Semantics. 2017. PMID: 28938912 Free PMC article. Review.
-
SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2. BMC Bioinformatics. 2018. PMID: 30400805 Free PMC article.
-
Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.PLoS One. 2015 Jan 21;10(1):e0116040. doi: 10.1371/journal.pone.0116040. eCollection 2015. PLoS One. 2015. PMID: 25607983 Free PMC article.
-
A multilingual gold-standard corpus for biomedical concept recognition: the Mantra GSC.J Am Med Inform Assoc. 2015 Sep;22(5):948-56. doi: 10.1093/jamia/ocv037. Epub 2015 May 6. J Am Med Inform Assoc. 2015. PMID: 25948699 Free PMC article.
-
A survey on annotation tools for the biomedical literature.Brief Bioinform. 2014 Mar;15(2):327-40. doi: 10.1093/bib/bbs084. Epub 2012 Dec 18. Brief Bioinform. 2014. PMID: 23255168 Review.
Cited by
-
As Ontologies Reach Maturity, Artificial Intelligence Starts Being Fully Efficient: Findings from the Section on Knowledge Representation and Management for the Yearbook 2018.Yearb Med Inform. 2018 Aug;27(1):140-145. doi: 10.1055/s-0038-1667078. Epub 2018 Aug 29. Yearb Med Inform. 2018. PMID: 30157517 Free PMC article. Review.
-
An overview of biomedical entity linking throughout the years.J Biomed Inform. 2023 Jan;137:104252. doi: 10.1016/j.jbi.2022.104252. Epub 2022 Dec 2. J Biomed Inform. 2023. PMID: 36464228 Free PMC article. Review.
-
Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow.Syst Rev. 2021 May 26;10(1):156. doi: 10.1186/s13643-021-01700-x. Syst Rev. 2021. PMID: 34039433 Free PMC article.
-
Transformation of Pathology Reports Into the Common Data Model With Oncology Module: Use Case for Colon Cancer.J Med Internet Res. 2020 Dec 9;22(12):e18526. doi: 10.2196/18526. J Med Internet Res. 2020. PMID: 33295294 Free PMC article.
-
Parallel sequence tagging for concept recognition.BMC Bioinformatics. 2022 Mar 24;22(Suppl 1):623. doi: 10.1186/s12859-021-04511-y. BMC Bioinformatics. 2022. PMID: 35331131 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources