Utilizing descriptive statements from the biodiversity heritage library to expand the Hymenoptera Anatomy Ontology
- PMID: 23441153
- PMCID: PMC3575469
- DOI: 10.1371/journal.pone.0055674
Abstract
Hymenoptera, the insect order that includes sawflies, bees, wasps, and ants, exhibits an incredible diversity of phenotypes, with over 145,000 species described in a corpus of textual knowledge since Carolus Linnaeus. Without specialized training, which often spans decades, these articles can be challenging to decipher. Much of the vocabulary is domain-specific (e.g., Hymenoptera biology), has historically lacked a comprehensive glossary, and contains many homonyms and synonyms. The Hymenoptera Anatomy Ontology (HAO) was developed to surmount this challenge, to aid future communication related to hymenopteran anatomy, and to support domain experts so that they may actively benefit from anatomy ontology development. As part of HAO development, an active learning, dictionary-based, natural language recognition tool was implemented to facilitate Hymenoptera anatomy term discovery in literature. We present this tool, referred to as the 'Proofer', as part of an iterative approach to growing phenotype-relevant ontologies, regardless of domain. The process of ontology development results in a critical mass of terms that is applied as a filter to the source collection of articles in order to reveal term occurrence and biases in natural language species descriptions. Our results indicate that taxonomists use domain-specific terminology that follows taxonomic specialization, particularly at superfamily- and family-level groupings, and that the Proofer tool is effective for term discovery, facilitating ontology construction.
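As a rough illustration of what dictionary-based term recognition over species descriptions can look like, the following Python sketch matches a small lexicon against text, counts occurrences per preferred label, and lists the remaining words as candidates for curator review. It is not the Proofer's actual implementation: the lexicon entries, the function names, and the stopword handling are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical mini-lexicon (preferred label -> known variants).
# Illustrative only; these entries are not drawn from the HAO itself.
LEXICON = {
    "mesosoma": ["mesosoma", "alitrunk"],
    "fore wing": ["fore wing", "forewing"],
    "propodeum": ["propodeum"],
}


def build_matcher(lexicon):
    """Compile one regex covering every variant, longest variant first."""
    variant_to_label = {
        variant.lower(): label
        for label, variants in lexicon.items()
        for variant in variants
    }
    # Longest-first ordering lets multi-word variants win over shorter ones.
    ordered = sorted(variant_to_label, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(v) for v in ordered) + r")\b",
        re.IGNORECASE,
    )
    return pattern, variant_to_label


def count_terms(text, lexicon):
    """Count lexicon hits in text, folding synonyms into the preferred label."""
    pattern, variant_to_label = build_matcher(lexicon)
    return Counter(
        variant_to_label[m.group(0).lower()] for m in pattern.finditer(text)
    )


def unmatched_words(text, lexicon, stopwords=frozenset()):
    """Return words not covered by the lexicon, as candidate new terms."""
    pattern, _ = build_matcher(lexicon)
    remainder = pattern.sub(" ", text)  # blank out everything already known
    words = re.findall(r"[a-z]+", remainder.lower())
    return Counter(w for w in words if w not in stopwords)
```

In an iterative workflow of the kind the abstract describes, a curator would review the unmatched words, add the genuine anatomical terms to the lexicon, and rerun the matcher over the article collection; comparing per-label counts across taxonomic groups is one way term-usage biases could be surfaced.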