Essie: a concept-based search engine for structured biomedical text
- PMID: 17329729
- PMCID: PMC2244877
- DOI: 10.1197/jamia.M2233
Essie: a concept-based search engine for structured biomedical text
Abstract
This article describes the algorithms implemented in the Essie search engine that is currently serving several Web sites at the National Library of Medicine. Essie is a phrase-based search engine with term and concept query expansion and probabilistic relevancy ranking. Essie's design is motivated by an observation that query terms are often conceptually related to terms in a document, without actually occurring in the document text. Essie's performance was evaluated using data and standard evaluation methods from the 2003 and 2006 Text REtrieval Conference (TREC) Genomics track. Essie was the best-performing search engine in the 2003 TREC Genomics track and achieved results comparable to those of the highest-ranking systems on the 2006 TREC Genomics track task. Essie shows that a judicious combination of exploiting document structure, phrase searching, and concept based query expansion is a useful approach for information retrieval in the biomedical domain.
Figures




Similar articles
-
Vaidurya: a multiple-ontology, concept-based, context-sensitive clinical-guideline search engine.J Biomed Inform. 2009 Feb;42(1):11-21. doi: 10.1016/j.jbi.2008.07.003. Epub 2008 Aug 3. J Biomed Inform. 2009. PMID: 18721900
-
Factors affecting the effectiveness of biomedical document indexing and retrieval based on terminologies.Artif Intell Med. 2013 Feb;57(2):155-67. doi: 10.1016/j.artmed.2012.08.006. Epub 2012 Oct 23. Artif Intell Med. 2013. PMID: 23092678
-
A dimensional retrieval model for integrating semantics and statistical evidence in context for genomics literature search.Comput Biol Med. 2009 Jan;39(1):61-8. doi: 10.1016/j.compbiomed.2008.11.002. Epub 2009 Jan 15. Comput Biol Med. 2009. PMID: 19147128
-
Information retrieval in digital libraries: bringing search to the net.Science. 1997 Jan 17;275(5298):327-34. doi: 10.1126/science.275.5298.327. Science. 1997. PMID: 8994022 Review.
-
Where to search top-K biomedical ontologies?Brief Bioinform. 2019 Jul 19;20(4):1477-1491. doi: 10.1093/bib/bby015. Brief Bioinform. 2019. PMID: 29579141 Free PMC article. Review.
Cited by
-
eTACTS: a method for dynamically filtering clinical trial search results.J Biomed Inform. 2013 Dec;46(6):1060-7. doi: 10.1016/j.jbi.2013.07.014. Epub 2013 Aug 3. J Biomed Inform. 2013. PMID: 23916863 Free PMC article.
-
Towards the creation of a visual ontology of biomedical imaging entities.AMIA Annu Symp Proc. 2012;2012:866-75. Epub 2012 Nov 3. AMIA Annu Symp Proc. 2012. PMID: 23304361 Free PMC article.
-
Effects of Porting Essie Tokenization and Normalization to Solr.AMIA Annu Symp Proc. 2024 Jan 11;2023:369-378. eCollection 2023. AMIA Annu Symp Proc. 2024. PMID: 38222430 Free PMC article.
-
Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text.J Biomed Inform. 2013 Dec;46(6):1116-24. doi: 10.1016/j.jbi.2013.08.008. Epub 2013 Sep 4. J Biomed Inform. 2013. PMID: 24012881 Free PMC article.
-
Automatically extracting information needs from complex clinical questions.J Biomed Inform. 2010 Dec;43(6):962-71. doi: 10.1016/j.jbi.2010.07.007. Epub 2010 Jul 27. J Biomed Inform. 2010. PMID: 20670693 Free PMC article.
References
-
- Friedman C, Kra P, Rzhetsky A. Two biomedical sublanguages: A description based on the theories of Zellig Harris J Biomed Inform 2002;35:222-235. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources