Evaluating relevance ranking strategies for MEDLINE retrieval
- PMID: 18952932
- PMCID: PMC2605593
- DOI: 10.1197/jamia.M2935
Evaluating relevance ranking strategies for MEDLINE retrieval
Abstract
This paper evaluates the retrieval effectiveness of relevance ranking strategies on a collection of 55 queries and about 160,000 MEDLINE((R)) citations used in the 2006 and 2007 Text Retrieval Conference (TREC) Genomics Tracks. The authors study two relevance ranking strategies: term frequency-inverse document frequency (TF-IDF) weighting and sentence-level co-occurrence, and examine their ability to rank retrieved MEDLINE documents given user queries. Furthermore, the authors use the reverse chronological order-PubMed's default display option-as a baseline for comparison. Retrieval effectiveness is assessed using both mean average precision and mean rank precision. Experimental results show that retrievals based on the two strategies had improved performance over the baseline performance, and that TF-IDF weighting is more effective in retrieving relevant documents based on the comparison between the two strategies.
References
-
- Hersh W. Information Retrieval: A Health and Biomedical Perspective2nd edition. Spring-Verlag; 2003.
-
- Salton G. Introduction to Modern Information RetrievalMcGraw-Hill; 1983.
-
- Salton G. Developments in automatic text retrieval Science 1991;253(5023):974. - PubMed
-
- Salton G, Buckley C. Term weighting approaches in automatic text retrieval Inf Process Manag 1988;24:513-523.
-
- Robertson SE, Walker S, Jones S, Hancock-Beaulieu M, Gatford M. Okapi at TREC-3 Proceedings of the 3rd Text REtrieval Conference (TREC-3). 1994.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
