On the query reformulation technique for effective MEDLINE document retrieval
- PMID: 20394839
- DOI: 10.1016/j.jbi.2010.04.005
Abstract
Improving the retrieval accuracy of MEDLINE documents remains challenging because retrieval precision is low. Focusing on query expansion based on pseudo-relevance feedback (PRF), this paper addresses the problem by systematically examining the effects of expansion term selection and of adjusting the term weights of the expanded query, using the OHSUMED test collection of MEDLINE documents. We implemented a baseline information retrieval system based on the Okapi BM25 retrieval model, compared six well-known term ranking algorithms for selecting useful expansion terms, and then compared traditional term reweighting algorithms with our new variant of the standard Rocchio feedback formula, which adopts a group-based weighting scheme. On the OHSUMED test collection, our experiments showed maximum improvements of 20.2% in mean average precision and 20.4% in recall over unexpanded queries when terms were expanded using a co-occurrence analysis-based term ranking algorithm in conjunction with our term reweighting algorithm (p < 0.05). Our study characterizes the behavior of different query reformulation techniques that can be used for more effective MEDLINE document retrieval.
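As a rough illustration of the pipeline the abstract describes (BM25 retrieval, pseudo-relevance feedback, expansion term ranking, and Rocchio-style reweighting), the following is a minimal sketch, not the authors' implementation. Several things here are assumptions: the function names `bm25_scores` and `expand_query`, the parameter defaults (`k1`, `b`, `alpha`, `beta`), the toy documents, and the choice to rank candidate terms by their weight in the feedback centroid. It uses the standard Rocchio formula without a negative term; the paper's group-based weighting variant and its six term ranking algorithms are not specified in the abstract and are not reproduced.

```python
import math
from collections import Counter


def bm25_scores(query_weights, docs, k1=1.2, b=0.75):
    """Score each tokenized document against a weighted query with Okapi BM25."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: number of documents containing each term.
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term, qw in query_weights.items():
            if tf[term] == 0:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] * (k1 + 1) / (tf[term] + k1 * (1 - b + b * len(d) / avgdl))
            score += qw * idf * norm
        scores.append(score)
    return scores


def expand_query(query_weights, docs, top_docs=2, n_terms=2, alpha=1.0, beta=0.75):
    """Pseudo-relevance feedback: treat the top-ranked documents as relevant,
    pick expansion terms from them, and reweight with a Rocchio-style update."""
    scores = bm25_scores(query_weights, docs)
    ranked = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    feedback = [docs[i] for i in ranked[:top_docs]]

    # Centroid of the feedback documents (length-normalized term frequency).
    centroid = Counter()
    for d in feedback:
        for term, count in Counter(d).items():
            centroid[term] += count / len(d) / len(feedback)

    # Assumed term ranking: highest centroid weight first, ties broken alphabetically.
    candidates = sorted(
        (t for t in centroid if t not in query_weights),
        key=lambda t: (-centroid[t], t),
    )[:n_terms]

    # Rocchio update without a non-relevant term: q' = alpha * q + beta * centroid.
    new_query = {t: alpha * w for t, w in query_weights.items()}
    for term in list(query_weights) + candidates:
        new_query[term] = new_query.get(term, 0.0) + beta * centroid[term]
    return new_query


# Toy, made-up documents (already tokenized) purely for illustration.
docs = [
    ["heart", "attack", "aspirin", "therapy"],
    ["heart", "attack", "aspirin", "risk"],
    ["kidney", "stone", "diet"],
    ["skin", "rash", "cream"],
]
expanded = expand_query({"heart": 1.0, "attack": 1.0}, docs, top_docs=2, n_terms=1)
print(expanded)
```

In this sketch the expanded query can be passed straight back to `bm25_scores` for a second retrieval pass, which is the usual way a PRF loop is closed.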
Similar articles
- Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval. Healthc Inform Res. 2011 Jun;17(2):120-30. doi: 10.4258/hir.2011.17.2.120. Epub 2011 Jun 30. PMID: 21886873. Free PMC article.
- Clinical task-specific query expansion for the retrieval of scientifically rigorous research documents. Stud Health Technol Inform. 2010;160(Pt 2):1174-8. PMID: 20841869.
- Reflecting all query aspects on query expansion. AMIA Annu Symp Proc. 2008 Nov 6:1189. PMID: 18998930.
- An adaptive term proximity based rocchio's model for clinical decision support retrieval. BMC Med Inform Decis Mak. 2019 Dec 12;19(Suppl 9):251. doi: 10.1186/s12911-019-0986-6. PMID: 31830960. Free PMC article.
- Evaluating relevance ranking strategies for MEDLINE retrieval. J Am Med Inform Assoc. 2009 Jan-Feb;16(1):32-6. doi: 10.1197/jamia.M2935. Epub 2008 Oct 24. PMID: 18952932. Free PMC article.
Cited by
- Development and empirical user-centered evaluation of semantically-based query recommendation for an electronic health record search engine. J Biomed Inform. 2017 Mar;67:1-10. doi: 10.1016/j.jbi.2017.01.013. Epub 2017 Jan 25. PMID: 28131722. Free PMC article.
- Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval. Healthc Inform Res. 2011 Jun;17(2):120-30. doi: 10.4258/hir.2011.17.2.120. Epub 2011 Jun 30. PMID: 21886873. Free PMC article.
- GO2PUB: Querying PubMed with semantic expansion of gene ontology terms. J Biomed Semantics. 2012 Sep 7;3(1):7. doi: 10.1186/2041-1480-3-7. PMID: 22958570. Free PMC article.