Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005:2005:619-23.

Using concept relations to improve ranking in information retrieval

Affiliations

Using concept relations to improve ranking in information retrieval

Susan L Price et al. AMIA Annu Symp Proc. 2005.

Abstract

Despite improved search engine technology, most searches return numerous documents not directly related to the query. This problem is mitigated if relevant documents appear high on a ranked list of search results. We propose that some queries and the underlying information needs can be modeled as relationships between concepts (relations), and we match relations in queries to relations in documents to try to improve ranking of search results. We investigate four techniques to identify two relationships important in medicine, causes and treats, to improve the ranking of medical text documents relevant to clinical questions about causation and treatment. Preliminary results suggest that identifying relation instances can improve the ranking of search results.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Proposed query types. Example queries are shown with relational representations. The letters X and Y represent variables, meaning the slot can be filled by any concept or relationship.
Figure 2
Figure 2
Calculation of similarity score for document d relative to query q based on terms t in q. coord: fraction of terms in q that appear in d tf (term frequency): factor for frequency of t in d idf (inverse term freq.): factor for number of documents containing t (increases when t is in fewer documents) lengthNorm: normalizes for the length of the document
Figure 3
Figure 3
An example of a regular expression and two phrases that it matched. The word “metronidazole” from the query relation treats(“metronidazole”, *) was used to instantiate the variable $intervention.
Figure 4
Figure 4
Query histograms showing the effects of using expansion, regular expressions, and proximity matching on each query. Bars show change in average precision compared to the baseline run. Queries 1–6 have full treats relations, queries 7–12 have full causes relations, queries 13–18 have partial treats relations and queries 19–24 have partial causes relations.

Similar articles

References

    1. National Library of Medicine. ClinicalQuestions Collection. URL: http://clinques.nlm.nih.gov Last accessed: March 7, 2005.
    1. Lancaster FW. MEDLARS: Report on the evaluation of its operating efficiency. American Documentation. 1969;20: p. 119–42. Reprinted in Sparck Jones K, Willett P, editors. Readings in Information Retrieval. San Francisco, CA: Morgan Kaufmann Publishers, Inc.; 1997. p. 223– 246.
    1. Khoo CSG, Myaeng SH. Identifying semantic relations in text for information retrieval and information extraction. In: Green R, Bean CA, Myaeng SH, editors. The semantics of relationships: An interdisciplinary perspective. Boston, MA: Kluwer Academic Publishers; 2002. p. 161–80.
    1. UpToDate. URL: http://www.uptodate.com Last accessed: February 22, 2005.
    1. Ely JW, Osheroff JA, Ebell MH, et al. Analysis of questions asked by family doctors regarding patient care. BMJ. 1999;319:358–61. - PMC - PubMed

LinkOut - more resources