Review
J Digit Imaging. 2015 Oct;28(5):537-46.
doi: 10.1007/s10278-015-9792-6.

Analyzing Medical Image Search Behavior: Semantics and Prediction of Query Results

Maria De-Arteaga et al. J Digit Imaging. 2015 Oct.

Abstract

Log files of information retrieval systems that record user behavior have been used to improve the outcomes of retrieval systems, understand user behavior, and predict events. In this article, a log file of the ARRS GoldMiner search engine containing 222,005 consecutive queries is analyzed. Time stamps are available for each query, as well as masked IP addresses, which make it possible to identify queries from the same person. This article describes the ways in which physicians (or Internet searchers interested in medical images) search and proposes potential improvements by suggesting query modifications. For example, many queries contain only a few terms and are therefore not specific; others contain spelling mistakes or non-medical terms that are likely to lead to poor or empty results. One of the goals of this report is to predict the number of results a query will return, since such a model allows search engines to automatically propose query modifications that avoid result lists that are empty or too large. This prediction is based on characteristics of the query terms themselves. Prediction of empty results has an accuracy above 88% and can thus be used to automatically modify a query to avoid an empty result set. The semantic analysis and the data on reformulations made by users in the past can aid the development of better search systems, particularly to improve results for novice users. This paper therefore offers ideas for better understanding how people search and for using this knowledge to improve the performance of specialized medical search engines.
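As an illustration of what "characteristics of the query terms themselves" could look like in practice, the following is a minimal sketch in Python. It is not the authors' method: the abstract does not list the features or the model used. The vocabulary, the `QueryFeatures` type, and the `extract_features` function are all hypothetical names introduced here, and the three features (term count, count of out-of-vocabulary terms, and their fraction) are assumptions chosen because the abstract mentions short queries, spelling mistakes, and non-medical terms as likely causes of poor or empty results.

```python
from dataclasses import dataclass

# Hypothetical toy vocabulary (an assumption for this sketch); a real system
# would presumably use a medical lexicon or the search engine's own index terms.
VOCABULARY = frozenset(
    {"pneumonia", "fracture", "chest", "mri", "ct", "liver", "tumor"}
)


@dataclass(frozen=True)
class QueryFeatures:
    """Simple per-query characteristics (illustrative, not from the paper)."""
    num_terms: int            # short queries tend to be unspecific
    num_unknown_terms: int    # misspellings or non-medical terms
    unknown_fraction: float   # share of terms not found in the vocabulary


def extract_features(query: str, vocabulary=VOCABULARY) -> QueryFeatures:
    """Split a query on whitespace and count terms missing from the vocabulary."""
    terms = query.lower().split()
    unknown = [t for t in terms if t not in vocabulary]
    fraction = len(unknown) / len(terms) if terms else 0.0
    return QueryFeatures(len(terms), len(unknown), fraction)


if __name__ == "__main__":
    for q in ("chest CT pneumonia", "pnuemonia chest"):
        print(q, "->", extract_features(q))
```

Features of this kind could be fed to any standard classifier to estimate whether a query will return an empty result list; which features and which classifier actually reach the reported accuracy is described in the full article, not in this abstract.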

Keywords: Human-computer interaction; Image retrieval; Information storage and retrieval; Log file analysis; Machine learning; Medical image search; Statistic analysis.


Figures

Fig. 1. Proportion of the queries containing the most frequently occurring terms
Fig. 2. The number of queries with a specific number of terms in the query
Fig. 3. Number of queries mapped to each RadLex axis
