Learning semantic and visual similarity for endomicroscopy video retrieval

Barbara Andre¹, Tom Vercauteren, Anna M Buchner, Michael B Wallace, Nicholas Ayache

Affiliations

PMID: 22353403
DOI: 10.1109/TMI.2012.2188301

Learning semantic and visual similarity for endomicroscopy video retrieval

Barbara Andre et al. IEEE Trans Med Imaging. 2012 Jun.

. 2012 Jun;31(6):1276-88.

doi: 10.1109/TMI.2012.2188301. Epub 2012 Feb 16.

Authors

Barbara Andre¹, Tom Vercauteren, Anna M Buchner, Michael B Wallace, Nicholas Ayache

Affiliation

¹ Mauna Kea Technologies, 75010 Paris, France. barbara.andre@maunakeatech.com

PMID: 22353403
DOI: 10.1109/TMI.2012.2188301

Abstract

Content-based image retrieval (CBIR) is a valuable computer vision technique which is increasingly being applied in the medical community for diagnosis support. However, traditional CBIR systems only deliver visual outputs, i.e., images having a similar appearance to the query, which is not directly interpretable by the physicians. Our objective is to provide a system for endomicroscopy video retrieval which delivers both visual and semantic outputs that are consistent with each other. In a previous study, we developed an adapted bag-of-visual-words method for endomicroscopy retrieval, called "Dense-Sift," that computes a visual signature for each video. In this paper, we present a novel approach to complement visual similarity learning with semantic knowledge extraction, in the field of in vivo endomicroscopy. We first leverage a semantic ground truth based on eight binary concepts, in order to transform these visual signatures into semantic signatures that reflect how much the presence of each semantic concept is expressed by the visual words describing the videos. Using cross-validation, we demonstrate that, in terms of semantic detection, our intuitive Fisher-based method transforming visual-word histograms into semantic estimations outperforms support vector machine (SVM) methods with statistical significance. In a second step, we propose to improve retrieval relevance by learning an adjusted similarity distance from a perceived similarity ground truth. As a result, our distance learning method allows to statistically improve the correlation with the perceived similarity. We also demonstrate that, in terms of perceived similarity, the recall performance of the semantic signatures is close to that of visual signatures and significantly better than those of several state-of-the-art CBIR methods. The semantic signatures are thus able to communicate high-level medical knowledge while being consistent with the low-level visual signatures and much shorter than them. In our resulting retrieval system, we decide to use visual signatures for perceived similarity learning and retrieval, and semantic signatures for the output of an additional information, expressed in the endoscopist own language, which provides a relevant semantic translation of the visual retrieval outputs.

PubMed Disclaimer

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- IEEE Engineering in Medicine and Biology Society
Other Literature Sources
- The Lens - Patent Citations Database
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Learning semantic and visual similarity for endomicroscopy video retrieval

Affiliation

Learning semantic and visual similarity for endomicroscopy video retrieval

Authors

Affiliation

Abstract

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials