Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 21:2:56.
doi: 10.1038/s41746-019-0131-z. eCollection 2019.

Similar image search for histopathology: SMILY

Affiliations

Similar image search for histopathology: SMILY

Narayan Hegde et al. NPJ Digit Med. .

Abstract

The increasing availability of large institutional and public histopathology image datasets is enabling the searching of these datasets for diagnosis, research, and education. Although these datasets typically have associated metadata such as diagnosis or clinical notes, even carefully curated datasets rarely contain annotations of the location of regions of interest on each image. As pathology images are extremely large (up to 100,000 pixels in each dimension), further laborious visual search of each image may be needed to find the feature of interest. In this paper, we introduce a deep-learning-based reverse image search tool for histopathology images: Similar Medical Images Like Yours (SMILY). We assessed SMILY's ability to retrieve search results in two ways: using pathologist-provided annotations, and via prospective studies where pathologists evaluated the quality of SMILY search results. As a negative control in the second evaluation, pathologists were blinded to whether search results were retrieved by SMILY or randomly. In both types of assessments, SMILY was able to retrieve search results with similar histologic features, organ site, and prostate cancer Gleason grade compared with the original query. SMILY may be a useful general-purpose tool in the pathologist's arsenal, to improve the efficiency of searching large archives of histopathology images, without the need to develop and implement specific tools for each application.

Keywords: Image processing; Machine learning; Medical imaging; Software.

PubMed Disclaimer

Conflict of interest statement

Competing interestsN.H., J.D.H., Y.L., E.R., D.S., M.T., C.J.C., C.H.M., P.Q.N., L.H.P., G.S.C. and M.C.S. are employees of Google LLC and own Alphabet stock. M.E.B. and M.B.A. were compensated for their expertize and time as pathologists.

Figures

Fig. 1
Fig. 1
Overview of Similar Medical Images Like Yours (SMILY). First, a database of image patches and a numerical characterization of each patch’s image contents (termed the embedding) is created. SMILY uses a convolutional neural network to compute this embedding (schematic used for illustration purposes only, see Methods for architecture descriptions). Next, when a query image is selected, SMILY computes the embedding of that query image and compares the embedding with those in the database in a computationally efficient manner. Finally, SMILY returns the k most similar patches, where k is customizable
Fig. 2
Fig. 2
Sample view of the SMILY user interface. Sample query from a prostate specimen and search results. One of the search results has been magnified for better visualization. Clicking a search result opens a new viewer centered on the result that can be zoomed in for detail or zoomed out for context. Additional examples of queries and search results are presented in Supplementary Fig. 4, including an additional interface for scoring the quality of each search result for the prospective studies with pathologists
Fig. 3
Fig. 3
SMILY search accuracy from large-scale quantitative evaluation using pathologist-provided annotations. a Results for histologic feature match in prostate specimens, in comparison with a traditional image feature extractor (scale-invariant feature transform, SIFT) and random search. b Results for prostate cancer Gleason grade and histologic feature match, in comparison with the same baselines. Error bars indicate 95% confidence intervals (Methods)
Fig. 4
Fig. 4
Confusion matrix from SMILY search. An element in row i, column j indicates the fraction of search results for query i that result in a “hit” based on the top-5 score for the category j. a Confusion matrix for the results from Fig. 3a: histologic feature match in prostate specimens. b Confusion matrix for histologic feature match across prostate, breast, and colon specimens. To improve visual contrast and highlight trends better, only discrete colors and rounded-off values are used
Fig. 5
Fig. 5
Evaluation of SMILY from studies with pathologists. The pathologists evaluated search results, blinded to whether the results were retrieved by SMILY versus a negative control, random selection. The similarity scoring rubrics are detailed in the “Prospective studies with pathologists” subsection in the Methods. a Histologic feature match in prostate specimens. b Histologic feature match in prostate, breast, and colon specimens. c Organ site match in prostate, breast, and colon specimens. d Overall match score (Table 3) in prostate specimens for similarity in histology and prostate cancer Gleason grade. Error bars indicate 95% confidence intervals (Methods)
Fig. 6
Fig. 6
Visualizations of the embeddings of image patches in the SMILY database. Each dot represents an image patch. a Colored by organ site, indicating that patches from the same organ were distributed among different clusters. b Colored by histologic feature, indicating a more distinct separation between histologic features

Similar articles

Cited by

References

    1. Mukhopadhyay S, et al. Whole slide imaging versus microscopy for primary diagnosis in surgical pathology: a multicenter blinded randomized noninferiority study of 1992 cases (Pivotal Study) Am. J. Surg. Pathol. 2018;42:39–52. - PMC - PubMed
    1. Lew MS, Sebe N, Djeraba C, Jain R. Content-based multimedia information retrieval: state of the art and challenges. ACM Trans. Multimed. Comput. Commun. Appl. 2006;2:1–19. doi: 10.1145/1126004.1126005. - DOI
    1. Stanoszek LM, Wang GY, Harms PW. Histologic mimics of basal cell carcinoma. Arch. Pathol. Lab. Med. 2017;141:1490–1502. doi: 10.5858/arpa.2017-0222-RA. - DOI - PubMed
    1. Google Images. https://images.google.com/. (Accessed 17 Jan 2019).
    1. Amazon Flow. A9. https://a9.com/what-we-do/visual-search.html. (Accessed 17 Jan 2019).