Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features
- PMID: 37780058
- PMCID: PMC10540469
- DOI: 10.1016/j.knosys.2023.110907
Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features
Abstract
Nearest neighbor search, also known as NNS, is a technique used to locate the points in a high-dimensional space closest to a given query point. This technique has multiple applications in medicine, such as searching large medical imaging databases, disease classification, and diagnosis. However, when the number of points is significantly large, the brute-force approach for finding the nearest neighbor becomes computationally infeasible. Therefore, various approaches have been developed to make the search faster and more efficient to support the applications. With a focus on medical imaging, this paper proposes DenseLinkSearch (DLS), an effective and efficient algorithm that searches and retrieves the relevant images from heterogeneous sources of medical images. Towards this, given a medical database, the proposed algorithm builds an index that consists of pre-computed links of each point in the database. The search algorithm utilizes the index to efficiently traverse the database in search of the nearest neighbor. We also explore the role of medical image feature representation in content-based medical image retrieval tasks. We propose a Transformer-based feature representation technique that outperformed the existing pre-trained Transformer-based approaches on benchmark medical image retrieval datasets. We extensively tested the proposed NNS approach and compared the performance with state-of-the-art NNS approaches on benchmark datasets and our created medical image datasets. The proposed approach outperformed the existing approaches in terms of retrieving accurate neighbors and retrieval speed. In comparison to the existing approximate NNS approaches, our proposed DLS approach outperformed them in terms of lower average time per query and ≥ 99% on 11 out of 13 benchmark datasets. We also found that the proposed medical feature representation approach is better for representing medical images compared to the existing pre-trained image models. The proposed feature extraction strategy obtained an improvement of 9.37%, 7.0%, and 13.33% in terms of P@5, P@10, and P@20, respectively, in comparison to the best-performing pre-trained image model. The source code and datasets of our experiments are available at https://github.com/deepaknlp/DLS.
Keywords: Content-based image retrieval; Image feature representation; Indexing; Nearest neighbor search; Searching in High Dimensions.
Conflict of interest statement
Declarations of interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures






























References
-
- Almalawi AM, Fahad A, Tari Z, Cheema MA, & Khalil I (2015). k nnvwc: An efficient k-nearest neighbors approach based on various-widths clustering. IEEE Transactions on Knowledge and Data Engineering, 28, 68–81.
-
- Antani S, Long LR, & Thoma GR (2004). Content-based image retrieval for large biomedical image archives. In MEDINFO 2004 (pp. 829–833). IOS Press. - PubMed
-
- Antani SK, Deserno TM, Long LR, Güld MO, Neve L, & Thoma GR (2007). Interfacing global and local cbir systems for medical image retrieval. In Bildverarbeitung für die Medizin 2007 (pp. 166–171). Springer.
-
- Ba JL, Kiros JR, & Hinton GE (2016). Layer normalization. arXiv preprint arXiv:1607.06450
-
- Babenko A, & Lempitsky V (2014). The inverted multi-index. IEEE transactions on pattern analysis and machine intelligence, 37, 1247–1260. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources