Content-Based Histopathological Image Retrieval

Camilo Nuñez-Fernández et al. Sensors (Basel). 2025 Feb 22;25(5):1350. doi: 10.3390/s25051350.

Abstract

Feature descriptors for histopathological images are a key challenge in building Content-Based Image Retrieval (CBIR) systems, an essential tool for supporting pathologists. Deep learning models such as Convolutional Neural Networks and Vision Transformers improve the extraction of these feature descriptors. These models typically generate embeddings through deep single-scale linear layers or advanced pooling layers. However, because such embeddings focus on local spatial details at a single scale, they miss the richer spatial context available in earlier layers. This gap motivates methods that incorporate multi-scale information to enhance the depth and utility of feature descriptors in histopathological image analysis. In this work, we propose the Local-Global Feature Fusion Embedding Model (LGFFEM). The proposal comprises three components: (1) a pre-trained backbone for multi-scale feature extraction, (2) a neck branch for local-global feature fusion, and (3) a Generalized Mean (GeM)-based pooling head that produces the feature descriptors. In our experiments, the model's neck and head were trained on the ImageNet-1k and PanNuke datasets using the Sub-center ArcFace loss and compared against the state of the art on the Kimia Path24C dataset for histopathological image retrieval, achieving a Recall@1 of 99.40% on test patches.
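The GeM pooling mentioned in component (3) is a standard operator that interpolates between average pooling (p = 1) and max pooling (p → ∞) over the spatial dimensions of a feature map. The following NumPy sketch illustrates the operator in general; it is not the authors' implementation, and the function name and parameters are chosen for illustration only.

```python
import numpy as np

def gem_pool(features, p=3.0, eps=1e-6):
    """Generalized Mean (GeM) pooling over the spatial axes.

    features: array of shape (C, H, W).
    p=1 recovers average pooling; larger p approaches max pooling.
    Values are clamped at eps so fractional powers are well defined.
    Returns a C-dimensional descriptor vector.
    """
    x = np.clip(features, eps, None)
    return np.power(np.mean(np.power(x, p), axis=(1, 2)), 1.0 / p)

# A constant feature map pools to that constant for any p.
const_map = np.full((2, 4, 4), 2.0)
desc = gem_pool(const_map, p=3.0)  # -> array([2., 2.])
```

By the power-mean inequality, for p > 1 the GeM value of each channel lies between its spatial mean and its spatial maximum, which is what lets a learnable p trade off between the two pooling regimes.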

Keywords: content-based image retrieval; feature embedding; feature fusion; histopathological image; transfer learning.


Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
The LGFFEM architecture comprises a pre-trained backbone as a feature extractor over multi-scale stages, a trainable neck consisting of layers for local-global feature fusion, and a pooling head composed of trainable GeM mini-heads, one for each multi-scale fused feature from the neck.
Figure 2
Illustration of the bottleneck operation for the Local and Global Aggregators and the pooling GeM mini-head. (a) Detailed schematic of the Global Feature Aggregator Unit. (b) Detailed schematic of the Local Feature Aggregator Unit. (c) Detailed schematic of the mini-head unit from the GeM head.
Figure 3
Query image selected from the class set S0 and its first retrieved image from the Kimia Path24C dataset. (a) Query image, ID S0-1. (b) First retrieved image, ID S0-2.
Figure 4
Grad-CAM applied to the first layer of the neck used in strategy C for the first retrieved image (ID S0-2). (a) Grad-CAM applied to the outer aggregation fusion node P1_2 in Layer 1. (b) Grad-CAM applied to the outer aggregation fusion node P2_2 in Layer 1. (c) Grad-CAM applied to the outer aggregation fusion node P3_2 in Layer 1. (d) Grad-CAM applied to the outer aggregation fusion node P4_2 in Layer 1. (e) Grad-CAM applied to the collapse of all outer aggregation fusion nodes in Layer 1.
Figure 5
Grad-CAM applied to the second layer of the neck used in strategy C for the first retrieved image (ID S0-2). (a) Grad-CAM applied to the outer aggregation fusion node P1_2 in Layer 2. (b) Grad-CAM applied to the outer aggregation fusion node P2_2 in Layer 2. (c) Grad-CAM applied to the outer aggregation fusion node P3_2 in Layer 2. (d) Grad-CAM applied to the outer aggregation fusion node P4_2 in Layer 2. (e) Grad-CAM applied to the collapse of all outer aggregation fusion nodes in Layer 2.
Figure 6
Grad-CAM applied to the third layer of the neck used in strategy C for the first retrieved image (ID S0-2). (a) Grad-CAM applied to the outer aggregation fusion node P1_2 in Layer 3. (b) Grad-CAM applied to the outer aggregation fusion node P2_2 in Layer 3. (c) Grad-CAM applied to the outer aggregation fusion node P3_2 in Layer 3. (d) Grad-CAM applied to the outer aggregation fusion node P4_2 in Layer 3. (e) Grad-CAM applied to the collapse of all outer aggregation fusion nodes in Layer 3.
Figure 7
Visualization of the 2D projection of embeddings from the Kimia Path24C test dataset using strategy C. Each dot represents an image, and each color represents a tissue class.
