Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 28;10(10):1144.
doi: 10.3390/bioengineering10101144.

WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval

Affiliations

WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval

Zahra Tabatabaei et al. Bioengineering (Basel). .

Abstract

The paper proposes a federated content-based medical image retrieval (FedCBMIR) tool that utilizes federated learning (FL) to address the challenges of acquiring a diverse medical data set for training CBMIR models. CBMIR is a tool to find the most similar cases in the data set to assist pathologists. Training such a tool necessitates a pool of whole-slide images (WSIs) to train the feature extractor (FE) to extract an optimal embedding vector. The strict regulations surrounding data sharing in hospitals makes it difficult to collect a rich data set. FedCBMIR distributes an unsupervised FE to collaborative centers for training without sharing the data set, resulting in shorter training times and higher performance. FedCBMIR was evaluated by mimicking two experiments, including two clients with two different breast cancer data sets, namely BreaKHis and Camelyon17 (CAM17), and four clients with the BreaKHis data set at four different magnifications. FedCBMIR increases the F1 score (F1S) of each client from 96% to 98.1% in CAM17 and from 95% to 98.4% in BreaKHis, with 11.44 fewer hours in training time. FedCBMIR provides 98%, 96%, 94%, and 97% F1S in the BreaKHis experiment with a generalized model and accomplishes this in 25.53 fewer hours of training.

Keywords: breast cancer; computer-aided diagnosis; content-based medical image retrieval (CBMIR); convolutional auto-encoder (CAE); digital pathology; federated learning (FL); histopathological images; whole-slide images (WSIs).

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
An overview of the use case of a worldwide CBMIR. Pathologists send their query (Q) to the worldwide CBMIR since they need a second opinion to make a more confident decision. Then, the model retrieved top K similar images (S-R), and the pathologists can obtain a second opinion from whole over the world.
Figure 2
Figure 2
A comprehensive illustration of the entire process in a CBMIR, demonstrating the utilization of DL models to acquire images from a hospital and offer a second opinion for pathologists.
Figure 3
Figure 3
The structure of the custom-built CAE. The stride in the encoder = [1,2,2,2], in the bottleneck = [1,1,1,1], in the decoder related to the encoder = [2,2,2,1]. The kernel size of the layers in all parts of the structure and for each layer is 3.
Figure 4
Figure 4
The pipeline of CBMIR. It contains three important sections, namely (1) FE, (2) indexing and saving, and (3) similarity measure and search.
Figure 5
Figure 5
The FedCBMIR pipeline consists of four main steps. Step 1: the server initializes weights, and then sends to client for local training. Step 2: client starts local training. Step 3: client updates local weights to the server side. Step 4: the server side aggregates and updates the distributed weights. (a) An overview of the FedCBMIR pipeline with two clients training, fed with BreaKHis 40× and CAM17 data sets. (b) An overview of the FedCBMIR pipeline with four clients training over clusters at universities and companies with BreaKHis in four different magnifications.
Figure 6
Figure 6
Three random queries from Hospital 5 of CAM17 (test set). Corresponding to each query, the top 5 images are shown from four other hospitals with the most similar patterns to the query. The green and red lines around the retrieved images explain the correct and wrong retrieved images.
Figure 7
Figure 7
(a) shows the results of local training on CAM17 in the TY server. (b) is the result of the searching task in CAM17 by applying the well-trained FedCBMIR model from the first experiment.
Figure 8
Figure 8
(ad) show the CMs as a result of local training and searching at the same magnification. (eh) are the CMs of FL models. The reported results are with top K retrieved images. In all CMs, "0” and “1” indicate “Benign” and “Malignant”, respectively. “True labels” and “Predicted labels” correspond to the query and the retrieved labels, accordingly.
Figure 9
Figure 9
BreaKHis images at four different magnification levels (40×, 100×, 200×, and 400×). The higher magnification offers increased access to relevant information with a reduced field of view.
Figure 10
Figure 10
An indirect comparison between the results of FedCBMIR in both experiments and some recent methods for different amounts of K.
Figure 11
Figure 11
Five lines of random histopathological WSIs with their magnifications. The first column is the query, and the following five columns show the retrieved images. This figure brings a proper overview of Sen2. The retrieved image with the same and different labels as the query is indicated by the green and red borders, accordingly.

References

    1. Arnold M., Morgan E., Rumgay H., Mafra A., Singh D., Laversanne M., Vignat J., Gralow J.R., Cardoso F., Siesling S., et al. Current and future burden of breast cancer: Global statistics for 2020 and 2040. Breast. 2022;66:15–23. doi: 10.1016/j.breast.2022.08.010. - DOI - PMC - PubMed
    1. Zhao T., Fu C., Tie M., Sham C.W., Ma H. RGSB-UNet: Hybrid Deep Learning Framework for Tumour Segmentation in Digital Pathology Images. Bioengineering. 2023;10:957. doi: 10.3390/bioengineering10080957. - DOI - PMC - PubMed
    1. Rashmi R., Prasad K., Udupa C.B.K. Breast histopathological image analysis using image processing techniques for diagnostic purposes: A methodological review. J. Med. Syst. 2022;46:7. doi: 10.1007/s10916-021-01786-9. - DOI - PMC - PubMed
    1. Morelli P., Porazzi E., Ruspini M., Restelli U., Banfi G. Analysis of errors in histology by root cause analysis: A pilot study. J. Prev. Med. Hyg. 2013;54:90. - PMC - PubMed
    1. World Health Organization . Laboratory Quality Standards and Their Implementation. World Health Organization; Geneva, Switzerland: 2011.

LinkOut - more resources