Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 1:8:e1155.
doi: 10.7717/peerj-cs.1155. eCollection 2022.

Homologous point transformer for multi-modality prostate image registration

Affiliations

Homologous point transformer for multi-modality prostate image registration

Alexander Ruchti et al. PeerJ Comput Sci. .

Abstract

Registration is the process of transforming images so they are aligned in the same coordinate space. In the medical field, image registration is often used to align multi-modal or multi-parametric images of the same organ. A uniquely challenging subset of medical image registration is cross-modality registration-the task of aligning images captured with different scanning methodologies. In this study, we present a transformer-based deep learning pipeline for performing cross-modality, radiology-pathology image registration for human prostate samples. While existing solutions for multi-modality prostate image registration focus on the prediction of transform parameters, our pipeline predicts a set of homologous points on the two image modalities. The homologous point registration pipeline achieves better average control point deviation than the current state-of-the-art automatic registration pipeline. It reaches this accuracy without requiring masked MR images which may enable this approach to achieve similar results in other organ systems and for partial tissue samples.

Keywords: Control points; Deep learning; Medical imaging; Registration; Transformer.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. Demonstration of the image processing pipeline.
Input to pipeline requires single slice histology image (moving) and corresponding 2-D MR slice (fixed) without masking. (Left) Solving for high information histology landmarks using a scale invariant feature transform (SIFT) algorithm and enforcing a specified point density. (Middle) Three head network design that accepts MR image, histology image, and proposed histology control points which then solves for corresponding MR homologous points. (Right) With homologous points solved in both domains, a homologous point registration algorithm (Thin-plate Spline) is applied and histology image is warped into MR space.
Figure 2
Figure 2. Network overview.
(A) Full homologous point prediction network. The network has three input heads; M—Grayscale whole mount histology slide (512 × 512), F—Grayscale T2—weighted MR image (512 × 512), X—Histology landmarks. The output of the network is the set of homologous points in MR space—Y. (B) Histology and MR images are divided into patches, prior to linear embedding (MR image shown). The number of patches is a function of the number of points found in the histology landmark generation step. The transformer then uses the linear patch embeddings to compare histology and MR images to find similar patches and propose homologous points. (C) Locality-preserving transformer encoder block that uses convolution as opposed to the commonly-used global multi-head mechanism.
Figure 3
Figure 3. Evaluation metrics.
(A) Example control point deviation. Average control point deviation is the mean length of the red lines between the ground truth points (blue) and the warped landmarks (orange). (B) Example Dice coefficient. MA is red, MB is blue, MA MB is white.
Figure 4
Figure 4. Softmax output visualization for a single point.
Left—Original histology with single landmark (blue). Center—Corresponding MR slice with predicted point probability heatmap generated from the network. The predicted point probability is used to generate the estimated location of the originally placed histology landmark in MR space. Right—Corresponding MR slice with blue predicted point vs. yellow ground truth point.
Figure 5
Figure 5. Comparison of registration results.
Method evaluations from left to right; Human Ground Truth, ProsRegNet—Paper, ProsRegNet—Tuned, and Homologous Point Pipeline. Top— visualization of the Dice coefficient. Blue indicates the ground truth prostate mask in MR space, red indicates prostate mask of histology post registration, and white denotes overlap between the two masks. Top middle—Dice coefficients with individual prostate slides shown in color, black dotted line represents average Dice coefficient for each method. Bottom middle—visualization of the control point displacement between transformed histology space ground truth and MR space ground truth. Yellow dots mark MR point ground truth, blue dots mark transformed histology landmarks, red lines indicate error. Bottom—Mean control point deviation with individual prostate slides shown in color, black dotted line represents average control point deviation for each method. p-value < 0.005 marked with ‘*’.

References

    1. Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26(3):297–302. doi: 10.2307/1932409. - DOI
    1. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreital J, Houlsby N. An image is worth 16×16 words: transformers for image recognition at scale. ArXiv preprint. 2020. - DOI
    1. Harris C, Stephens M. A combined corner and edge detector. Alvey Vision Conference; Princeton: Citeseer; 1988. pp. 10–5244.
    1. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Piscataway: IEEE; 2016.
    1. Hurrell SL, McGarry SD, Kaczmarowski AL, Iczkowski KA, Jacobsohn KM, Hohenwalter MD, Hall WA, See WA, Banerjee A, Charles DK, Nevalainen MT, Mackinnon AC, LaViolette PS. Optimized b-value selection for the discrimination of prostate cancer grades, including the cribriform pattern, using diffusion weighted imaging. Journal of Medical Imaging. 2017;5(1):011004. doi: 10.1117/1.JMI.5.1.011004. - DOI - PMC - PubMed

LinkOut - more resources