Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2012 Feb;31(2):153-63.
doi: 10.1109/TMI.2011.2163944. Epub 2011 Aug 8.

Image similarity and tissue overlaps as surrogates for image registration accuracy: widely used but unreliable

Affiliations
Comparative Study

Image similarity and tissue overlaps as surrogates for image registration accuracy: widely used but unreliable

Torsten Rohlfing. IEEE Trans Med Imaging. 2012 Feb.

Abstract

The accuracy of nonrigid image registrations is commonly approximated using surrogate measures such as tissue label overlap scores, image similarity, image difference, or transformation inverse consistency error. This paper provides experimental evidence that these measures, even when used in combination, cannot distinguish accurate from inaccurate registrations. To this end, we introduce a "registration" algorithm that generates highly inaccurate image transformations, yet performs extremely well in terms of the surrogate measures. Of the tested criteria, only overlap scores of localized anatomical regions reliably distinguish reasonable from inaccurate registrations, whereas image similarity and tissue overlap do not. We conclude that tissue overlap and image similarity, whether used alone or together, do not provide valid evidence for accurate registrations and should thus not be reported or accepted as such.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Original (top row) and modified (bottom row) image data of the IBSR_01 subject. In the modified data, non-brain tissue was removed from the structural images to facilitate inter-subject registration, and missing CSF labels (note, for example, the third ventricle) were added to previously unlabeled regions inside the brain masks.
Fig. 2
Fig. 2
Schematic illustration of rank order-based mapping of a pixel from the fixed image (left) to the moving image (right) via correspondence of pixels sorted by increasing intensities. See text for details and notation.
Fig. 3
Fig. 3
Reformatted images after registration of one image pair (IBSR_01 to IBSR_02) using different registration algorithms. Columns from left to right: Fixed image (IBSR_01), and moving image (IBSR_02) after affine, FFD, SyN, and CURT registration. Rows from top to bottom: structural MR image, difference images (all with identical window/level settings), three-compartment tissue segmentation, and region labels provided by the IBSR.
Fig. 4
Fig. 4
Groupwise average (top row) and standard deviation (bottom row) images of IBSR_02 through IBSR_18 after registration to IBSR_01 using each of the four registration algorithms. To obtain consistent intensity ranges, the pixel intensity values in each reformatted image were globally rescaled to match mean and standard deviation of the reference image intensities. All average images, as well as all standard deviation images, are shown using identical gray scales.
Fig. 5
Fig. 5
Plots of Jaccard overlap scores, J (larger values are better), after registration vs. gold standard region size in pixels. (a) FFD, (b) SyN, (c) CURT. All plots use the same axis scales. Correspondence between plot symbols and ROIs is shown in (c). For bilateral regions, both left and right region values are plotted separately but using the same symbol.
Fig. 6
Fig. 6
Summary graphs over all 306 image pairs comparing the post-registration image similarities after affine, FFD, SyN, and CURT registrations. (a) RMS image difference, (b) NCC image correlation, and (c) NMI image similarity. Stars mark results for which CURT differs significantly from the other algorithms (the remaining algorithms were not tested against one another).
Fig. 7
Fig. 7
Summary graph over all 306 image pairs comparing the tissue overlap scores of affine, FFD, SyN, and CURT registrations. Stars mark results for which CURT differs significantly from the other algorithms (the remaining algorithms were not tested against one another).

References

    1. West JB, Fitzpatrick JM, Wang MY, Dawant BM, Maurer CR, Jr, Kessler RM, Maciunas RJ, Barillot C, Lemoine D, Collignon A, Maes F, Suetens P, Vandermeulen D, van den Elsen PA, Napel S, Sumanaweera TS, Harkness B, Hemler PF, Hill DLG, Hawkes DJ, Studholme C, Maintz JBA, Viergever MA, Malandain G, Pennec X, Noz ME, Maguire GQ, Jr, Pollack M, Pelizzari CA, Robb RA, Hanson D, Woods RP. Comparison and evaluation of retrospective intermodality brain image registration techniques. J. Comput. Assist. Tomogr. 1997;vol. 21(no. 4):554–566. - PubMed
    1. Murphy K, van Ginneken B, Klein S, Staring M, de Hoop B, Viergever M, Pluim J. Semi-automatic construction of reference standards for evaluation of image registration. Med. Image. Anal. 2011 Feb.vol. 15(no. 1):71–84. - PubMed
    1. Fitzpatrick JM, West JB, Maurer CR., Jr Predicting error in rigid-body, point-based registration. IEEE Trans. Med. Imag. 1998 Oct.vol. 17(no. 5):694–702. - PubMed
    1. Song JH, Christensen GE, Hawley JA, Wei Y, Kuhl JG. Evaluating image registration using NIREP. In: Fischer B, Dawant BM, Lorenz C, editors. Biomedical; Image Registration — 4th International Workshop, WBIR 2010; July 11–13, 2010; Lübeck, Germany. Berlin/Heidelberg: Springer-Verlag; 2010. pp. 140–150. Proceedings, ser. LNCS.
    1. Internet Brain Segmentation Repository (IBSR) [Online]. Available: http://www.cma.mgh.harvard.edu/ibsr/

MeSH terms