Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 1;117(48):30088-30095.
doi: 10.1073/pnas.1907377117. Epub 2020 May 11.

On instabilities of deep learning in image reconstruction and the potential costs of AI

Affiliations

On instabilities of deep learning in image reconstruction and the potential costs of AI

Vegard Antun et al. Proc Natl Acad Sci U S A. .

Abstract

Deep learning, due to its unprecedented success in tasks such as image classification, has emerged as a new tool in image reconstruction with potential to change the field. In this paper, we demonstrate a crucial phenomenon: Deep learning typically yields unstable methods for image reconstruction. The instabilities usually occur in several forms: 1) Certain tiny, almost undetectable perturbations, both in the image and sampling domain, may result in severe artefacts in the reconstruction; 2) a small structural change, for example, a tumor, may not be captured in the reconstructed image; and 3) (a counterintuitive type of instability) more samples may yield poorer performance. Our stability test with algorithms and easy-to-use software detects the instability phenomena. The test is aimed at researchers, to test their networks for instabilities, and for government agencies, such as the Food and Drug Administration (FDA), to secure safe use of deep learning methods.

Keywords: AI; deep learning; image reconstruction; instability; inverse problems.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Perturbations rj (created to simulate worst-case effect) with |r1|<|r2|<|r3| are added to the image x. (Top) Images 1 to 4 are original image x and perturbations x+rj. (Bottom) Images 1 to 4 are reconstructions from A(x+rj) using the Deep MRI (DM) network f, where A is a subsampled Fourier transform (33% subsampling); see Methods and SI Appendix for details. (Top and Bottom) Image 5 is a reconstruction from Ax and A(x+r3) using an SoA method; see Methods for details. Note how the artifacts (red arrows) are hard to dismiss as nonphysical.
Fig. 2.
Fig. 2.
A random Gaussian vector eCm is computed by drawing (the real and imaginary part of) each component independently from the normal distribution N(0,10). We let v=A*e, and rescale v so that v2=14r12, where r1 is the perturbation from Fig. 1. The Deep MRI network f reconstructs from the measurements A(x+r1+v) and shows the same artifact as was seen for r1 in Fig. 1. Note that, in this experiment, ACm×N is a subsampled normalized discrete Fourier transform (33% subsampling), so that AA*=I that is, e=Av.
Fig. 3.
Fig. 3.
Perturbations rj (created to simulate worst-case effect) are added to the measurements y=Ax, where |r1|<|r2|<|r3|<|r4|, and A is a subsampled Fourier transform (60% subsampling). To visualize, we show |x+rj|, where y+rj=A(x+rj). (Top) Original image x with perturbations rj. (Middle) Reconstructions from A(x+rj) by the AUTOMAP network f. (Bottom) Reconstructions from A(x+rj) by an SoA method (see Methods for details). A detail in the form of a heart, with varying intensity, is added to visualize the loss in quality.
Fig. 4.
Fig. 4.
(Top) Perturbations r1,r2 (created to simulate worst-case effect) are added to the images x and x. (Middle) The reconstructions by the network f (MRI-VN), from Ax and A(x+r1), and the network f (MED 50), from Ãx and Ã(x+r2). A is a subsampled discrete Fourier transform, and à is a subsampled Radon transform. (Bottom) SoA comparisons. Red arrows are added to highlight the instabilities.
Fig. 5.
Fig. 5.
(Top, Upper Middle, Middle, and Lower Middle) Images xj plus structured perturbations rj (in the form of text and symbols) are reconstructed from measurements yj=Aj(xj+rj) with neural networks fj and SoA methods. The networks are f1=Ell50, f2=DAGAN, f3=MRI-VN, and f4=Deep MRI. The sampling modalities Aj are as follows: A1 is a subsampled discrete Radon transform, A2 is a subsampled discrete Fourier transform (single coil simulation), A3 is a superposition of subsampled discrete Fourier transforms (parallel MRI simulation with 15 coils elements), and A4 is a subsampled discrete Fourier transform (single coil). Note that Deep MRI has not been trained with images containing any of the letters or symbols used in the perturbation, yet it is completely stable with respect to the structural changes. The same is true for the AUTOMAP network (see first column of Fig. 3). (Bottom) PSNR as a function of the subsampling rate for different networks. The dashed red line indicates the subsampling ratio that the networks were trained for.

References

    1. Gull S. F., Daniell G. J., Image reconstruction from incomplete and noisy data. Nature 272, 686–690 (1978).
    1. Studer V., et al. , Compressive fluorescence microscopy for biological and hyperspectral imaging. Proc. Natl. Acad. Sci. U.S.A. 109, 1679–1687 (2011). - PMC - PubMed
    1. Engl H. W., Hanke M., Neubauer A., Regularization of Inverse Problems (Mathematics and its Applications, Springer Netherlands, 1996).
    1. Hansen P. C., “The l-curve and its use in the numerical treatment of inverse problems” in Computational Inverse Problems in Electrocardiology, Johnston P. R., Ed. (Advances in Computational Bioengineering, WIT Press, 2000), pp. 119–142.
    1. LeCun Y., Bengio Y., Hinton G., Deep learning. Nature 521, 436–444 (2015). - PubMed

Publication types

LinkOut - more resources