Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun;37(6):1348-1357.
doi: 10.1109/TMI.2018.2827462.

Low-Dose CT Image Denoising Using a Generative Adversarial Network With Wasserstein Distance and Perceptual Loss

Low-Dose CT Image Denoising Using a Generative Adversarial Network With Wasserstein Distance and Perceptual Loss

Qingsong Yang et al. IEEE Trans Med Imaging. 2018 Jun.

Abstract

The continuous development and extensive use of computed tomography (CT) in medical practice has raised a public concern over the associated radiation dose to the patient. Reducing the radiation dose may lead to increased noise and artifacts, which can adversely affect the radiologists' judgment and confidence. Hence, advanced image reconstruction from low-dose CT data is needed to improve the diagnostic performance, which is a challenging problem due to its ill-posed nature. Over the past years, various low-dose CT methods have produced impressive results. However, most of the algorithms developed for this application, including the recently popularized deep learning techniques, aim for minimizing the mean-squared error (MSE) between a denoised CT image and the ground truth under generic penalties. Although the peak signal-to-noise ratio is improved, MSE- or weighted-MSE-based methods can compromise the visibility of important structural details after aggressive denoising. This paper introduces a new CT image denoising method based on the generative adversarial network (GAN) with Wasserstein distance and perceptual similarity. The Wasserstein distance is a key concept of the optimal transport theory and promises to improve the performance of GAN. The perceptual loss suppresses noise by comparing the perceptual features of a denoised output against those of the ground truth in an established feature space, while the GAN focuses more on migrating the data noise distribution from strong to weak statistically. Therefore, our proposed method transfers our knowledge of visual perception to the image denoising task and is capable of not only reducing the image noise level but also trying to keep the critical information at the same time. Promising results have been obtained in our experiments with clinical CT images.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The overall structure of the proposed WGAN-VGG network. In Part 1, n stands for the number of convolutional kernels and s for convolutional stride. So, n32s1 means the convolutional layer has 32 kernels with stride 1.
Fig. 2
Fig. 2
The structure of the discriminator network. n and s have the same meaning as in Fig. 1
Fig. 3
Fig. 3
Optimization procedure of WGAN-VGG network.
Fig. 4
Fig. 4
Plots of validation loss versus the number of epochs during the training of the 5 networks. (a) MSE loss convergence, (b) VGG loss convergence and (c) Wasserstein estimation convergence.
Fig. 5
Fig. 5
Transverse CT images of the abdomen demonstrate a low attenuation liver lesion (in the red box) and a cystic lesion in the upper pole of the left kidney (in the blue box). This display window is [−160, 240]HU.
Fig. 6
Fig. 6
Zoomed ROI of the red rectangle in Fig. 5. The low attenuation liver lesion with in the dashed circle represents metastasis. The lesion is difficult to assess on quarter dose FBP recon (b) due to high noise content. This display window is [−160, 240]HU.
Fig. 7
Fig. 7
Transverse CT images of the abdomen demonstrate small low attenuation liver lesions. The display window is [−160, 240]HU.
Fig. 8
Fig. 8
Zoomed ROI of the red rectangle in Fig. 7 demonstrates the two attenuation liver lesions in the red and blue circles. The display window is [−160, 240]HU.
Fig. 9
Fig. 9
VGG feature maps of full dose and quarter dose images in Fig. 5 and their absolute difference.

References

    1. Brenner DJ, Hall EJ. Computed tomography - an increasing source of radiation exposure. New England J. Med. 2007 Nov.357(22):2277–2284. - PubMed
    1. De Gonzalez AB, Darby S. Risk of cancer from diagnostic x-rays: estimates for the UK and 14 other countries. The Lancet. 2004 Jan.363(9406):345–351. - PubMed
    1. Wang J, Lu H, Li T, Liang Z. Med. Imag. 2005: Image Process. Vol. 5747. International Society for Optics and Photonics; Apr. 2005. Sinogram noise reduction for low-dose CT by statistics-based nonlinear filters; pp. 2058–2067.
    1. Wang J, Li T, Lu H, Liang Z. Penalized weighted least-squares approach to sinogram noise reduction and image reconstruction for low-dose x-ray computed tomography. IEEE Trans. Med. Imag. 2006 Oct.25(10):1272–1283. - PMC - PubMed
    1. Manduca A, Yu L, Trzasko JD, Khaylova N, Kofler JM, McCollough CM, Fletcher JG. Projection space denoising with bilateral filtering and CT noise modeling for dose reduction in CT. Med. Phys. 2009 Nov.36(11):4911–4919. - PMC - PubMed

Publication types