Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 1;11(7):e0158201.
doi: 10.1371/journal.pone.0158201. eCollection 2016.

Context-Aware Image Compression

Affiliations

Context-Aware Image Compression

Jacky C K Chan et al. PLoS One. .

Erratum in

Abstract

We describe a physics-based data compression method inspired by the photonic time stretch wherein information-rich portions of the data are dilated in a process that emulates the effect of group velocity dispersion on temporal signals. With this coding operation, the data can be downsampled at a lower rate than without it. In contrast to previous implementation of the warped stretch compression, here the decoding can be performed without the need of phase recovery. We present rate-distortion analysis and show improvement in PSNR compared to compression via uniform downsampling.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Overview schematic for image compression codec with warped stretch.
The input is split into two components: i) the downsampled warped image and ii) the metadata, which contains a compressed version of the warp kernel. These two components are jointly used for recovering the original input. Since the warp kernel is image-dependent, we must send it as part of the compressed file, which creates extra overhead relative to an image-independent compression technique, such as uniform sampling. However, if the metadata can be compressed extremely compactly, the overall compression ratio can still be significant.
Fig 2
Fig 2. Flowchart for warped stretch compression using piecewise decomposition.
(A) A separate warp kernel is generated for each row of the image. The warp is based on the local row bandwidth, i.e. the derivative of the image intensity. The input image is then warped by the kernel and downsampled at a uniform rate. The compressed image is then saved into a custom binary file format (WST), along with the warp kernel, which itself is compressed via piecewise decomposition. To reconstruct the image, we decompress the piecewise-decomposed warp kernel and use it to perform non-uniform upsampling on the reloaded compressed image. (B) For comparison purposes, we also uniformly downsample the input image in 1D with a lower downsampling rate that accounts for the warp kernel overhead saved in the WST binary format.
Fig 3
Fig 3. Results for row 838 (out of 1672) of the fractal clock image at 6X warped stretch compression with overhead compensation.
The line signal (A) is first rescaled using non-uniform cubic interpolation as defined by the warp kernel, generated according to Eq (7). In this warped space (B), the signal can now be downsampled at a uniform rate (indicated by the red circles) that is lower than what is possible using uniform downsampling, at a given reconstruction quality. The number of downsampled points is less than 1/8th of the number of pixels in the original line signal so that the compression ratio becomes 8X after taking the warp kernel overhead into consideration when saving to file. Both the warping and the downsampling operations can be reversed to reconstruct the line signal (C). The corresponding locations of the downsampled points (red circles in (B)) overlay the (A) original and (C) reconstructed line signals for visual reference. The dashed frames in (A) and (B) are shown in closeup form in Fig 4.
Fig 4
Fig 4. Closeup of Fig 3A and 3B.
(A) A particular subsection of row 838 of the fractal clock image (dashed frame in Fig 3A), which contains a mixture of feature-sparse and feature-dense regions, is shown in expanded form. (B) The same subsection, after warping by the warp kernel (dashed frame in Fig 3B). The subsection in (B) matches the length of the original line subsection (A) to show the redistribution of the feature density caused by the warped stretch transform. The corresponding locations of the downsampled points (red circles in (B)) overlay the original line signal (B) for visual reference.
Fig 5
Fig 5. Comparison of compression performance with the fractal clock image.
The original input image (A-B) and the 4X uniformly downsampled case (C-E), as compared with the reconstructed image after 5.25X warped stretch compression (F-H). The downsampling rate for the uniform case was increased (hence the image quality improvement) such that the resultant file sizes for both warped and uniform compression become equal (to compensate for the warp kernel overhead). After reconstruction, the warped case (G-H) achieved a PSNR of 37.7 dB, which was 6.32 dB better than the uniform downsampling case (D-E).
Fig 6
Fig 6. Comparison of compression performance with the colour portrait image.
(A) The 8X uniformly downsampled image and (B) the 10.2X warp stretch-compressed image are shown with (C) the original image and (D-E) their respective reconstructions, while (F-H) are, in turn, their respective close-up portions. Further zoom-ins on the rims of the glasses are shown in (I-K), highlighting the failure of uniform downsampling to capture this sharp feature. The downsampling rate for the uniform case was adjusted such that the resultant file sizes for both warped and uniform compression become equal; however, since all three colour channels share the same warp kernel, the overhead burden is reduced by a third in this scenario. After reconstruction, the warped case (E,H,K) achieved a PSNR of 39.1 dB, which was 3.11 dB better than the uniform downsampling case (D,G,I).
Fig 7
Fig 7. Empirical rate distortion plot for fractal clock and portrait images.
The PSNR of warped stretch compression (solid) is compared with uniform downsampling (dotted) over a range of compression ratios for (A) the grayscale fractal clock image and (B) the colour portrait image. At a compression ratio of 4X, warped stretch outperforms in PSNR by 6.32 dB in the clock, and by 4.10 dB in the portrait. Beyond the compression ratios of 9X and 20X respectively, the overhead from the warp kernel completely compromises the performance.

References

    1. White RL. High-Performance Compression of Asdronomical Images. Sp. Earth Sci. Data Work., 1993, p. 117–23.
    1. White RL, Percival JW. Compression and progressive transmission of astronomical images. SPIE Adv. Technol. Opt. Telesc. V, vol. 2199, 1994, p. 703–13.
    1. Louys M, Starck JL, Mei S, Bonnarel F, Murtagh F. Astronomical image compression. Astron Astrophys Suppl Ser 1999;136:579–90. 10.1051/aas:1999235 - DOI
    1. Zabala A, Pons X, Díaz-Delgado R, García F, Aulí-Llinàs F, Serra-Sagristà J. Effects of JPEG and JPEG2000 lossy compression on remote sensing image classification for mapping crops and forest areas. Int Geosci Remote Sens Symp 2006:790–3. 10.1109/IGARSS.2006.203 - DOI
    1. Tintrup F, Natale F De, Giusto D. COMPRESSION ALGORITHMS FOR CLASSIFICATION OF REMOTELY SENSED IMAGES. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 5, 1998, p. 2565–8.

MeSH terms

LinkOut - more resources