Rate Shaping through Feature Analysis

Abstract: Rate shaping plays an important role in transmitting compressed video on a network with dynamic bandwidth. In this paper, we present a scheme for rate shaping JPEG/MPEG-precompressed image/video with feature analysis in scale space. The proposed scheme draws its strengths from a technique for identifying perceptually important features based on geometry-driven diffusion, an interpolation scheme allowing many intentionally dropped blocks to be recovered faithfully from neighboring transmitted blocks, and some careful observations about the close interactions between block dropping and feature analysis. Compared with a successful rate shaping scheme based on joint block dropping and uniform coefficient truncation, the new scheme not only produces images of significantly higher visual quality, but also better PSNR at low bit rates.

EDICS: IP 1.1

System Overview

Figure 1: The proposed system diagram. The output bit rate of the rate shaper R(Y) should not exceed the bit rate constraint R_{budget}, which could vary with time.

Feature Analysis

Figure 2: The image contours of (a) at a scale is shown in (b). The bottom row is the result of thresholding on importance(C) for every contour C: (c) the remaining contours, and (d) the corresponding image blocks.

Comparison of Different Approaches

Comparison of different rate shaping approaches for ``baboon" image, pre-compressed by JPEG at 1.3 bpp. Press here for the behaviors of block-dropping+nonadaptive coefficient truncation scheme, described in [Zeng and Liu '96]

Press here for the behaviors of block-dropping+adaptive coefficient truncation scheme, described in [Zeng, Guo, and Liu '96]

In these graphs, the upper envelops of the solid lines are the optimal solutions.

JPEG Example

Figure 3: Reconstructed ``baboon" images, pre-compressed by JPEG at 1.3 bpp, and rate-shaped at 0.367 bpp. Top: block-dropping+adaptive coefficient truncation scheme, PSNR=21.66 dB; Bottom: block-dropping+nonadaptive coefficient truncation scheme, PSNR=21.37 dB.

MPEG Example

Figure 5: Reconstructed I frames (luminance only) of the ``Bicycle" sequence, precompressed by MPEG at 1.0 bpp, and rate-shaped at 0.51 bpp. Top: block-dropping+adaptive coefficient truncation scheme, PSNR=26.87 dB; Bottom: block-dropping+nonadaptive coefficient truncation scheme, PSNR=25.54 dB.

People: Wenjun Zeng, Department of Electrical Engineering, Princeton University

Bede Liu, Department of Electrical Engineering, Princeton University

References: Zeng, W. and B. Liu, ``Rate Shaping by Block Dropping for Transmission of MPEG-precoded Video over Channels of Dynamic Bandwidth'', presented at ACM Multimedia '96, Boston, Nov. 1996

Zeng, W., B. Guo, and B. Liu, ``Rate Shaping of Precompressed Video through Feature Analysis in Scale Space'', submitted in Nov. 1996

Acknowledgements:

Many thanks to S. Zhong for insightful discussion on wavelet implementation. This work is supported, in part, by the Canadian National Science and Engineering Research Council (NSERC). The computing facilities used in the research are supported by the National Science Foundation under Grant CDA-9216171.

Last update : Dec. 28, 1996.

If images on this page look dark to you, see the note about gamma correction.

back to the main page