Share Email Print

Proceedings Paper

Computationally efficient operational rate-distortion optimal SNR scalable codec
Author(s): Lisimachos P. Kondi
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We present a new scalable video codec which features a computationally efficient decoder as well as an operational rate-distortion optimal encoder. The SNR scalability is accomplished by splitting the Discrete Cosine Transform (DCT) coefficients of the Displaced Frame Difference (DFD) into groups that correspond to the scalable layers. Our Operational Rate-Distortion optimal scalable codec partitions the DCT coefficients of the DFD (or the intensity for intra blocks) into a base layer and one or more enhancement layers. The base layer is constructed by subtracting a value from each quantized DCT coefficient. The subtracted values are then sent as enhancement. If more than two scalable layers are required, the values subtracted for the creation of the base layer are further broken into other values. The partitioning of the DCT coefficients into layers is accomplished by formulating a constrained optimization problem which is then solved using lagrangian optimization. A Dynamic Programming (DP) solution is proposed in order to minimize the Lagrangian cost in a computationally efficient manner. We introduce a new method for finding the required Lagrangian multiplier λ which will meet our target bitrate. The method further reduces the computational complexity of the encoder by minimizing the need for an iterative calculation of the Lagrangian multiplier. Our experimental results show that the proposed codec typically outperforms H.263+ SNR scalability in terms of PSNR while exhibiting a low-complexity decoder. Due to the use of Dynamic Programming and a new method for the estimation of λ, the computational complexity of the operational rate-distortion optimal encoder is competitive with H.263+.

Paper Details

Date Published: 7 May 2003
PDF: 9 pages
Proc. SPIE 5022, Image and Video Communications and Processing 2003, (7 May 2003); doi: 10.1117/12.476450
Show Author Affiliations
Lisimachos P. Kondi, Univ. at Buffalo (United States)

Published in SPIE Proceedings Vol. 5022:
Image and Video Communications and Processing 2003
Bhaskaran Vasudev; T. Russell Hsing; Andrew G. Tescher; Touradj Ebrahimi, Editor(s)

© SPIE. Terms of Use
Back to Top