Share Email Print

Proceedings Paper

MINMAX rate control with a perceived distortion metric
Author(s): Yegnaswamy Sermadevi; Mark Masry; Sheila S. Hemami
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

This paper presents a Rate-Distortion based optimization procedure for video coding that explicitly incorporates a perceptual distortion measure. A flexible MINMAX (minimizing the maximum distortion) optimization procedure based on Linear Programming (LP) is used to minimize the variation in distortion across frames. The distortion measure utilizes a multi-channel human vision model to generate a map of the perceived distortion at each pixel in a frame. This map is then collapsed into a single frame-level distortion. Lagrangian optimization is employed within each frame to assign macroblock-level quantization step-sizes that minimize this frame-level distortion. Coding results with MPEG-2 show a large reduction in peak perceived distortion with respect to TM5 as well as lower variation in distortion across the frames in the video sequence. Although results are presented for a specific metric, the LP based optimization procedure easily extends to other distortion measures.

Paper Details

Date Published: 18 January 2004
PDF: 12 pages
Proc. SPIE 5308, Visual Communications and Image Processing 2004, (18 January 2004); doi: 10.1117/12.527254
Show Author Affiliations
Yegnaswamy Sermadevi, Cornell Univ. (United States)
Mark Masry, Cornell Univ. (United States)
Sheila S. Hemami, Cornell Univ. (United States)

Published in SPIE Proceedings Vol. 5308:
Visual Communications and Image Processing 2004
Sethuraman Panchanathan; Bhaskaran Vasudev, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?