Share Email Print

Proceedings Paper

Multiple-reference-picture video coding using polynomial motion models
Author(s): Thomas Wiegand; Eckehard G. Steinbach; Axel Stensrud; Bernd Girod
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We present a new video coding scheme that uses several references frames for improved motion-compensated prediction. The reference pictures are warped versions of the previously decoded frame applying polynomial motion compensation. In contrast to global motion compensation, where typically one motion model is transmitted, we show that in the general case more than one motion model is of benefit in terms of coding efficiency. In order to determine the multiple motion models we employ a robust clustering method based on the iterative application of the least median of squares estimator. The approach is incorporated into an H-263-based video codec and embedded into a rate- constrained motion estimation and macroblock mode decision frame work. It is demonstrated that adaptive multiple reference picture coding in general improves rate-distortion performance. PSNR gains of 1.2 dB in comparison to the H-263 codec for the high global and local motion sequence Stefan and 1 dB for the sequence Mobile and Calendar, which contains no global motion, are reported. These PSNR gains correspond to bit-rate savings of 21 percent and 30 percent comparing to the H-263 codec, respectively. The average number of motion models selected by the encoder for our test sequences is between 1 and 7 depending on the actual bit- rate.

Paper Details

Date Published: 9 January 1998
PDF: 12 pages
Proc. SPIE 3309, Visual Communications and Image Processing '98, (9 January 1998); doi: 10.1117/12.298323
Show Author Affiliations
Thomas Wiegand, Univ. Erlangen-Nuernberg (Germany)
Eckehard G. Steinbach, Univ. Erlangen-Nuernberg (United States)
Axel Stensrud, Univ. Erlangen-Nuernberg (Germany)
Bernd Girod, Univ. Erlangen-Nuernberg (United States)

Published in SPIE Proceedings Vol. 3309:
Visual Communications and Image Processing '98
Sarah A. Rajala; Majid Rabbani, Editor(s)

© SPIE. Terms of Use
Back to Top