Proceedings Paper

Audio coding based on rate distortion and perceptual optimization
Author(s): Markus Erne; George Moschytz

Paper Abstract

The time-frequency tiling, bit allocation, and quantizer of most perceptual coding algorithms are either fixed or controlled by a perceptual model. The large variety of existing audio signals, each with different coding requirements owing to its temporal and spectral fine structure, suggests the use of a signal-adaptive algorithm. The framework described in this paper uses a signal-adaptive wavelet filterbank in which any node of the wavelet-packet tree can be switched individually. Each subband can therefore have its own time segmentation, and the overall time-frequency tiling can be adapted to the signal using optimization techniques. A rate-distortion optimality criterion can be defined that minimizes the distortion for a given rate in every subband, based on a perceptual model. Because the rate and distortion measures are additive over disjoint covers of the input signal, an overall cost function that includes the cost of filterbank switching can be defined. Using dynamic programming techniques, the wavelet-packet tree can be pruned based on a top-down or bottom-up 'split-merge' decision at every node of the wavelet tree. Additionally, temporal masking can be exploited, because each subband can have an individual segmentation in time, without introducing time-domain artifacts such as pre-echo distortion.
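
The bottom-up 'split-merge' decision mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Lagrangian cost J = D + lambda * R per node (a common way to express "minimum distortion for a given rate"), a binary tree, and a single constant switching cost added whenever a node is split. The names `Node`, `prune`, `lam`, and `switch_cost`, and all rate and distortion values, are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """One node of a (hypothetical) wavelet-packet tree."""
    rate: float                      # bits needed if this node is coded unsplit
    distortion: float                # perceptually weighted distortion if unsplit
    left: Optional["Node"] = None    # child subbands, None for a leaf
    right: Optional["Node"] = None
    split: bool = False              # decision filled in by prune()


def prune(node: Node, lam: float, switch_cost: float = 0.0) -> float:
    """Bottom-up pruning: return the minimal cost of the subtree at `node`.

    Because rate and distortion are assumed additive over disjoint
    subbands, the best cost of a split node is the sum of the best
    costs of its children plus the (assumed constant) switching cost.
    """
    own_cost = node.distortion + lam * node.rate
    if node.left is None or node.right is None:
        node.split = False
        return own_cost

    # Solve the children first, then decide at this node.
    split_cost = (prune(node.left, lam, switch_cost)
                  + prune(node.right, lam, switch_cost)
                  + switch_cost)

    node.split = split_cost < own_cost   # split only if strictly cheaper
    return min(own_cost, split_cost)


# Made-up numbers: splitting pays off only while switching is cheap.
root = Node(rate=10.0, distortion=4.0,
            left=Node(rate=4.0, distortion=1.0),
            right=Node(rate=4.0, distortion=1.5))
cheap = prune(root, lam=0.5, switch_cost=1.0)
cheap_split = root.split
costly = prune(root, lam=0.5, switch_cost=3.0)
costly_split = root.split
```

In this toy case the unsplit root costs 9.0 and the two children together cost 6.5, so the root is split when switching costs 1.0 and merged when it costs 3.0. In practice `lam` would be tuned (for example by bisection) until the total rate meets the target; that search is omitted here.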

Paper Details

Date Published: 5 April 2000
PDF: 12 pages
Proc. SPIE 4056, Wavelet Applications VII, (5 April 2000); doi: 10.1117/12.381685
Markus Erne, Swiss Federal Institute of Technology/Zurich (Switzerland)
George Moschytz, Swiss Federal Institute of Technology/Zurich (Switzerland)

Published in SPIE Proceedings Vol. 4056:
Wavelet Applications VII
Harold H. Szu; Martin Vetterli; William J. Campbell; James R. Buss, Editor(s)
