Share Email Print

Proceedings Paper

An efficient video shot representation for fast video retrieval
Author(s): Cheng Cai; Kin-Man Lam; Zheng Tan
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

For video retrieval, a video is partitioned into a group of shots, which are then represented by either key frames or video shot representations. An optimal representation of a shot should include all the information about the frames concerned. In this paper, we propose an efficient representation scheme for a shot, which considers both the spatial frequency contents and the temporal statistics of the frames for video retrieval. In our scheme, each frame in a video shot is transformed into the frequency domain using the discrete cosine transform (DCT), and a number of values at each frequency are selected based on their probability of occurrence. This representation scheme allows retrieval to be carried out hierarchically, i.e. from low-frequency to high-frequency components. Experimental results show that our proposed scheme outperforms the alpha-trimmed average histogram method in terms of retrieval accuracy.

Paper Details

Date Published: 31 July 2006
PDF: 9 pages
Proc. SPIE 5960, Visual Communications and Image Processing 2005, 59600P (31 July 2006); doi: 10.1117/12.631564
Show Author Affiliations
Cheng Cai, The Hong Kong Polytechnic Univ. (Hong Kong China)
Xi'an Jiaotong Univ. (China)
Kin-Man Lam, The Hong Kong Polytechnic Univ. (Hong Kong China)
Zheng Tan, Xi'an Jiaotong Univ. (China)

Published in SPIE Proceedings Vol. 5960:
Visual Communications and Image Processing 2005
Shipeng Li; Fernando Pereira; Heung-Yeung Shum; Andrew G. Tescher, Editor(s)

© SPIE. Terms of Use
Back to Top