
Proceedings Paper

Third- and first-party ground truth collection for auto key frame extraction from consumer video clips

Paper Abstract

Extracting key frames (KF) from video is of great interest in many applications, such as video summary, video organization, video compression, and prints from video. KF extraction is not a new problem. However, the current literature has focused mainly on sports or news video. In the consumer video space, the biggest challenges for key frame selection are the unconstrained content and the lack of any preimposed structure. In this study, we collect ground truth key frames from video clips taken by digital cameras (as opposed to camcorders) using both first- and third-party judges. The goals of this study are: (1) to create a reference database of video clips reasonably representative of the consumer video space; (2) to identify associated key frames against which automated algorithms can be compared and judged for effectiveness; and (3) to uncover the criteria used by both first- and third-party human judges so that these criteria can influence algorithm design. The findings from these ground truths will be discussed.

Paper Details

Date Published: 12 February 2007
PDF: 10 pages
Proc. SPIE 6492, Human Vision and Electronic Imaging XII, 64921N (12 February 2007); doi: 10.1117/12.707534
Kathleen Costello, Eastman Kodak Co. (United States)
Jiebo Luo, Eastman Kodak Co. (United States)

Published in SPIE Proceedings Vol. 6492:
Human Vision and Electronic Imaging XII
Bernice E. Rogowitz; Thrasyvoulos N. Pappas; Scott J. Daly, Editor(s)

© SPIE.