Share Email Print

Proceedings Paper

TRECVID: the utility of a content-based video retrieval evaluation
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

TRECVID, an annual retrieval evaluation benchmark organized by NIST, encourages research in information retrieval from digital video. TRECVID benchmarking covers both interactive and manual searching by end users, as well as the benchmarking of some supporting technologies including shot boundary detection, extraction of semantic features, and the automatic segmentation of TV news broadcasts. Evaluations done in the context of the TRECVID benchmarks show that generally, speech transcripts and annotations provide the single most important clue for successful retrieval. However, automatically finding the individual images is still a tremendous and unsolved challenge. The evaluations repeatedly found that none of the multimedia analysis and retrieval techniques provide a significant benefit over retrieval using only textual information such as from automatic speech recognition transcripts or closed captions. In interactive systems, we do find significant differences among the top systems, indicating that interfaces can make a huge difference for effective video/image search. For interactive tasks efficient interfaces require few key clicks, but display large numbers of images for visual inspection by the user. The text search finds the right context region in the video in general, but to select specific relevant images we need good interfaces to easily browse the storyboard pictures. In general, TRECVID has motivated the video retrieval community to be honest about what we don't know how to do well (sometimes through painful failures), and has focused us to work on the actual task of video retrieval, as opposed to flashy demos based on technological capabilities.

Paper Details

Date Published: 16 January 2006
PDF: 8 pages
Proc. SPIE 6061, Internet Imaging VII, 606107 (16 January 2006); doi: 10.1117/12.660261
Show Author Affiliations
Alexander G. Hauptmann, Carnegie Mellon Univ. (United States)

Published in SPIE Proceedings Vol. 6061:
Internet Imaging VII
Simone Santini; Raimondo Schettini; Theo Gevers, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?