Share Email Print

Proceedings Paper

Video retrieval using speech and image information
Author(s): Alexander G. Hauptmann; Rong Jin; Tobun D. Ng
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Video contains multiple types of audio and visual information, which are difficult to extract, combine or trade-off in general video information retrieval. This paper provides an evaluation on the effects of different types of information used for video retrieval from a video collection. A number of different sources of information are present in most typical broadcast video collections and can be exploited for information retrieval. We will discuss the contributions of automatically recognized speech transcripts, image similarity matching, face detection and video OCR in the contexts of experiments performed as part of 2001 TREC Video Retrieval Track evaluation performed by the National Institute of Standards and Technology. For the queries used in this evaluation, image matching and video OCR proved to be the deciding aspects of video information retrieval.

Paper Details

Date Published: 10 January 2003
PDF: 12 pages
Proc. SPIE 5021, Storage and Retrieval for Media Databases 2003, (10 January 2003); doi: 10.1117/12.479747
Show Author Affiliations
Alexander G. Hauptmann, Carnegie Mellon Univ. (United States)
Rong Jin, Carnegie Mellon Univ. (United States)
Tobun D. Ng, Carnegie Mellon Univ. (United States)

Published in SPIE Proceedings Vol. 5021:
Storage and Retrieval for Media Databases 2003
Minerva M. Yeung; Rainer W. Lienhart; Chung-Sheng Li, Editor(s)

© SPIE. Terms of Use
Back to Top