Proceedings Paper

Dialog detection in narrative video by shot and face analysis
Author(s): B. Kroon; J. Nesvadba; A. Hanjalic

Paper Abstract

The proliferation of captured personal and broadcast content in personal consumer archives necessitates convenient access to stored audiovisual content. Intuitive retrieval and navigation solutions, however, require a semantic level that cannot be reached by generic multimedia content analysis alone. Fusion with film grammar rules can significantly improve reliability. This paper describes the fusion of low-level content analysis cues, including face parameters and inter-shot similarities, to segment commercial content into entities based on film grammar rules and subsequently to classify those sequences into so-called shot reverse shots, i.e., dialog sequences. Moreover, mid-level cues specific to shot reverse shots are analyzed, augmenting the shot reverse shot information with dialog-specific descriptions.

Paper Details

Date Published: 29 January 2007
PDF: 10 pages
Proc. SPIE 6506, Multimedia Content Access: Algorithms and Systems, 65060K (29 January 2007); doi: 10.1117/12.707364
Author Affiliations
B. Kroon, Philips Research Europe (Netherlands) and Delft Univ. of Technology (Netherlands)
J. Nesvadba, Philips Research Europe (Netherlands)
A. Hanjalic, Delft Univ. of Technology (Netherlands)

Published in SPIE Proceedings Vol. 6506:
Multimedia Content Access: Algorithms and Systems
Alan Hanjalic; Raimondo Schettini; Nicu Sebe, Editor(s)

© SPIE