Share Email Print
cover

Proceedings Paper

Blind summarization: content-adaptive video summarization using time-series analysis
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Severe complexity constraints on consumer electronic devices motivate us to investigate general-purpose video summarization techniques that are able to apply a common hardware setup to multiple content genres. On the other hand, we know that high quality summaries can only be produced with domain-specific processing. In this paper, we present a time-series analysis based video summarization technique that provides a general core to which we are able to add small content-specific extensions for each genre. The proposed time-series analysis technique consists of unsupervised clustering of samples taken through sliding windows from the time series of features obtained from the content. We classify content into two broad categories, scripted content such as news and drama, and unscripted content such as sports and surveillance. The summarization problem then reduces to finding either finding semantic boundaries of the scripted content or detecting highlights in the unscripted content. The proposed technique is essentially an event detection technique and is thus best suited to unscripted content, however, we also find applications to scripted content. We thoroughly examine the trade-off between content-neutral and content-specific processing for effective summarization for a number of genres, and find that our core technique enables us to minimize the complexity of the content-specific processing and to postpone it to the final stage. We achieve the best results with unscripted content such as sports and surveillance video in terms of quality of summaries and minimizing content-specific processing. For other genres such as drama, we find that more content-specific processing is required. We also find that judicious choice of key audio-visual object detectors enables us to minimize the complexity of the content-specific processing while maintaining its applicability to a broad range of genres. We will present a demonstration of our proposed technique at the conference.

Paper Details

Date Published: 16 January 2006
PDF: 5 pages
Proc. SPIE 6073, Multimedia Content Analysis, Management, and Retrieval 2006, 607302 (16 January 2006); doi: 10.1117/12.648419
Show Author Affiliations
Ajay Divakaran, Mitsubishi Electric Research Labs. (United States)
Regunathan Radhakrishnan, Mitsubishi Electric Research Labs. (United States)
Kadir A. Peker, Mitsubishi Electric Research Labs. (United States)


Published in SPIE Proceedings Vol. 6073:
Multimedia Content Analysis, Management, and Retrieval 2006
Edward Y. Chang; Alan Hanjalic; Nicu Sebe, Editor(s)

© SPIE. Terms of Use
Back to Top