Proceedings Paper

Video to the rescue of audio: shot boundary assisted speaker change detection

Paper Abstract

Speaker change detection (SCD) is a preliminary step for many audio applications, such as speaker segmentation and recognition, so its robustness is crucial for good performance in the later steps. Misses (false negatives) in particular degrade the results. For some applications, domain-specific characteristics can be used to improve the reliability of SCD. In broadcast news and discussions, the co-occurrence of shot boundaries and speaker change points provides a robust cue for speaker changes. This paper presents two multimodal approaches that use the results of a shot boundary detection (SBD) step to improve the robustness of SCD. Both approaches clearly outperform the audio-only approach; they are applicable only to TV broadcast news and plenary discussions.

Paper Details

Date Published: 29 January 2007
PDF: 8 pages
Proc. SPIE 6506, Multimedia Content Access: Algorithms and Systems, 650609 (29 January 2007); doi: 10.1117/12.703114
Amjad Samour, Technical Univ. of Berlin (Germany)
Mustafa Karaman, Technical Univ. of Berlin (Germany)
Lutz Goldmann, Technical Univ. of Berlin (Germany)
Thomas Sikora, Technical Univ. of Berlin (Germany)

Published in SPIE Proceedings Vol. 6506:
Multimedia Content Access: Algorithms and Systems
Alan Hanjalic; Raimondo Schettini; Nicu Sebe, Editor(s)