Share Email Print

Proceedings Paper

Vision-based speaker location detection
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Generally, speaker location detection in video conferencing is audio-based. However, physical room environment which is beyond the control of the speaker detection system can severely change room acoustics. Room acoustics introduce interference and can deteriorate the performance of audio-based speaker detection system. In this paper, we propose a video-based speaker detection method which can be used independently or along with audio-based detection systems. The information on speaker location is intended to create 3-dimensional audio reproduction in order to provide more reality to video conference. In the proposed ethod, we detect moving lips in video sequences. We first detect lips using color information and determine whether the lips are moving. Experiments with real videos provide promising results.

Paper Details

Date Published: 14 March 2005
PDF: 8 pages
Proc. SPIE 5685, Image and Video Communications and Processing 2005, (14 March 2005); doi: 10.1117/12.587326
Show Author Affiliations
Jaehyun Lim, Yonsei Univ. (South Korea)
Jonggeun Park, Yonsei Univ. (South Korea)
Chulhee Lee, Yonsei Univ. (South Korea)

Published in SPIE Proceedings Vol. 5685:
Image and Video Communications and Processing 2005
Amir Said; John G. Apostolopoulos, Editor(s)

© SPIE. Terms of Use
Back to Top