Share Email Print

Proceedings Paper

Current speaker detection system using lip motion information
Author(s): Heak-bong Kwon; Young-jun Song; Un-dong Chang; Jae-hyeong Ahn
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We propose a system that detects the current speaker in multi-speaker videoconferencing by using lip motion. First, the system detects the face and lip region of each of the candidate speakers using face color and shape information. Then, to detect the current speaker, it calculates the change between the current frame and the previous frame in lip region. To close-up the detected current speaker, we used two CCD cameras. One is a general CCD camera, the other is a PTZ camera controlled by RS-232C serial port. The experimental result is the proposed system capable of detecting the face of current speaker in a video feed with more than three people, regardless of orientation of the faces. With this system, it only takes 4 to 5 seconds to zoom in on the speaker from the initial reference image. Also, it is a more efficient image transmission system for such things as video conferencing and internet broadcasting because it offers a close up face image at a resolution of 320x240, while at the same time providing a whole background image.

Paper Details

Date Published: 1 March 2005
PDF: 8 pages
Proc. SPIE 5672, Image Processing: Algorithms and Systems IV, (1 March 2005); doi: 10.1117/12.587578
Show Author Affiliations
Heak-bong Kwon, Kimpo College (South Korea)
Young-jun Song, Chungbuk National Univ. (South Korea)
Un-dong Chang, Chungbuk National Univ. (South Korea)
Jae-hyeong Ahn, Chungbuk National Univ. (South Korea)

Published in SPIE Proceedings Vol. 5672:
Image Processing: Algorithms and Systems IV
Edward R. Dougherty; Jaakko T. Astola; Karen O. Egiazarian, Editor(s)

© SPIE. Terms of Use
Back to Top