
Proceedings Paper

Robust real-time audiovisual face detection

Paper Abstract

This paper presents a face detection system that combines audio localization with visual face detection. The system uses sound localization from a microphone array together with image processing algorithms. It integrates sound localization by Time Delay of Arrival with the iterative application of Adaptive Background Segmentation to perform robust real-time face detection on a stream of webcam images. Experimental results using an array of 24 microphones and a fixed-view webcam show that the system achieves a face detection success rate of 97.5%, with a convergence time of 0.82 seconds and a display frame rate of 5.8 Hz, on a 2.5 GHz Pentium IV.
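
To make the Time Delay of Arrival idea in the abstract concrete, the following is a minimal sketch for a single microphone pair. It is not the authors' method: the abstract does not describe how the 24-microphone array is processed. The function names, the brute-force cross-correlation, the far-field assumption, the speed of sound, and the sample rate and microphone spacing in the example are all assumptions made for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value, not from the paper


def estimate_delay_samples(ref, sig, max_lag):
    """Return the lag (in samples) at which sig best matches ref.

    Uses a brute-force cross-correlation over lags in [-max_lag, max_lag].
    A positive result means sig arrives later than ref.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(ref)):
            j = i + lag
            if 0 <= j < len(sig):
                score += ref[i] * sig[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def bearing_from_delay(delay_samples, sample_rate, mic_spacing):
    """Far-field angle of arrival (radians) for one microphone pair."""
    path_diff = delay_samples / sample_rate * SPEED_OF_SOUND
    # Clamp to the valid asin domain in case of noisy delay estimates.
    ratio = max(-1.0, min(1.0, path_diff / mic_spacing))
    return math.asin(ratio)


# Synthetic example: a short pulse reaches the second microphone 3 samples later.
pulse = [0.0] * 20 + [1.0, 0.5, -0.5, -1.0] + [0.0] * 20
delayed = [0.0] * 3 + pulse[:-3]

lag = estimate_delay_samples(pulse, delayed, max_lag=8)
angle = bearing_from_delay(lag, sample_rate=16000, mic_spacing=0.2)
```

In a system like the one described, a direction estimate of this kind could plausibly be used to narrow the image region in which the visual detector searches for a face. How the paper actually fuses the two cues is not stated in the abstract.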

Paper Details

Date Published: 12 April 2004
PDF: 12 pages
Proc. SPIE 5434, Multisensor, Multisource Information Fusion: Architectures, Algorithms, and Applications 2004, (12 April 2004); doi: 10.1117/12.545934
Wei Mark Fang, Univ. of Toronto (Canada)
Parham Aarabi, Univ. of Toronto (Canada)


Published in SPIE Proceedings Vol. 5434:
Multisensor, Multisource Information Fusion: Architectures, Algorithms, and Applications 2004
Belur V. Dasarathy, Editor

© SPIE