Share Email Print
cover

Proceedings Paper

Audio-visual gender recognition
Author(s): Ming Liu; Xun Xu; Thomas S. Huang
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Combining different modalities for pattern recognition task is a very promising field. Basically, human always fuse information from different modalities to recognize object and perform inference, etc. Audio-Visual gender recognition is one of the most common task in human social communication. Human can identify the gender by facial appearance, by speech and also by body gait. Indeed, human gender recognition is a multi-modal data acquisition and processing procedure. However, computational multimodal gender recognition has not been extensively investigated in the literature. In this paper, speech and facial image are fused to perform a mutli-modal gender recognition for exploring the improvement of combining different modalities.

Paper Details

Date Published: 15 November 2007
PDF: 5 pages
Proc. SPIE 6788, MIPPR 2007: Pattern Recognition and Computer Vision, 678803 (15 November 2007); doi: 10.1117/12.774687
Show Author Affiliations
Ming Liu, Univ. of Illinois at Urbana-Champaign (United States)
Xun Xu, Univ. of Illinois at Urbana-Champaign (United States)
Thomas S. Huang, Univ. of Illinois at Urbana-Champaign (United States)


Published in SPIE Proceedings Vol. 6788:
MIPPR 2007: Pattern Recognition and Computer Vision

© SPIE. Terms of Use
Back to Top