Proceedings Paper

Comparison of weighting strategies in early and late fusion approaches to audio-visual person authentication

Paper Abstract

Person authentication can be strongly enhanced by combining different modalities. This also holds for face and voice signals, which can be obtained with minimal inconvenience for the user. However, features from each modality can be combined at different levels of processing, and for face and voice signals the advantage of fusion depends strongly on the way they are combined. The aim of the work presented is to investigate the optimal strategy for combining voice and face modalities for signals of varying quality. The experimental data are taken from a newly acquired database, recorded using a PDA, which contains audio-visual recordings in different conditions. Voice features use mel-frequency cepstral coefficients, while the face signal is parameterised using wavelet coefficients in certain subbands. Results are presented for both early (feature-level) and late (score-level) fusion. At each level, different fixed and variable weightings are used, both to weight between frames within each modality and to weight between modalities. The weights are based on some measure of signal reliability, such as the accuracy of automatic face detection or the audio signal-to-noise ratio. In addition, the contribution to authentication of information from different areas of the face is explored to determine a regional weighting for the face coefficients.
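The distinction between the two fusion levels, and the idea of a reliability-driven weight, can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not give the weighting formulas, so the linear mapping from signal-to-noise ratio to a reliability value, the 0 to 30 dB range, the normalisation of the two reliabilities into one weight, and all function names (`modality_weight`, `late_fusion`, `early_fusion`) are assumptions made purely for illustration.

```python
from typing import List, Sequence


def modality_weight(snr_db: float, face_conf: float,
                    snr_lo: float = 0.0, snr_hi: float = 30.0) -> float:
    """Return a voice weight in [0, 1] from two reliability measures.

    Hypothetical mapping: the audio SNR is rescaled linearly to [0, 1]
    over an assumed range, the face-detection confidence is clipped to
    [0, 1], and the voice weight is the audio share of the total.
    """
    audio_rel = min(max((snr_db - snr_lo) / (snr_hi - snr_lo), 0.0), 1.0)
    face_rel = min(max(face_conf, 0.0), 1.0)
    total = audio_rel + face_rel
    # With no reliable evidence from either modality, weight them equally.
    return 0.5 if total == 0.0 else audio_rel / total


def late_fusion(voice_score: float, face_score: float, w_voice: float) -> float:
    """Score-level fusion: weighted sum of the per-modality match scores."""
    return w_voice * voice_score + (1.0 - w_voice) * face_score


def early_fusion(voice_feat: Sequence[float], face_feat: Sequence[float],
                 w_voice: float) -> List[float]:
    """Feature-level fusion: scale each feature vector, then concatenate."""
    return ([w_voice * v for v in voice_feat]
            + [(1.0 - w_voice) * f for f in face_feat])


if __name__ == "__main__":
    w = modality_weight(snr_db=15.0, face_conf=0.5)
    print(w)
    print(late_fusion(voice_score=1.0, face_score=0.0, w_voice=w))
    print(early_fusion([1.0, 2.0], [4.0], w_voice=w))
```

In this sketch a fixed weighting corresponds to passing a constant `w_voice`, while a variable weighting recomputes it per recording (or per frame) from the reliability measures. In late fusion the weight acts on classifier scores, whereas in early fusion it rescales the features before a single classifier sees them.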

Paper Details

Date Published: 2 May 2006
PDF: 12 pages
Proc. SPIE 6250, Mobile Multimedia/Image Processing for Military and Security Applications, 62500C (2 May 2006); doi: 10.1117/12.667214
Harin Sellahewa, Univ. of Buckingham (United Kingdom)
Naseer Al-Jawad, Univ. of Buckingham (United Kingdom)
Andrew C. Morris, Saarland Univ. (Germany)
Dalei Wu, Saarland Univ. (Germany)
Jacques Koreman, Saarland Univ. (Germany)
Sabah A. Jassim, Univ. of Buckingham (United Kingdom)

Published in SPIE Proceedings Vol. 6250:
Mobile Multimedia/Image Processing for Military and Security Applications
Sos S. Agaian; Sabah A. Jassim, Editor(s)
