Proceedings Paper

Generalized dimensions applied to speaker identification
Author(s): Limin Hou; Shuozhong Wang

Paper Abstract

This paper describes an application of fractal dimensions to speech processing and speaker identification. Several dimensions can be used to characterize speech signals, such as the box dimension and the correlation dimension. We are mainly concerned with the generalized dimensions of speech signals, as they provide more information than any individual dimension. Generalized dimensions of arbitrary order are used for speaker identification in this work. Based on the experimental data, an artificial phase space is generated, and smooth behavior of the correlation integral is obtained in a straightforward and accurate analysis. Using the dimension D(2) derived from the correlation integral, the generalized dimension D(q) of arbitrary order q is calculated. Experiments applying the generalized dimensions to speaker identification have also been carried out using PKU-SRSC, a Chinese-language speech corpus dedicated to speaker recognition and recorded by Peking University. The results are compared with a baseline speaker identification system that uses MFCC features. The experimental results indicate the usefulness of fractal dimensions in characterizing a speaker's identity.
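As a rough illustration of the phase-space and correlation-integral steps mentioned in the abstract, the following sketch uses the standard textbook approach: a time-delay embedding followed by a Grassberger-Procaccia style estimate of D(2) as the slope of log C(r) against log r. It is not the authors' implementation. The function names, the embedding dimension, the delay, and the radius range are all assumptions chosen for the example, and the test signal is a synthetic sine wave rather than speech. The paper's step from D(2) to D(q) is not described in the abstract and is not reproduced here.

```python
import numpy as np


def delay_embed(x, dim, tau):
    """Time-delay embedding: row i is (x[i], x[i+tau], ..., x[i+(dim-1)*tau])."""
    x = np.asarray(x, dtype=float)
    n = len(x) - (dim - 1) * tau
    if n <= 0:
        raise ValueError("signal too short for this dim and tau")
    return np.stack([x[k * tau : k * tau + n] for k in range(dim)], axis=1)


def correlation_integral(points, radii):
    """C(r): fraction of distinct point pairs whose Euclidean distance is below r."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    pair_dist = dist[np.triu_indices(len(points), k=1)]  # each pair counted once
    return np.array([(pair_dist < r).mean() for r in radii])


def correlation_dimension(points, radii):
    """Estimate D(2) as the least-squares slope of log C(r) versus log r."""
    radii = np.asarray(radii, dtype=float)
    c = correlation_integral(points, radii)
    mask = c > 0  # log is undefined where no pairs fall within r
    slope, _intercept = np.polyfit(np.log(radii[mask]), np.log(c[mask]), 1)
    return float(slope)


# Hypothetical test signal (not speech): a sine wave, whose 2-D delay
# embedding traces a closed curve, so the estimate should be close to 1.
signal = np.sin(0.37 * np.arange(1000))
points = delay_embed(signal, dim=2, tau=4)
radii = np.logspace(np.log10(0.05), np.log10(0.5), 10)
d2 = correlation_dimension(points, radii)
```

In practice the radius range has to lie inside the scaling region where log C(r) is close to linear in log r; the range used above is only suitable for this synthetic signal. The all-pairs distance matrix also grows quadratically with the number of points, so longer signals would need subsampling or a neighbor-search structure.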

Paper Details

Date Published: 25 August 2004
PDF: 6 pages
Proc. SPIE 5404, Biometric Technology for Human Identification (25 August 2004)
Limin Hou, Shanghai Univ. (China)
Shuozhong Wang, Shanghai Univ. (China)

Published in SPIE Proceedings Vol. 5404:
Biometric Technology for Human Identification
Anil K. Jain; Nalini K. Ratha, Editor(s)
