Proceedings PaperSynchrosqueezed representation yields a new reading of the wavelet transform
|Format||Member Price||Non-Member Price|
|GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free.||Check Access|
In automatic speech or speaker recognition, the features used for classification are usually derived from a power spectrum representation. Only a few models present a spectral representation derived from a phase analysis of the signal. This paper presents a new transformation of the time-scale plane, obtained by a quasi-continuous wavelet transform, into a time-frequency plane: the synchrosqueezed plane representation. This analysis is `phase- oriented' and it formalizes naturally the IFD, SBS, and EIH models which are algorithmic representations for a speech signal derived from auditory nerve models. The representation is applied for closed-set speaker identification in the context of the modulation model. This leads to the introduction of a `phase-oriented' cepstral parameter: the wastrum. Experimental results are presented on KING database. The analysis is inverted to extract the primary components of a speech signal in the framework of the modulation model. Extensions to other time- frequency analyses are also discussed.