Share Email Print

Proceedings Paper

Multicomponent FM demodulation of speech based on the short-time Fourier transform (STFT) phase
Author(s): Douglas J. Nelson
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Speech is a signal which is produced as a combination of frication and a quasi periodic train of glottal pulses excites the vocal tract and causes it to resonate. Information is encoded on the signal as the vocal tract changes configuration, resulting in a rapid change of the resonant frequencies. We develop methods, based on differentiation of the short time Fourier transform (STFT) phase, which effectively demodulates the speech signal and produces accurate, high resolution time-frequency estimates of both the resonances and the signal excitation. The method effectively condenses the STFT surface along curves representing the instantaneous frequencies of the vocal tract resonances and the channel group delay function.

Paper Details

Date Published: 26 March 2001
PDF: 12 pages
Proc. SPIE 4391, Wavelet Applications VIII, (26 March 2001); doi: 10.1117/12.421230
Show Author Affiliations
Douglas J. Nelson, U.S. Department of Defense (United States)

Published in SPIE Proceedings Vol. 4391:
Wavelet Applications VIII
Harold H. Szu; David L. Donoho; Adolf W. Lohmann; William J. Campbell; James R. Buss, Editor(s)

© SPIE. Terms of Use
Back to Top