Share Email Print

Proceedings Paper

Quality of Arabic utterances transformed using different residual prediction techniques
Author(s): Rania Elmanfaloty; N. Korany; El-Sayed A. Youssef
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

Voice conversion (VC) is a process which modifies the speech signal produced by one source speaker so that it sounds like another target speaker. In this paper the transformation is determined by using equal Arabic utterances from source and target speakers; these utterances are time-aligned using dynamic time warping algorithm. A conversion function based on Gaussian mixture model (GMM) is used for transforming the spectral envelope described by line spectral frequencies (LSF) and the residuals are converted using three residual prediction techniques. We also compare between these techniques in the conversion of some Arabic utterances. The quality of the transformed utterances is measured using subjective and objective evaluations.

Paper Details

Date Published: 1 October 2011
PDF: 7 pages
Proc. SPIE 8285, International Conference on Graphic and Image Processing (ICGIP 2011), 82854C (1 October 2011); doi: 10.1117/12.913264
Show Author Affiliations
Rania Elmanfaloty, Alexandria Univ. (Egypt)
N. Korany, Alexandria Univ. (Egypt)
El-Sayed A. Youssef, Alexandria Univ. (Egypt)

Published in SPIE Proceedings Vol. 8285:
International Conference on Graphic and Image Processing (ICGIP 2011)
Yi Xie; Yanjun Zheng, Editor(s)

© SPIE. Terms of Use
Back to Top