Share Email Print
cover

Proceedings Paper

Preparation of sound base for a text-to-speech synthesis system
Author(s): Vladimir M. Degtyarev; Mikhail N. Gusev
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We are giving several recommendations for the choice of parameters of the sound fragments in this report. The sound fragments are components of the sound base, used in Russian speech synthesis system by a text. It isn't the secret that quality of concatenation synthesis in many respects is defined at the stage of a speaker choice and preparation of base of speaker's voice samples. Formulated recommendations are received on the basis of the statistic analysis of big amount of various types of texts and concern both separate sound fragments and their groups. Parameters of sounds were taken with the help of the automatic linguistic processor including phonetic and prosodic transcriptors. The duration, intensity and main pitch frequency of sounds in various contexts and intonational contours were analyzed. The sound base produced according to the worked out recommendations, allows to make better intelligibility and naturalness of synthetic speech due to minimization of changes of speaker's voice samples.

Paper Details

Date Published: 29 April 2005
PDF: 7 pages
Proc. SPIE 5831, Eighth International Workshop on Nondestructive Testing and Computer Simulations in Science and Engineering, (29 April 2005); doi: 10.1117/12.619703
Show Author Affiliations
Vladimir M. Degtyarev, St. Petersburg State Telecommunications Univ. (Russia)
Mikhail N. Gusev, St. Petersburg State Telecommunications Univ. (Russia)


Published in SPIE Proceedings Vol. 5831:
Eighth International Workshop on Nondestructive Testing and Computer Simulations in Science and Engineering
Alexander I. Melker, Editor(s)

© SPIE. Terms of Use
Back to Top