Title :
Duration modeling in a restricted-domain female-voice synthesis in Spanish using neural networks
Author :
Córdoba, R. ; Montero, J.M. ; Gutierrez-Arriola, J. ; Pardo, J.M.
Author_Institution :
Dept. de Ingenieria Electron., Univ. Politecnica de Madrid, Spain
Abstract :
The objective of this paper is the accurate prediction of segmental duration in a Spanish text-to-speech system. There are many parameters that affect duration, but not all of them are always relevant. We present a complete environment in which to decide which parameters are more relevant and the best way to code them. This work is the continuation of Cordoba et al. (1999), where all efforts were dedicated to an unrestricted-domain database for a male voice. In this case, we are considering a female voice in a restricted-domain environment. This restricted-domain offers several advantages to the modeling: the variation in the different patterns is reduced, and so most of the decisions we have made about the parameters are now based in more significant results. So, the conclusions that we present now show clearly which parameters are best. The system is based in a neural network absolutely configurable
Keywords :
neural nets; speech synthesis; Spanish text-to-speech system; duration modeling; neural networks; restricted-domain environment; restricted-domain female voice synthesis; segmental duration; Databases; Decision trees; Elasticity; Intelligent networks; Network synthesis; Neural networks; Predictive models; Speech synthesis; Stress; Telecommunications;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941034