Title :
New rule-based and data-driven strategy to incorporate Fujisaki's F0 model to a text-to-speech system in Castillian Spanish
Author :
Gutiérrez-Arriola, J.M. ; Montero, J.M. ; Saiz, D. ; Pardo, J.M.
Author_Institution :
Dept. de Ingenieria Electron., Univ. Politecnica de Madrid, Spain
Abstract :
We present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's (1981) model for F0 contours. These parameters are classified according to linguistic features, and together they form the analysis database. When synthesizing F0 contours, we extract the linguistic features from the text and perform a k-nearest neighbour search. The distance used to compare linguistic features is trained using data from the prosody database. To avoid artifacts, we apply rule-based filtering to the synthesis parameters. The results of our evaluation test show that the proposed system is significantly better than the previous neural network approach. This evaluation confirms the ability of Fujisaki's model to represent prosody information based on linguistic features.
Keywords :
knowledge based systems; parameter estimation; search problems; speech synthesis; Castillian Spanish; Fujisaki F0 model; Spanish prosody database; analysis database; k-nearest neighbour search; linguistic feature comparison; linguistic features; rule-base filtering; rule-based data-driven strategy; text-to-speech system; Circuits; Control systems; Damping; Data analysis; Data mining; Feature extraction; Neural networks; Parameter estimation; Spatial databases; Speech synthesis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941041