DocumentCode :
1909295
Title :
Selection of important input parameters for a text-to-speech synthesis by neural networks
Author :
Sebesta, Vaclav ; Tuckova, Jana
Author_Institution :
Inst. of Comput. Sci., Czechoslovak Acad. of Sci., Czech Republic
Volume :
5
fYear :
1999
fDate :
1999
Firstpage :
2965
Abstract :
Speech signal synthesis in real time, with an unlimited vocabulary is very complicated for all languages. These synthesizers usually work in the frequency domain and the fundamental frequency contours F0 must be determined for all phonemes or diphones by conventional equipment based on the linguistic rules. Our effort is to minimize the difference between the synthetic speech of the synthesizer, which is usually more monotonous and natural speech of people. Because of this, a special building block is included into the synthesizer for prosody control. The multilayer artificial neural network (ANN) is used for prosody control in our case. In this part of the synthesizer the fundamental frequency is “a little bit” modified in such a way so that speech can sound as natural as possible. The number of input training parameters for ANN training must be generally kept as small as possible because of the optimal generalization ability of the network. An original method for the determination of the most important input parameters for the training of ANN for prosody control is described in the paper. The method is based on the data mining from the database of the training patterns by the GUHA method
Keywords :
learning (artificial intelligence); multilayer perceptrons; speech synthesis; GUHA method; diphones; fundamental frequency; important input parameters; linguistic rules; optimal generalization ability; phonemes; prosody control; speech signal synthesis; text-to-speech synthesis; training patterns; unlimited vocabulary; Artificial neural networks; Data mining; Databases; Frequency domain analysis; Frequency synthesizers; Multi-layer neural network; Natural languages; Signal synthesis; Speech synthesis; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
ISSN :
1098-7576
Print_ISBN :
0-7803-5529-6
Type :
conf
DOI :
10.1109/IJCNN.1999.835992
Filename :
835992
Link To Document :
بازگشت