DocumentCode
411439
Title
Recurrent neural network with both side input context dependence for text-to-phoneme mapping
Author
Bilcu, Eniko Beatrice ; Astola, Juakko ; Saarinen, Jari
Author_Institution
Inst. of Signal Process., Tampere Univ. of Technol., Finland
fYear
2004
fDate
2004
Firstpage
599
Lastpage
602
Abstract
Among many neural network architectures that exist in the literature, the recurrent neural networks (RNN´s) are of special interest due to their ability to deal with spatial temporal problems. However, in an earlier published paper, the authors shown that RNN´s have poor performance in terms of phoneme accuracy when applied to the specific problem of converting text streams into their phonetic transcriptions. This is due to the fact that RNN´s contains a weak left side context dependence between letters and the right side context dependence is not included. In this paper, we study the behavior of RNN that includes the context information between adjacent letters at the input. The results in terms of phoneme accuracy, for the RNN with both side input context dependence, multilayer perception and RNN, in the context of text-to-phoneme mapping, are shown.
Keywords
multilayer perceptrons; neural net architecture; recurrent neural nets; speech recognition; speech synthesis; text analysis; context information; multilayer perception; neural network architectures; phonetic transcriptions; recurrent neural networks; spatial temporal problems; speech recognition; text-to-phoneme mapping; Automatic speech recognition; Multilayer perceptrons; Neural networks; Neurons; Recurrent neural networks; Signal mapping; Speech processing; Speech recognition; Speech synthesis; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Control, Communications and Signal Processing, 2004. First International Symposium on
Print_ISBN
0-7803-8379-6
Type
conf
DOI
10.1109/ISCCSP.2004.1296463
Filename
1296463
Link To Document