Title :
A study on different neural network architectures applied to text-to-phoneme mapping
Author :
E.B. Bilcu;J. Suontausta;J. Saarinen
Author_Institution :
Digital Comput. Syst. Lab., Tampere Univ. of Technol., Finland
fDate :
6/25/1905 12:00:00 AM
Abstract :
In this paper, the problem of text-to-phoneme mapping of isolated words for the English language is studied. Multilayer perceptron, recurrent and bidirectional recurrent neural network architectures are compared in the text-to-phoneme mapping task. Multilayer perceptron neural networks utilize contextual information due to the orthography of a word. In our study, recurrent and bidirectional recurrent neural networks, on the other hand, do not take the letter context into account. Instead, these networks utilize the contextual information due to the previously transcribed phonemes as introduced by the feedback loop in the networks.
Keywords :
"Neural networks","Dictionaries","Recurrent neural networks","Speech synthesis","Decision trees","Multi-layer neural network","Automatic speech recognition","Testing","Laboratories","Multilayer perceptrons"
Conference_Titel :
Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the 3rd International Symposium on
Print_ISBN :
953-184-061-X
DOI :
10.1109/ISPA.2003.1296405