DocumentCode
698821
Title
Comparative study of letter encoding for text-to-phoneme mapping
Author
Bilcu, Eniko Beatrice ; Astola, Jaakko ; Saarinen, Jukka
Author_Institution
Inst. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
Text-to-phoneme mapping is a very important preliminary step in any text-to-speech synthesis system. In this paper, we study the performances of the multilayer perceptron (MLP) neural network for the problem of text-to-phoneme mapping. Specifically, we study the influence of the input letter encoding in the conversion accuracy of such system. We show, that for large network complexities the orthogonal binary codes (as introduced in NetTalk) gives better performance. On the other hand in applications that require very small memory load and computational complexity other compact codes may be more suitable. This study is a first step toward implementation a neural network based text-to-phoneme mapping in mobile devices.
Keywords
binary codes; computational complexity; multilayer perceptrons; orthogonal codes; smart phones; speech coding; speech synthesis; MLP; computational complexity; input letter encoding; mobile device; multilayer perceptron neural network; network complexity; orthogonal binary codes; text-to-phoneme mapping; text-to-speech synthesis system; Accuracy; Artificial neural networks; Biological neural networks; Discrete cosine transforms; Neurons; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078415
Link To Document