• DocumentCode
    698821
  • Title

    Comparative study of letter encoding for text-to-phoneme mapping

  • Author

    Bilcu, Eniko Beatrice ; Astola, Jaakko ; Saarinen, Jukka

  • Author_Institution
    Inst. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Text-to-phoneme mapping is a very important preliminary step in any text-to-speech synthesis system. In this paper, we study the performances of the multilayer perceptron (MLP) neural network for the problem of text-to-phoneme mapping. Specifically, we study the influence of the input letter encoding in the conversion accuracy of such system. We show, that for large network complexities the orthogonal binary codes (as introduced in NetTalk) gives better performance. On the other hand in applications that require very small memory load and computational complexity other compact codes may be more suitable. This study is a first step toward implementation a neural network based text-to-phoneme mapping in mobile devices.
  • Keywords
    binary codes; computational complexity; multilayer perceptrons; orthogonal codes; smart phones; speech coding; speech synthesis; MLP; computational complexity; input letter encoding; mobile device; multilayer perceptron neural network; network complexity; orthogonal binary codes; text-to-phoneme mapping; text-to-speech synthesis system; Accuracy; Artificial neural networks; Biological neural networks; Discrete cosine transforms; Neurons; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078415