• DocumentCode
    3615363
  • Title

    A study on different neural network architectures applied to text-to-phoneme mapping

  • Author

    E.B. Bilcu;J. Suontausta;J. Saarinen

  • Author_Institution
    Digital Comput. Syst. Lab., Tampere Univ. of Technol., Finland
  • Volume
    2
  • fYear
    2003
  • fDate
    6/25/1905 12:00:00 AM
  • Firstpage
    892
  • Abstract
    In this paper, the problem of text-to-phoneme mapping of isolated words for the English language is studied. Multilayer perceptron, recurrent and bidirectional recurrent neural network architectures are compared in the text-to-phoneme mapping task. Multilayer perceptron neural networks utilize contextual information due to the orthography of a word. In our study, recurrent and bidirectional recurrent neural networks, on the other hand, do not take the letter context into account. Instead, these networks utilize the contextual information due to the previously transcribed phonemes as introduced by the feedback loop in the networks.
  • Keywords
    "Neural networks","Dictionaries","Recurrent neural networks","Speech synthesis","Decision trees","Multi-layer neural network","Automatic speech recognition","Testing","Laboratories","Multilayer perceptrons"
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the 3rd International Symposium on
  • Print_ISBN
    953-184-061-X
  • Type

    conf

  • DOI
    10.1109/ISPA.2003.1296405
  • Filename
    1296405