• DocumentCode
    353326
  • Title

    Phoneme recognition with staged neural networks

  • Author

    Arciniegas, Fabio ; Embrechts, Mark J.

  • Author_Institution
    Dept. of Decision Sci. & Eng. Syst., Rensselaer Polytech. Inst., Troy, NY, USA
  • Volume
    5
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    259
  • Abstract
    Presents a staged series of artificial neural networks (ANNs) for phoneme recognition for text-to-speech applications. Contrary to much of the prior published literature this approach is not restricted to monosyllabic words or the pronunciation of single multi-syllabic words, but can readily be embodied in a program that allows for the reading of a complete text. Also, it does not require pre-processing to align the letters and phonemes on the training data. The training data utilized are the 2000 most common words in American English. As an illustration it is shown that the staged neural neural network approach works excellent for a sample text (in this case the first paragraph of Frank Baum´s “The Wonderful Wizard of Oz”)
  • Keywords
    neural nets; speech synthesis; American English; phoneme recognition; staged neural networks; text-to-speech processing; training data; Artificial neural networks; Biological neural networks; Character recognition; Encoding; Humans; Neural networks; Speech processing; Speech synthesis; Systems engineering and theory; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
  • Conference_Location
    Como
  • ISSN
    1098-7576
  • Print_ISBN
    0-7695-0619-4
  • Type

    conf

  • DOI
    10.1109/IJCNN.2000.861467
  • Filename
    861467