Title : 
Phoneme recognition with staged neural networks
         
        
            Author : 
Arciniegas, Fabio ; Embrechts, Mark J.
         
        
            Author_Institution : 
Dept. of Decision Sci. & Eng. Syst., Rensselaer Polytech. Inst., Troy, NY, USA
         
        
        
        
        
        
            Abstract : 
Presents a staged series of artificial neural networks (ANNs) for phoneme recognition for text-to-speech applications. Contrary to much of the prior published literature this approach is not restricted to monosyllabic words or the pronunciation of single multi-syllabic words, but can readily be embodied in a program that allows for the reading of a complete text. Also, it does not require pre-processing to align the letters and phonemes on the training data. The training data utilized are the 2000 most common words in American English. As an illustration it is shown that the staged neural neural network approach works excellent for a sample text (in this case the first paragraph of Frank Baum´s “The Wonderful Wizard of Oz”)
         
        
            Keywords : 
neural nets; speech synthesis; American English; phoneme recognition; staged neural networks; text-to-speech processing; training data; Artificial neural networks; Biological neural networks; Character recognition; Encoding; Humans; Neural networks; Speech processing; Speech synthesis; Systems engineering and theory; Training data;
         
        
        
        
            Conference_Titel : 
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
         
        
            Conference_Location : 
Como
         
        
        
            Print_ISBN : 
0-7695-0619-4
         
        
        
            DOI : 
10.1109/IJCNN.2000.861467