DocumentCode
353326
Title
Phoneme recognition with staged neural networks
Author
Arciniegas, Fabio ; Embrechts, Mark J.
Author_Institution
Dept. of Decision Sci. & Eng. Syst., Rensselaer Polytech. Inst., Troy, NY, USA
Volume
5
fYear
2000
fDate
2000
Firstpage
259
Abstract
Presents a staged series of artificial neural networks (ANNs) for phoneme recognition for text-to-speech applications. Contrary to much of the prior published literature this approach is not restricted to monosyllabic words or the pronunciation of single multi-syllabic words, but can readily be embodied in a program that allows for the reading of a complete text. Also, it does not require pre-processing to align the letters and phonemes on the training data. The training data utilized are the 2000 most common words in American English. As an illustration it is shown that the staged neural neural network approach works excellent for a sample text (in this case the first paragraph of Frank Baum´s “The Wonderful Wizard of Oz”)
Keywords
neural nets; speech synthesis; American English; phoneme recognition; staged neural networks; text-to-speech processing; training data; Artificial neural networks; Biological neural networks; Character recognition; Encoding; Humans; Neural networks; Speech processing; Speech synthesis; Systems engineering and theory; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
Conference_Location
Como
ISSN
1098-7576
Print_ISBN
0-7695-0619-4
Type
conf
DOI
10.1109/IJCNN.2000.861467
Filename
861467
Link To Document