Phoneme recognition with staged neural networks

Author

Arciniegas, Fabio ; Embrechts, Mark J.

Author_Institution

Dept. of Decision Sci. & Eng. Syst., Rensselaer Polytech. Inst., Troy, NY, USA

Volume

5

fYear

2000

fDate

2000

Firstpage

259

Abstract

Presents a staged series of artificial neural networks (ANNs) for phoneme recognition for text-to-speech applications. Contrary to much of the prior published literature this approach is not restricted to monosyllabic words or the pronunciation of single multi-syllabic words, but can readily be embodied in a program that allows for the reading of a complete text. Also, it does not require pre-processing to align the letters and phonemes on the training data. The training data utilized are the 2000 most common words in American English. As an illustration it is shown that the staged neural neural network approach works excellent for a sample text (in this case the first paragraph of Frank Baum´s “The Wonderful Wizard of Oz”)

Keywords

neural nets; speech synthesis; American English; phoneme recognition; staged neural networks; text-to-speech processing; training data; Artificial neural networks; Biological neural networks; Character recognition; Encoding; Humans; Neural networks; Speech processing; Speech synthesis; Systems engineering and theory; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on

Conference_Location

Como

ISSN

1098-7576

Print_ISBN

0-7695-0619-4

Type

conf

DOI

10.1109/IJCNN.2000.861467

Filename

861467