DocumentCode
2444578
Title
Artificial neural networks for phoneme recognition
Author
Brunet, Peter T. ; Pandya, A.S. ; Pinera, Carlos V.
Author_Institution
Special Needs Syst. Dev., IBM Corp., Boca Raton, FL, USA
Volume
7
fYear
1994
fDate
27 Jun-2 Jul 1994
Firstpage
4473
Abstract
This paper describes the use of a backpropagation artificial neural network (ANN) to recognize sustained phonemes. The inputs to the neural network were taken from 74 points of an LPC spectrum. This LPC data was augmented by adding slope information to each point in an attempt to add knowledge of the shape of the spectrum. The approach was verified by merging the ANN into an existing speech therapy product, IBM SpeechViewer II, and then testing the ANN with a number of male and female speakers. Results are shown which demonstrate the viability of the approach. It was also discovered that the ANN was able to function in a speaker independent manner. However, results are also shown which point out limitations of ANNs in classifying phonemes which are quite similar such as the m and n phonemes
Keywords
backpropagation; linear predictive coding; neural nets; speech recognition; speech recognition equipment; IBM SpeechViewer II; LPC spectrum; backpropagation artificial neural network; phoneme recognition; slope information; speech therapy product; sustained phonemes; Artificial neural networks; Computer networks; Equations; Feedforward systems; Linear predictive coding; Medical treatment; Merging; Neural networks; Shape; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location
Orlando, FL
Print_ISBN
0-7803-1901-X
Type
conf
DOI
10.1109/ICNN.1994.374992
Filename
374992
Link To Document