DocumentCode :
2444578
Title :
Artificial neural networks for phoneme recognition
Author :
Brunet, Peter T. ; Pandya, A.S. ; Pinera, Carlos V.
Author_Institution :
Special Needs Syst. Dev., IBM Corp., Boca Raton, FL, USA
Volume :
7
fYear :
1994
fDate :
27 Jun-2 Jul 1994
Firstpage :
4473
Abstract :
This paper describes the use of a backpropagation artificial neural network (ANN) to recognize sustained phonemes. The network inputs were taken from 74 points of an LPC (linear predictive coding) spectrum. These LPC data were augmented with slope information at each point in an attempt to add knowledge of the shape of the spectrum. The approach was verified by merging the ANN into an existing speech therapy product, IBM SpeechViewer II, and then testing the ANN with a number of male and female speakers. Results are shown that demonstrate the viability of the approach. The ANN was also found to function in a speaker-independent manner. However, results are also shown that point out the limitations of ANNs in classifying phonemes that are quite similar, such as the m and n phonemes.
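The slope augmentation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the slope was computed or how the values were ordered, so the finite-difference scheme, the interleaved layout, and the names `slopes` and `augment_with_slope` below are assumptions made for illustration only, not the authors' method.

```python
from typing import List, Sequence

NUM_POINTS = 74  # number of LPC spectrum points stated in the abstract


def slopes(spectrum: Sequence[float]) -> List[float]:
    """Estimate the slope at each spectrum point (assumed scheme).

    Interior points use a central difference; the two end points use a
    one-sided difference. Spacing between points is taken as 1.
    """
    n = len(spectrum)
    if n < 2:
        raise ValueError("need at least two spectrum points")
    result = []
    for i in range(n):
        lo = max(i - 1, 0)
        hi = min(i + 1, n - 1)
        result.append((spectrum[hi] - spectrum[lo]) / (hi - lo))
    return result


def augment_with_slope(spectrum: Sequence[float]) -> List[float]:
    """Return an interleaved [value, slope, value, slope, ...] vector.

    The interleaved ordering is hypothetical; any fixed ordering would
    serve equally well as network input.
    """
    features: List[float] = []
    for value, slope in zip(spectrum, slopes(spectrum)):
        features.extend((float(value), slope))
    return features


if __name__ == "__main__":
    # Synthetic ramp standing in for a real 74-point LPC spectrum.
    demo = [2.0 * i for i in range(NUM_POINTS)]
    print(len(augment_with_slope(demo)))
```

Under these assumptions, a 74-point spectrum yields a 148-element input vector, which would then be fed to the input layer of the network.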
Keywords :
backpropagation; linear predictive coding; neural nets; speech recognition; speech recognition equipment; IBM SpeechViewer II; LPC spectrum; backpropagation artificial neural network; phoneme recognition; slope information; speech therapy product; sustained phonemes; Artificial neural networks; Computer networks; Equations; Feedforward systems; Linear predictive coding; Medical treatment; Merging; Neural networks; Shape; Speech
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
Type :
conf
DOI :
10.1109/ICNN.1994.374992
Filename :
374992