Title :
Artificial neural networks for phoneme recognition
Author :
Brunet, Peter T. ; Pandya, A.S. ; Pinera, Carlos V.
Author_Institution :
Special Needs Syst. Dev., IBM Corp., Boca Raton, FL, USA
fDate :
27 Jun-2 Jul 1994
Abstract :
This paper describes the use of a backpropagation artificial neural network (ANN) to recognize sustained phonemes. The inputs to the neural network were taken from 74 points of an LPC spectrum. This LPC data was augmented by adding slope information to each point in an attempt to add knowledge of the shape of the spectrum. The approach was verified by merging the ANN into an existing speech therapy product, IBM SpeechViewer II, and then testing the ANN with a number of male and female speakers. Results are shown which demonstrate the viability of the approach. It was also discovered that the ANN was able to function in a speaker independent manner. However, results are also shown which point out limitations of ANNs in classifying phonemes which are quite similar such as the m and n phonemes
Keywords :
backpropagation; linear predictive coding; neural nets; speech recognition; speech recognition equipment; IBM SpeechViewer II; LPC spectrum; backpropagation artificial neural network; phoneme recognition; slope information; speech therapy product; sustained phonemes; Artificial neural networks; Computer networks; Equations; Feedforward systems; Linear predictive coding; Medical treatment; Merging; Neural networks; Shape; Speech;
Conference_Titel :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
DOI :
10.1109/ICNN.1994.374992