Title :
Speech recognition using dynamic neural networks
Author :
Botros, Nazeih M. ; Premnath, S.
Author_Institution :
Dept. of Electr. Eng., Southern Illinois Univ., Carbondale, IL, USA
Abstract :
The authors present an algorithm for isolated-word recognition that takes into consideration the duration variability of the different utterances of the same word. The algorithm is based on extracting acoustical features from the speech signal and using them as the input to a sequence of multilayer perceptron neural networks. The networks were implemented as predictors for the speech samples for a certain duration of time. The networks were trained by a combination of the back-propagation and the dynamic time warping (DTW) techniques. The DTW technique was implemented to normalize the duration variability. The networks were trained to recognize the correct words and to reject the wrong words. The training set consisted of ten words, each uttered seven times by three different speakers. The test set consisted of three utterances of each of the ten words. The results show that all these words could be recognized
Keywords :
dynamic programming; feedforward neural nets; learning (artificial intelligence); speech recognition; acoustical features; back-propagation; duration variability; dynamic neural networks; dynamic time warping; isolated-word recognition; multilayer perceptron neural networks; predictors; speech recognition; utterances; Automatic speech recognition; Cepstral analysis; Feature extraction; Linear predictive coding; Multilayer perceptrons; Neural networks; Pattern recognition; Signal processing algorithms; Speech recognition; Vectors;
Conference_Titel :
Neural Networks, 1992. IJCNN., International Joint Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
0-7803-0559-0
DOI :
10.1109/IJCNN.1992.227230