Title :
A spatio-temporal pattern recognition approach to word recognition
Author :
Tom, M. Daniel ; Tenorio, M.Fernando
Author_Institution :
Sch. of Electr. Eng., Purdue Univ., West Lafayette, IN, USA
Abstract :
A limited-vocabulary, speaker-independent, isolated-word recognition system has been built. This system recognizes digitized isolated words without performing segmentation, phoneme identification, or dynamic time warping. It uses a static pattern recognition approach to recognize a spatio-temporal pattern. The preprocessing only includes endpoint identification, preceding and trailing silence removal and word length determination. A fourth-order linear prediction coding (LPC) analysis is preformed on each of 32 equally spaced frames. The four LPC coefficients plus four other features from each frame are input to a neural network consisting of 50 hidden-layer units and four output-layer units. The authors have trained the system for four words spoken by one single speaker. The system is then able to recognize four words from each of five other speakers.<>
Keywords :
encoding; filtering and prediction theory; neural nets; speech recognition; encoding; endpoint identification; fourth order linear prediction coding analysis; hidden-layer units; isolated-word recognition system; limited-vocabulary; neural network; output-layer units; silence removal; spatio-temporal pattern recognition approach; speaker-independent; word length determination; word recognition; Encoding; Filtering; Neural networks; Prediction methods; Speech recognition;
Conference_Titel :
Neural Networks, 1989. IJCNN., International Joint Conference on
Conference_Location :
Washington, DC, USA
DOI :
10.1109/IJCNN.1989.118608