DocumentCode :
323836
Title :
Simplified neural network architectures for a hybrid speech recognition system with small vocabulary size
Author :
Sedarat, Hossein ; Khadem, Rasool ; Franco, Horacio
Author_Institution :
Dept. of Electr. Eng., Stanford Univ., CA, USA
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
1113
Abstract :
Previous studies suggest that a hybrid speech recognition system based on a hidden Markov model (HMM) with a neural network (NN) subsystem as the estimator of the state conditional observation probability may have some advantages over the conventional HMMs with Gaussian mixture models for the observation probabilities. The HMM and NN modules are typically treated as separate entities in a hybrid system. This paper, however, suggests that the a priori knowledge of the HMM structure can be beneficial in the design of the NN subsystem. A case of isolated word recognition is studied to demonstrate that a substantially simplified NN can be achieved in a structured HMM by applying a Bayesian factorization and pre-classification. The results indicate a similar performance to that obtained with the classical approach with much less complexity in the NN structure
Keywords :
Bayes methods; backpropagation; hidden Markov models; multilayer perceptrons; neural net architecture; pattern classification; probability; speech recognition; state estimation; Bayesian factorization; Gaussian mixture models; HMM; adaptive learning rate; error back propagation algorithm; hidden Markov model; hybrid speech recognition system; isolated word recognition; multilayer perceptron; neural network architectures; neural network subsystem; performance; pre-classification; small vocabulary size; state conditional observation probability; state estimator; Bayesian methods; Equations; Gaussian distribution; Hidden Markov models; Maximum likelihood estimation; Multilayer perceptrons; Neural networks; Probability distribution; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675464
Filename :
675464
Link To Document :
بازگشت