Title :
Hybrid speech recognition system with discriminative training applied for Romanian language
Author :
Gavat, Inge ; Zirra, Matei ; Cula, Oana
Author_Institution :
Polytech.. Univ. of Bucharest, Romania
Abstract :
This paper describes a hybrid connectionist-statistical system consisting of a neural network integrated in a hidden Markov model (HMM). The neural network used is the multilayer perceptron (MLP) and that network is the mechanism that computes the a-posteriori probability of a sequence of HMMs states. The classifier is based on total scores computed by Viterbi alignment for each hybrid model corresponding to the words in the vocabulary. Because of the lack of discrimination between the models and the unintended discrimination between the states in each model, we propose a solution that improves the system, namely an additional training task based on a cost function that approximates the misclassification rate of the hybrid system. The optimization criterion is based on a descent algorithm and the result is a minimum classification error. Our experiments on a 35 word vocabulary, show an improvement of the recognition rate from 92.4% for the case of a statistical system based only on HMMs, to 94.7% for the case of a hybrid HMM-MLP system, and to 97.9% for the case of an improved hybrid system with an extra discriminative training
Keywords :
hidden Markov models; learning (artificial intelligence); multilayer perceptrons; natural languages; probability; signal classification; speech recognition; statistical analysis; Romanian language; Viterbi alignment; a-posteriori probability; cost function; descent algorithm; discriminative training; experiments; hidden Markov model; hybrid HMM-MLP system; hybrid connectionist-statistical system; hybrid speech recognition system; minimum classification error; misclassification rate approximation; multilayer perceptron; recognition rate; speech processing; training task; vocabulary; Artificial neural networks; Backpropagation algorithms; Hidden Markov models; Natural languages; Neural networks; Power system modeling; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary;
Conference_Titel :
Electrotechnical Conference, 1998. MELECON 98., 9th Mediterranean
Conference_Location :
Tel-Aviv
Print_ISBN :
0-7803-3879-0
DOI :
10.1109/MELCON.1998.692163