DocumentCode :
3484134
Title :
Hybrid HMM-NN for speech recognition and prior class probabilities
Author :
Albesano, Dario ; Gemello, Roberto ; Mana, F.
Author_Institution :
Loquendo S.p.A., Torino, Italy
Volume :
5
fYear :
2002
fDate :
18-22 Nov. 2002
Firstpage :
2391
Abstract :
During the last years, speech recognition technologies have started their migration from research laboratories to real word applications gaining market shares. Although this shows that paradigms like Neural Networks have reached a high level of accuracy in modeling speech, it must be realized that there is still room for improving recognition performances exploiting the feedbacks coming from the applicative fields. In these cases, in fact, precious application dependent speech material can be recorded, and used to train the acoustic models in order to improve the behaviour of the recognizer on target dictionaries. The best results can be achieved when an iterative, refining process is set up. Unfortunately, speech corpora coming from the field are seldom phonetically balanced and this can cause the performances of the Neural Network to get worse, wasting the benefits of the refining process. In this paper, the problem of Prior Probability normalization has been faced and a method for Prior Probability normalization has been investigated, with the important characteristic of being applicable simply through a modification of the biases at the end of the training phase (therefore on trained nets). An experimentation on several languages is reported, showing the Prior Probability normalization seems quite useful to improve recognition accuracy and to get rid of some undesired effects of training data-bases not perfectly phonetically balanced.
Keywords :
Bayes methods; hidden Markov models; learning (artificial intelligence); multilayer perceptrons; probability; speech recognition; Bayesian probabilities; MLP; automata states; hybrid HMM-neural net; prior class probabilities; prior probability normalization; recognition accuracy; speech recognition; Acoustic applications; Acoustic materials; Dictionaries; Hidden Markov models; Laboratories; Neural networks; Neurofeedback; Speech processing; Speech recognition; Target recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN :
981-04-7524-1
Type :
conf
DOI :
10.1109/ICONIP.2002.1201922
Filename :
1201922
Link To Document :
بازگشت