DocumentCode :
3641892
Title :
Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion
Author :
Andi Buzo;Horia Cucu;Corneliu Burileanu;Miruna Pasca;Vladimir Popescu
Author_Institution :
Faculty ETTI, University “
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
1
Lastpage :
8
Abstract :
In this paper, a study about the uncertainty of the trained acoustic models and the confusion among these models is made in the context of speech recognition. The purpose is to find the most relevant voice features, hence the analysis is made on a per-feature basis. Model uncertainty is defined as a measure of feature distribution overlapping. A model is compared only to the models it is more similar to. Hence, confusion matrices are built from both feature distributions and recognition results. Next, the voice features are weighted according to their relevance in order to increase the discrimination among models, while relevance itself is deduced from the values of model uncertainty. Experimental results show that, by appropriate weighting, the recognition accuracy, in terms of Word Error Rate (WER), improves. Moreover, by removing the features with lower weights, the recognition accuracy is maintained, but the number of calculations is significantly reduced.
Keywords :
"Hidden Markov models","Uncertainty","Speech recognition","Databases","Training","Decoding","Computational modeling"
Publisher :
ieee
Conference_Titel :
Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on
Print_ISBN :
978-1-4577-0440-6
Type :
conf
DOI :
10.1109/SPED.2011.5940731
Filename :
5940731
Link To Document :
بازگشت