مرکز منطقه ای اطلاع رساني علوم و فناوري - Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

DocumentCode :

3641892

Title :

Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

Author :

Andi Buzo;Horia Cucu;Corneliu Burileanu;Miruna Pasca;Vladimir Popescu

Author_Institution :

Faculty ETTI, University “

fYear :

2011

fDate :

5/1/2011 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

In this paper, a study about the uncertainty of the trained acoustic models and the confusion among these models is made in the context of speech recognition. The purpose is to find the most relevant voice features, hence the analysis is made on a per-feature basis. Model uncertainty is defined as a measure of feature distribution overlapping. A model is compared only to the models it is more similar to. Hence, confusion matrices are built from both feature distributions and recognition results. Next, the voice features are weighted according to their relevance in order to increase the discrimination among models, while relevance itself is deduced from the values of model uncertainty. Experimental results show that, by appropriate weighting, the recognition accuracy, in terms of Word Error Rate (WER), improves. Moreover, by removing the features with lower weights, the recognition accuracy is maintained, but the number of calculations is significantly reduced.

Keywords :

"Hidden Markov models","Uncertainty","Speech recognition","Databases","Training","Decoding","Computational modeling"

Publisher :

ieee

Conference_Titel :

Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on

Print_ISBN :

978-1-4577-0440-6

Type :

conf

DOI :

10.1109/SPED.2011.5940731

Filename :

5940731

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3641892