DocumentCode :
1684582
Title :
Combination of improved Katz and mutual information for speech recognition based on lattice
Author :
Lei, Zhang ; Dong, Lu ; Xue-zhi, Xiang
Author_Institution :
Dept. of Commun. & Inf., Harbin Eng. Univ., Harbin, China
fYear :
2010
Firstpage :
6379
Lastpage :
6382
Abstract :
In the language model based on Chinese, only the number of occurring count of n-gram can not account for the reliability. Some n-grams have strong meaning in Chinese although their occurring count is low in training data. Combined the number of count with mutual information, whether the syllables in n-gram are highly associated can be better described. Further, in most smoothing approaches, the discount idea is widely adopted, that is for less reliable n-grams, the probabilities calculated by training data may be over-estimated, so they should be discounted. In this paper, not only the over-estimated n-grams are discounted, but also the most reliable n-grams which may be under-estimated during training procedure are enhanced. For all these modification is based on Katz which is the most commonly used method in speech recognition, the proposed approach is named as improved Katz. From the experiment results, it can be drawn that the performance of improved Katz is better that only Katz.
Keywords :
speech recognition; Chinese; Katz; language model; lattice; mutual information; speech recognition; training data; Lattices; Mutual information; Smoothing methods; Speech; Speech recognition; Training; Training data; Improved Katz; Lattice; Mutual Information; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation (WCICA), 2010 8th World Congress on
Conference_Location :
Jinan
Print_ISBN :
978-1-4244-6712-9
Type :
conf
DOI :
10.1109/WCICA.2010.5554331
Filename :
5554331
Link To Document :
بازگشت