DocumentCode
2576765
Title
Methods for improving robustness of decision tree in Mandarin speech recognition
Author
Xu, Xianghua ; Zhu, Jie ; Guo, Qiang
Author_Institution
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., China
Volume
3
fYear
2004
fDate
27-30 June 2004
Firstpage
1975
Abstract
Phonetic decision tree based state tying has been widely used in most large vocabulary continuous speech recognition (LVCSR) systems. However, in most cases, the samples of different leaf nodes are very unbalanced, which may affect the recognition performance. In This work, node merging techniques are proposed to alleviate the problem and further decrease the number of senones. On the other hand, in order to lessen the impact of rare triphones on the quality of the decision tree based state tying and improve the accuracy of every final senone, two methods of dealing with rare triphones are added to hidden Markov model (HMM) acoustic modeling before state tying. Experimental results show that these methods greatly improve the robustness of the decision tree and can achieve better performance with even fewer parameters.
Keywords
decision trees; hidden Markov models; natural languages; speech recognition; HMM acoustic modeling; LVCSR; Mandarin speech recognition robustness; hidden Markov models; large vocabulary continuous speech recognition systems; leaf node sample balance; node merging techniques; phonetic decision tree based state tying; rare triphones; senones number reduction; Context modeling; Decision trees; Frequency; Hidden Markov models; Maximum likelihood estimation; Merging; Robustness; Speech recognition; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN
0-7803-8603-5
Type
conf
DOI
10.1109/ICME.2004.1394649
Filename
1394649
Link To Document