DocumentCode
2705195
Title
Novel Lookahead Decision Tree State Tying for Acoustic Modeling
Author
Jian Xue ; Yuxin Zhao
Author_Institution
Dept. of Comput. Sci., Missouri Univ., Columbia, MO, USA
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper presents two new lookahead methods of constructing phonetic decision trees (PDTs) for acoustic model state tying, a constrained method and a stochastic method. The constrained lookahead method searches for optimal phonetic questions among pre-selected question sets, and reduces contributions of deeper decedents as a function of their levels in the tree. The stochastic full lookahead method uses subtree size instead of likelihood gain as a judgment in selecting a phonetic question for a node split, in order to find a compact tree that is consistent with training data. Since the computational cost of exhaustive lookahead is prohibitively high, a stochastic subtree generation method is used to explore most promising question at each node. We also propose using a phone-state dependent threshold instead of a fixed threshold of likelihood gain to decide if a node split should continue or not. Furthermore, we use a fast confusion network (CN) algorithm to combine recognition hypotheses produced by using acoustic models from different PDT training methods. Experimental results show that the proposed lookahead methods consistently decrease model size, and the integration of recognition hypotheses consistently improves recognition accuracy.
Keywords
decision trees; speech processing; stochastic processes; acoustic model state tying; confusion network algorithm; constrained lookahead method; lookahead decision tree state; optimal phonetic questions; phone-state dependent threshold; stochastic method; stochastic subtree generation method; Computational efficiency; Computer science; Context modeling; Decision trees; Merging; Profitability; Speech recognition; Stochastic processes; Training data; Vocabulary; constrained lookahead; phone-state dependent threshold; phonetic decision trees; stochastic full lookahead;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367274
Filename
4218305
Link To Document