Regional Information Center for Science and Technology - Novel Lookahead Decision Tree State Tying for Acoustic Modeling

DocumentCode :

2705195

Title :

Novel Lookahead Decision Tree State Tying for Acoustic Modeling

Author :

Jian Xue ; Yuxin Zhao

Author_Institution :

Dept. of Comput. Sci., Missouri Univ., Columbia, MO, USA

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

This paper presents two new lookahead methods of constructing phonetic decision trees (PDTs) for acoustic model state tying: a constrained method and a stochastic method. The constrained lookahead method searches for optimal phonetic questions among pre-selected question sets, and reduces the contributions of deeper descendants as a function of their levels in the tree. The stochastic full lookahead method uses subtree size instead of likelihood gain as the criterion for selecting a phonetic question for a node split, in order to find a compact tree that is consistent with the training data. Since the computational cost of exhaustive lookahead is prohibitively high, a stochastic subtree generation method is used to explore the most promising questions at each node. We also propose using a phone-state dependent threshold instead of a fixed threshold of likelihood gain to decide whether a node split should continue. Furthermore, we use a fast confusion network (CN) algorithm to combine recognition hypotheses produced by acoustic models from different PDT training methods. Experimental results show that the proposed lookahead methods consistently decrease model size, and that the integration of recognition hypotheses consistently improves recognition accuracy.
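
Illustrative note (not part of the original record): the abstract does not give the formulas, so the following is only a minimal sketch of what a depth-discounted lookahead split might look like. It assumes 1-D single-Gaussian likelihoods, a hypothetical geometric `decay` factor for deeper levels, and questions that test only the left context; the names `log_lik`, `lookahead_score`, and `best_question` are invented for illustration and are not the authors' implementation.

```python
import math

def log_lik(samples):
    """Log-likelihood of samples under their own ML 1-D Gaussian (variance floored)."""
    n = len(samples)
    if n == 0:
        return 0.0
    mean = sum(samples) / n
    var = max(sum((x - mean) ** 2 for x in samples) / n, 1e-3)
    return -0.5 * n * (math.log(2 * math.pi * var) + 1.0)

def pooled(states):
    """Pool the samples of all states in a cluster."""
    return [x for _, xs in states for x in xs]

def split(states, question):
    """Partition states by whether their left context is in the question set."""
    yes = [s for s in states if s[0] in question]
    no = [s for s in states if s[0] not in question]
    return yes, no

def gain(states, question):
    """Greedy likelihood gain of one split; None if the split is degenerate."""
    yes, no = split(states, question)
    if not yes or not no:
        return None
    return log_lik(pooled(yes)) + log_lik(pooled(no)) - log_lik(pooled(states))

def lookahead_score(states, question, questions, depth, decay):
    """Gain of this split plus discounted best gains of its descendants.

    `decay` (assumed, in [0, 1]) shrinks each deeper level's contribution.
    """
    g = gain(states, question)
    if g is None or depth == 0:
        return g
    score = g
    for child in split(states, question):
        scores = [lookahead_score(child, q, questions, depth - 1, decay)
                  for q in questions.values()]
        scores = [s for s in scores if s is not None]
        if scores:
            # A child may also stay unsplit, which contributes zero.
            score += decay * max(max(scores), 0.0)
    return score

def best_question(states, questions, depth=1, decay=0.5):
    """Return (name, score) of the best-scoring question, or None."""
    scored = [(lookahead_score(states, q, questions, depth, decay), name)
              for name, q in questions.items()]
    scored = [(s, name) for s, name in scored if s is not None]
    if not scored:
        return None
    s, name = max(scored)
    return name, s

if __name__ == "__main__":
    # Toy data: (left-context phone, 1-D feature samples) per triphone state.
    states = [("a", [0.0, 0.1, -0.1]), ("e", [0.2, 0.0, 0.1]),
              ("m", [3.0, 3.1, 2.9]), ("n", [5.0, 5.2, 4.9])]
    questions = {"L_Vowel": frozenset("ae"), "L_m": frozenset("m"),
                 "L_a": frozenset("a")}
    print(best_question(states, questions, depth=1, decay=0.5))
```

With `depth=0` the score reduces to the ordinary greedy likelihood gain, so the sketch can be compared directly against conventional PDT splitting on the same toy data.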

Keywords :

decision trees; speech processing; stochastic processes; acoustic model state tying; confusion network algorithm; constrained lookahead method; lookahead decision tree state; optimal phonetic questions; phone-state dependent threshold; stochastic method; stochastic subtree generation method; Computational efficiency; Computer science; Context modeling; Decision trees; Merging; Profitability; Speech recognition; Stochastic processes; Training data; Vocabulary; constrained lookahead; phone-state dependent threshold; phonetic decision trees; stochastic full lookahead;

fLanguage :

English

Publisher :

IEEE

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.367274

Filename :

4218305

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2705195