DocumentCode :
1097059
Title :
Knowledge-Based Adaptive Decision Tree State Tying for Conversational Speech Recognition
Author :
Hu, Rusheng ; Zhao, Yunxin
Author_Institution :
Univ. of Missouri-Columbia, Columbia
Volume :
15
Issue :
7
fYear :
2007
Firstpage :
2160
Lastpage :
2168
Abstract :
This paper presents a new method of constructing phonetic decision trees (PDTs) for acoustic model state tying based on implicitly induced prior knowledge. Our hypothesis is that knowledge on pronunciation variation in spontaneous, conversational speech contained in a relatively large corpus can be used for building domain-specific or speaker-dependent PDTs. In view of tree-structure adaptation, this method leads to transformation of tree topology in contrast to keeping fixed tree structure as in traditional methods of speaker adaptation. A Bayesian learning framework is proposed to incorporate prior knowledge on decision rules in a greedy search of new decision trees, where the prior is generated by a decision tree growing process on a large data set. Experimental results on the telemedicine automatic captioning task demonstrate that the proposed approach results in consistent improvement in model quality and recognition accuracy.
Keywords :
belief networks; decision trees; learning (artificial intelligence); speaker recognition; speech processing; Bayesian learning framework; acoustic model state tying; conversational speech recognition; decision rules; greedy search; knowledge-based adaptive phonetic decision tree structure; pronunciation variation; speaker adaptation; telemedicine automatic captioning task; tree structure; tree topology transformation; Automatic speech recognition; Bayesian methods; Buildings; Computer science; Context modeling; Decision trees; Dictionaries; Hidden Markov models; Robustness; Speech recognition; Acoustic modeling; approximate Bayesian; decision tree state tying; implicit prior; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.901830
Filename :
4291602
Link To Document :
بازگشت