DocumentCode :
2665731
Title :
A new combined modeling of continuous speech recognition
Author :
Han, Zhciobing ; Jia, Lei ; Zhang, Shzrwu ; Xu, Bo
Author_Institution :
High-Tech Innovation Center, Chinese Acad. of Sci., Beijing, China
fYear :
2003
fDate :
26-29 Oct. 2003
Firstpage :
597
Lastpage :
602
Abstract :
Robust estimate of a large number of parameters against the availability of training data is a crucial issue in triphone based continuous speech recognition. To cope with the issue, two major context-clustering methods, agglomerative (AGG) and tree-based (TB), have been widely studied. In this paper, we analyze two algorithms with respect to their advantages and disadvantages and introduce a novel combined method that takes advantage of each method to cluster and tie similar acoustic states for highly detailed acoustic models. In addition, we devise a two-level clustering approach for TB, which uses the tree-based state tying for rare acoustic phonetic events twice. For LVCSR, the experimental results showed the performance could be highly improved by using the proposed combined method, compared with those of using the popular TB method alone.
Keywords :
decision trees; hidden Markov models; parameter estimation; pattern clustering; speech processing; speech recognition; acoustic phonetics; agglomerative clustering; context-clustering method; decision tree; hidden Markov models; parameter estimation; training data; triphone based continuous speech recognition; Automatic speech recognition; Classification tree analysis; Clustering algorithms; Context modeling; Decision trees; Pattern recognition; Power system modeling; Robustness; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
Type :
conf
DOI :
10.1109/NLPKE.2003.1275976
Filename :
1275976
Link To Document :
بازگشت