مرکز منطقه ای اطلاع رساني علوم و فناوري - A new combined modeling of continuous speech recognition

DocumentCode :

2665731

Title :

A new combined modeling of continuous speech recognition

Author :

Han, Zhciobing ; Jia, Lei ; Zhang, Shzrwu ; Xu, Bo

Author_Institution :

High-Tech Innovation Center, Chinese Acad. of Sci., Beijing, China

fYear :

2003

fDate :

26-29 Oct. 2003

Firstpage :

597

Lastpage :

602

Abstract :

Robust estimate of a large number of parameters against the availability of training data is a crucial issue in triphone based continuous speech recognition. To cope with the issue, two major context-clustering methods, agglomerative (AGG) and tree-based (TB), have been widely studied. In this paper, we analyze two algorithms with respect to their advantages and disadvantages and introduce a novel combined method that takes advantage of each method to cluster and tie similar acoustic states for highly detailed acoustic models. In addition, we devise a two-level clustering approach for TB, which uses the tree-based state tying for rare acoustic phonetic events twice. For LVCSR, the experimental results showed the performance could be highly improved by using the proposed combined method, compared with those of using the popular TB method alone.

Keywords :

decision trees; hidden Markov models; parameter estimation; pattern clustering; speech processing; speech recognition; acoustic phonetics; agglomerative clustering; context-clustering method; decision tree; hidden Markov models; parameter estimation; training data; triphone based continuous speech recognition; Automatic speech recognition; Classification tree analysis; Clustering algorithms; Context modeling; Decision trees; Pattern recognition; Power system modeling; Robustness; Speech recognition; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on

Conference_Location :

Beijing, China

Print_ISBN :

0-7803-7902-0

Type :

conf

DOI :

10.1109/NLPKE.2003.1275976

Filename :

1275976

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2665731