مرکز منطقه ای اطلاع رساني علوم و فناوري - Substate tying with combined parameter training and reduction in tied-mixture HMM design

DocumentCode :

746455

Title :

Substate tying with combined parameter training and reduction in tied-mixture HMM design

Author :

Gu, Liang ; Rose, Kenneth

Author_Institution :

Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA

Volume :

Issue :

fYear :

2002

fDate :

3/1/2002 12:00:00 AM

Firstpage :

137

Lastpage :

145

Abstract :

Two approaches are proposed for the design of tied-mixture hidden Markov models (TMHMM). One approach improves parameter sharing via partial tying of TMHMM states. To facilitate tying at the substate level, the state emission probabilities are constructed in two stages or, equivalently, are viewed as a "mixture of mixtures of Gaussians." This paradigm allows, and is complemented with, an optimization technique to seek the best complexity-accuracy tradeoff solution, which jointly exploits Gaussian density sharing and substate tying. Another approach to enhance model training is combined training and reduction of model parameters. The procedure starts by training a system with a large universal codebook of Gaussian densities. It then iteratively reduces the size of both the codebook and the mixing coefficient matrix, followed by parameter re-training. The additional cost in design complexity is modest. Experimental results on the ISOLET database and its E-set subset show that substate tying reduces the classification error rate by over 15%, compared to standard Gaussian sharing and whole-state tying. TMHMM design with combined training and reduction of parameters reduces the classification error rate by over 20% compared to conventional TMHMM design. When the two proposed approaches were integrated, 25% error rate reduction over TMHMM with whole-state tying was achieved

Keywords :

Gaussian processes; hidden Markov models; matrix algebra; optimisation; probability; speech coding; speech recognition; E-set subset; Gaussian density sharing; ISOLET database; TMHMM; classification error rate reduction; codebook size reduction; combined parameter training; complexity-accuracy tradeoff; design complexity; mixing coefficient matrix; mixture of mixtures of Gaussians; model parameters reduction; model training; optimization; parameter re-training; parameter sharing; speech recognizers; state emission probabilities; substate tying; tied-mixture HMM design; tied-mixture hidden Markov models; Automatic speech recognition; Costs; Databases; Error analysis; Gaussian processes; Helium; Hidden Markov models; Robustness; Speech recognition; Training data;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2002.1001978

Filename :

1001978

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=746455