DocumentCode :
746455
Title :
Substate tying with combined parameter training and reduction in tied-mixture HMM design
Author :
Gu, Liang ; Rose, Kenneth
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Volume :
10
Issue :
3
fYear :
2002
fDate :
3/1/2002 12:00:00 AM
Firstpage :
137
Lastpage :
145
Abstract :
Two approaches are proposed for the design of tied-mixture hidden Markov models (TMHMM). One approach improves parameter sharing via partial tying of TMHMM states. To facilitate tying at the substate level, the state emission probabilities are constructed in two stages or, equivalently, are viewed as a "mixture of mixtures of Gaussians." This paradigm allows, and is complemented with, an optimization technique to seek the best complexity-accuracy tradeoff solution, which jointly exploits Gaussian density sharing and substate tying. Another approach to enhance model training is combined training and reduction of model parameters. The procedure starts by training a system with a large universal codebook of Gaussian densities. It then iteratively reduces the size of both the codebook and the mixing coefficient matrix, followed by parameter re-training. The additional cost in design complexity is modest. Experimental results on the ISOLET database and its E-set subset show that substate tying reduces the classification error rate by over 15%, compared to standard Gaussian sharing and whole-state tying. TMHMM design with combined training and reduction of parameters reduces the classification error rate by over 20% compared to conventional TMHMM design. When the two proposed approaches were integrated, 25% error rate reduction over TMHMM with whole-state tying was achieved
Keywords :
Gaussian processes; hidden Markov models; matrix algebra; optimisation; probability; speech coding; speech recognition; E-set subset; Gaussian density sharing; ISOLET database; TMHMM; classification error rate reduction; codebook size reduction; combined parameter training; complexity-accuracy tradeoff; design complexity; mixing coefficient matrix; mixture of mixtures of Gaussians; model parameters reduction; model training; optimization; parameter re-training; parameter sharing; speech recognizers; state emission probabilities; substate tying; tied-mixture HMM design; tied-mixture hidden Markov models; Automatic speech recognition; Costs; Databases; Error analysis; Gaussian processes; Helium; Hidden Markov models; Robustness; Speech recognition; Training data;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.1001978
Filename :
1001978
Link To Document :
بازگشت