DocumentCode :
1135573
Title :
Pruning Hidden Markov Models With Optimal Brain Surgeon
Author :
Mak, Brian ; Chan, Kin-Wah
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., China
Volume :
13
Issue :
5
fYear :
2005
Firstpage :
993
Lastpage :
1003
Abstract :
A method for pruning hidden Markov models (HMMs) is presented. The main purpose is to find a good HMM topology for a given task, with improved generalization capability. As a side effect, the resulting model also reduces memory and computation costs. The first goal falls within the active research area of model selection. The model-theoretic research community has proposed various measures, such as the Bayesian information criterion, minimum description length, and minimum message length, and these have been used with some success. In this paper, we consider another approach, in which a well-performing HMM, though perhaps oversized, is optimally pruned so that the loss in the model training cost function is minimal. The method, known as optimal brain surgeon (OBS), has previously been applied to pruning neural networks (NNs). In this paper, the OBS algorithm is modified to prune HMMs. While the application of OBS to NNs is a constrained optimization problem with only equality constraints, which can be solved by Lagrange multipliers, its application to HMMs requires significant modifications, resulting in a quadratic programming problem with both equality and inequality constraints. The detailed formulation of pruning an HMM with OBS is presented. The method was evaluated in two experiments: a simulation using a discrete HMM, and a speech recognition experiment using continuous-density HMMs trained for the TIDIGITS task. In the simulation, our novel OBS algorithm was able to “re-discover” the true topology of the discrete HMM; in the speech recognition experiment, up to about 30% of the HMM transitions were successfully pruned, and yet the reduced models gave better generalization performance on unseen test data.
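For orientation, the equality-constrained case that the abstract attributes to neural networks can be sketched as follows. This is a minimal illustration of one classic OBS pruning step, not the paper's HMM formulation: it assumes a local quadratic model of the training cost with a known, invertible Hessian, and the names `invert` and `obs_prune_step` are hypothetical helpers introduced here. The HMM variant described in the abstract adds inequality constraints and is solved by quadratic programming, which this sketch does not attempt.

```python
def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Augment the matrix with the identity: [A | I].
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    # The right half now holds the inverse.
    return [row[n:] for row in aug]


def obs_prune_step(weights, hessian):
    """One classic OBS step (illustrative; assumes an invertible Hessian).

    Minimises 0.5 * dw^T H dw subject to the single equality constraint
    dw[q] = -w[q], i.e. parameter q is forced to zero. The Lagrange-multiplier
    solution gives the saliency w_q^2 / (2 * [H^-1]_qq) and the compensating
    update dw = -(w_q / [H^-1]_qq) * H^-1 e_q for the remaining parameters.
    Returns (index pruned, its saliency, adjusted weights).
    """
    h_inv = invert(hessian)
    saliencies = [w * w / (2.0 * h_inv[i][i]) for i, w in enumerate(weights)]
    q = min(range(len(weights)), key=lambda i: saliencies[i])
    multiplier = weights[q] / h_inv[q][q]
    new_weights = [w - multiplier * h_inv[i][q] for i, w in enumerate(weights)]
    new_weights[q] = 0.0  # clear floating-point residue on the pruned entry
    return q, saliencies[q], new_weights


# Toy usage: two correlated parameters; the small one is pruned and the
# other is adjusted to compensate.
q, saliency, pruned = obs_prune_step([1.0, 0.1], [[2.0, 1.0], [1.0, 2.0]])
```

In the toy call, the off-diagonal Hessian entry is what makes the surviving parameter move; with a diagonal Hessian the step reduces to simply zeroing the lowest-saliency parameter.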
Keywords :
hidden Markov models; neural nets; quadratic programming; speech recognition; Bayesian information criterion; Lagrange multipliers; constrained optimization problem; equality constraints; minimum description length; minimum message length; model-theoretic research community; optimal brain surgeon; pruning hidden Markov models; pruning neural networks; quadratic programming problem; Bayesian methods; Biological neural networks; Computational efficiency; Constraint optimization; Cost function; Hidden Markov models; Lagrangian functions; Length measurement; Surges; Topology; Hidden Markov model; model pruning; optimal brain surgeon; quadratic programming;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2005.851974
Filename :
1495481