DocumentCode :
754337
Title :
A Constrained Line Search Optimization Method for Discriminative Training of HMMs
Author :
Liu, Peng ; Liu, Cong ; Jiang, Hui ; Soong, Frank ; Wang, Ren Hua
Author_Institution :
Microsoft Res. Asia, Beijing
Volume :
16
Issue :
5
fYear :
2008
fDate :
7/1/2008 12:00:00 AM
Firstpage :
900
Lastpage :
909
Abstract :
In this paper, we propose a novel optimization algorithm called constrained line search (CLS) for discriminative training (DT) of Gaussian mixture continuous density hidden Markov model (CDHMM) in speech recognition. The CLS method is formulated under a general framework for optimizing any discriminative objective functions including maximum mutual information (MMI), minimum classification error (MCE), minimum phone error (MPE)/minimum word error (MWE), etc. In this method, discriminative training of HMM is first cast as a constrained optimization problem, where Kullback-Leibler divergence (KLD) between models is explicitly imposed as a constraint during optimization. Based upon the idea of line search, we show that a simple formula of HMM parameters can be found by constraining the KLD between HMM of two successive iterations in an quadratic form. The proposed CLS method can be applied to optimize all model parameters in Gaussian mixture CDHMMs, including means, covariances, and mixture weights. We have investigated the proposed CLS approach on several benchmark speech recognition databases, including TIDIGITS, Resource Management (RM), and Switchboard. Experimental results show that the new CLS optimization method consistently outperforms the conventional EBW method in both recognition performance and convergence behavior.
Keywords :
hidden Markov models; optimisation; speech recognition; Gaussian mixture; Kullback-Leibler divergence; TIDIGITS; constrained line search; continuous density hidden Markov model; discriminative training; maximum mutual information; minimum classification error; minimum phone error; minimum word error; optimization; resource management; speech recognition; switchboard; Automatic speech recognition; Constraint optimization; Databases; Hidden Markov models; Management training; Mutual information; Optimization methods; Resource management; Speech recognition; Vocabulary; Discriminative training (DT); Kullback–Leibler divergence (KLD); line search; optimization algorithm;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.925882
Filename :
4544825
Link To Document :
بازگشت