Title :
Large margin hidden Markov models for speech recognition
Author :
Jiang, Hui ; Li, Xinwei ; Liu, Chaojun
Author_Institution :
Dept. of Comput. Sci. & Eng., York Univ., Toronto, Ont.
Abstract :
In this paper, motivated by large margin classifiers in machine learning, we propose a novel method to estimate continuous-density hidden Markov model (CDHMM) for speech recognition according to the principle of maximizing the minimum multiclass separation margin. The approach is named large margin HMM. First, we show this type of large margin HMM estimation problem can be formulated as a constrained minimax optimization problem. Second, we propose to solve this constrained minimax optimization problem by using a penalized gradient descent algorithm, where the original objective function, i.e., minimum margin, is approximated by a differentiable function and the constraints are cast as penalty terms in the objective function. The new training method is evaluated in the speaker-independent isolated E-set recognition and the TIDIGITS connected digit string recognition tasks. Experimental results clearly show that the large margin HMMs consistently outperform the conventional HMM training methods. It has been consistently observed that the large margin training method yields significant recognition error rate reduction even on top of some popular discriminative training methods
Keywords :
error statistics; gradient methods; hidden Markov models; learning (artificial intelligence); minimax techniques; speech recognition; TIDIGITS connected digit string recognition task; constrained minimax optimization problem; continuous-density hidden Markov model; differentiable function; discriminative training methods; large margin classifiers; large margin hidden Markov model estimation problem; machine learning; minimum margin; minimum multiclass separation margin; objective function; penalized gradient descent algorithm; penalty terms; recognition error rate reduction; speaker-independent isolated E-set recognition; speech recognition; Automatic speech recognition; Computer science; Constraint optimization; Hidden Markov models; Maximum likelihood estimation; Minimax techniques; Mutual information; Optimization methods; Speech recognition; Training data; Continuous-density hidden Markov models (CDHMMs); gradient descent search; large margin classifiers; minimax optimization; support vector machine;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.879805