Title :
Largemargin training of semi-Markov model for phonetic recognition
Author :
Kim, Sungwoong ; Yun, Sungrack ; Yoo, Chang D.
Author_Institution :
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
Abstract :
This paper considers a large margin training of semi-Markov model (SMM) for phonetic recognition. The SMM framework is better suited for phonetic recognition than the hidden Markov model (HMM) framework in that the SMM framework is capable of simultaneously segmenting the uttered speech into phones and labeling the segment-based features. In this paper, the SMM framework is used to define a discriminant function that is linear in the joint feature map which attempts to capture the long-range statistical dependencies within a segment and between adjacent segments of variable length. The parameters of the discriminant function are estimated by a large margin learning criterion for structured prediction. The parameter estimation problem, which is an optimization problem with many margin constraints, is solved by using a stochastic subgradient descent algorithm. The proposed large margin SMM outperforms the large margin HMM on the TIMIT corpus.
Keywords :
parameter estimation; speech processing; speech recognition; HMM; discriminant function; hidden Markov model; optimization; parameter estimation; phonetic recognition; semi-Markov model; speech segmentation; Automatic speech recognition; Constraint optimization; Hidden Markov models; Intelligent robots; Labeling; Parameter estimation; Probability; Speech recognition; Stochastic processes; Support vector machines; Hidden Markov model; phonetic recognition; semi-Markov model; structured support vector machine;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495329