DocumentCode
2972133
Title
Large-margin feature adaptation for automatic speech recognition
Author
Cheng, Chih-Chieh ; Sha, Fei ; Saul, Lawrence K.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of California, San Diego, CA, USA
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
87
Lastpage
92
Abstract
We consider how to optimize the acoustic features used by hidden Markov models (HMMs) for automatic speech recognition (ASR). We investigate a mistake-driven algorithm that discriminatively reweights the acoustic features in order to separate the log-likelihoods of correct and incorrect transcriptions by a large margin. The algorithm simultaneously optimizes the HMM parameters in the back end by adapting them to the reweighted features computed by the front end. Using an online approach, we incrementally update feature weights and model parameters after the decoding of each training utterance. To mitigate the strongly biased gradients from individual training utterances, we train several different recognizers in parallel while tying the feature transformations in their front ends. We show that this parameter-tying across different recognizers leads to more stable updates and generally fewer recognition errors.
Keywords
decoding; hidden Markov models; speech coding; speech recognition; automatic speech recognition; hidden Markov models; individual training utterances; large-margin feature adaptation; mistake-driven algorithm; training utterance decoding; update feature weights; Acoustical engineering; Automatic speech recognition; Cepstral analysis; Computer science; Feature extraction; Hidden Markov models; Linear discriminant analysis; Maximum likelihood decoding; Pattern recognition; Signal processing algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373320
Filename
5373320
Link To Document