DocumentCode :
2769390
Title :
Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations
Author :
Sainath, Tara N. ; Kanevsky, Dimitri ; Ramabhadran, Bhuvana
Author_Institution :
IBM, Yorktown Heights
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
306
Lastpage :
311
Abstract :
In many pattern recognition tasks, given some input data and a model, a probabilistic likelihood score is often computed to measure how well the model describes the data. Extended Baum-Welch (EBW) transformations are most commonly used as a discriminative technique for estimating parameters of Gaussian mixtures, though recently they have been used to derive a gradient steepness measurement to evaluate the quality of the model to match the distribution of the data. In this paper, we explore applying the EBW gradient steepness metric in the context of Hidden Markov Models (HMMs) for recognition of broad phonetic classes and present a detailed analysis and results on the use of this gradient metric on the TIMIT corpus. We find that our gradient metric is able to outperform the baseline likelihood method, and offers improvements in noisy conditions.
Keywords :
Gaussian processes; gradient methods; hidden Markov models; parameter estimation; probability; speech recognition; Gaussian mixture; extended Baum-Welch transformation; gradient steepness measurement; hidden Markov model; parameter estimation; pattern recognition; phonetic class recognition; probabilistic likelihood score; speech recognition; Bayesian methods; Decoding; Gradient methods; Hidden Markov models; Parameter estimation; Pattern recognition; Signal to noise ratio; Speech recognition; Testing; Viterbi algorithm; Gradient Methods; Hidden Markov Models; Speech Recognition; Viterbi Decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430129
Filename :
4430129
Link To Document :
بازگشت