DocumentCode
2163064
Title
PAC-Bayesian approach for minimization of phoneme error rate
Author
Keshet, Joseph ; McAllester, David ; Hazan, Tamir
Author_Institution
Toyota Technol. Inst. at Chicago, Chicago, IL, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
2224
Lastpage
2227
Abstract
We describe a new approach for phoneme recognition which aims at minimizing the phoneme error rate. Building on structured prediction techniques, we formulate the phoneme recognizer as a linear combination of feature functions. We state a PAC-Bayesian generalization bound, which gives an upper-bound on the expected phoneme error rate in terms of the empirical phoneme error rate. Our algorithm is derived by finding the gradient of the PAC-Bayesian bound and minimizing it by stochastic gradient descent. The resulting algorithm is iterative and easy to implement. Experiments on the TIMIT corpus show that our method achieves the lowest phoneme error rate compared to other discriminative and generative models with the same expressive power.
Keywords
Bayes methods; gradient methods; speech recognition; stochastic processes; PAC-Bayesian generalization bound; TIMIT corpus; feature function; iterative algorithm; phoneme error rate minimization; phoneme recognition; phoneme recognizer; stochastic gradient descent; structured prediction technique; Acoustics; Error analysis; Hidden Markov models; Kernel; Speech; Speech recognition; Training; PAC-Bayesian theorem; discriminative training; kernels; phoneme recognition; structured prediction;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946923
Filename
5946923
Link To Document