DocumentCode :
3526958
Title :
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
Author :
Kingsbury, Brian
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3761
Lastpage :
3764
Abstract :
Acoustic models used in hidden Markov model/neural-network (HMM/NN) speech recognition systems are usually trained with a frame-based cross-entropy error criterion. In contrast, Gaussian mixture HMM systems are discriminatively trained using sequence-based criteria, such as minimum phone error or maximum mutual information, that are more directly related to speech recognition accuracy. This paper demonstrates that neural-network acoustic models can be trained with sequence classification criteria using exactly the same lattice-based methods that have been developed for Gaussian mixture HMMs, and that using a sequence classification criterion in training leads to considerably better performance. A neural network acoustic model with 153K weights trained on 50 hours of broadcast news has a word error rate of 34.0% on the rt04 English broadcast news test set. When this model is trained with the state-level minimum Bayes risk criterion, the rt04 word error rate is 27.7%.
Keywords :
hidden Markov models; neural nets; speech recognition; Gaussian mixture; hidden Markov model; lattice-based optimization; neural-network acoustic modeling; sequence classification criteria; word error rate; Acoustic testing; Backpropagation algorithms; Broadcasting; Error analysis; Hidden Markov models; Lattices; Maximum likelihood estimation; Mutual information; Neural networks; Speech recognition; discriminative training; neural networks; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960445
Filename :
4960445
Link To Document :
بازگشت