DocumentCode :
1686416
Title :
Revisiting hybrid and GMM-HMM system combination techniques
Author :
Swietojanski, Pawel ; Ghoshal, Arnab ; Renals, Steve
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear :
2013
Firstpage :
6744
Lastpage :
6748
Abstract :
In this paper we investigate techniques to combine hybrid HMM-DNN (hidden Markov model - deep neural network) and tandem HMM-GMM (hidden Markov model - Gaussian mixture model) acoustic models using: (1) model averaging, and (2) lattice combination with Minimum Bayes Risk decoding. We have performed experiments on the “TED Talks” task following the protocol of the IWSLT-2012 evaluation. Our experimental results suggest that DNN-based and GMM-based acoustic models are complementary, with error rates being reduced by up to 8% relative when the DNN and GMM systems are combined at model-level in a multi-pass automatic speech recognition (ASR) system. Additionally, further gains were obtained by combining model-averaged lattices with the one obtained from baseline systems.
Keywords :
Bayes methods; hidden Markov models; neural nets; speech recognition; GMM-HMM system combination techniques; Gaussian mixture model; IWSLT-2012 evaluation; TED Talks; acoustic models; baseline systems; deep neural network; error rates; hidden Markov model; hybrid system combination techniques; lattice combination; minimum Bayes risk decoding; model averaging; model-averaged lattices; multipass automatic speech recognition system; tandem HMM-GMM; Acoustics; Hidden Markov models; Lattices; Neural networks; Speech; Speech recognition; Training; TED; deep neural networks; hybrid; system combination; tandem;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638967
Filename :
6638967
Link To Document :
بازگشت