• DocumentCode
    3161842
  • Title

    Revisiting Recurrent Neural Networks for robust ASR

  • Author

    Vinyals, Oriol ; Ravuri, Suman V. ; Povey, Daniel

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4085
  • Lastpage
    4088
  • Abstract
    In this paper, we show how new training principles and optimization techniques for neural networks can be used for different network structures. In particular, we revisit the Recurrent Neural Network (RNN), which explicitly models the Markovian dynamics of a set of observations through a non-linear function with a much larger hidden state space than traditional sequence models such as an HMM. We apply pretraining principles used for Deep Neural Networks (DNNs) and second-order optimization techniques to train an RNN. Moreover, we explore its application in the Aurora2 speech recognition task under mismatched noise conditions using a Tandem approach. We observe top performance on clean speech, and under high noise conditions, compared to multi-layer perceptrons (MLPs) and DNNs, with the added benefit of being a “deeper” model than an MLP but more compact than a DNN.
  • Keywords
    hidden Markov models; neural nets; optimisation; perceptrons; speech recognition; Aurora2 speech recognition task; DNN; HMM; MLP; Markovian dynamics; RNN; clean speech performance; deep neural networks; hidden state space; multilayer perceptrons; network structures; nonlinear function; recurrent neural networks; robust ASR; second-order optimization techniques; sequence models; tandem approach; training principles; Context; Hidden Markov models; Noise; Recurrent neural networks; Speech; Speech recognition; Training; Automatic Speech Recognition; Deep Learning; Recurrent Neural Networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288816
  • Filename
    6288816