• DocumentCode
    2174592
  • Title

    Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR

  • Author

    Vinyals, Oriol ; Ravuri, Suman V.

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4596
  • Lastpage
    4599
  • Abstract
    In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of Deep Belief Networks (DBN) adds any substantial gain over MLPs on the Aurora2 speech recognition task under mismatched noise conditions. Our findings suggest that DBNs outperform single layer MLPs under the clean condition, but the gains diminish as the noise level is increased. Furthermore, using MFCCs in conjunction with the posteriors from DBNs outperforms merely using single DBNs in low to moderate noise conditions. MFCCs, however, do not help for the high noise settings.
  • Keywords
    belief networks; hidden Markov models; multilayer perceptrons; speech recognition; DBN; HMM system; MLP network; belief network tandem feature; deep belief network; mismatched noise conditions; multilayer perceptron; robust ASR; speech recognition; Accuracy; Mel frequency cepstral coefficient; Noise measurement; Signal to noise ratio; Speech recognition; Training; Automatic Speech Recognition; Deep Belief Network; Multilayer Perceptron;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947378
  • Filename
    5947378