• DocumentCode
    179586
  • Title

    Joint training of convolutional and non-convolutional neural networks

  • Author

    Soltau, Hagen ; Saon, George ; Sainath, Tara N.

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    5572
  • Lastpage
    5576
  • Abstract
    We describe a simple modification of neural networks which consists in extending the commonly used linear layer structure to an arbitrary graph structure. This allows us to combine the benefits of convolutional neural networks with the benefits of regular networks. The joint model has only a small increase in parameter size and training and decoding time are virtually unaffected. We report significant improvements over very strong baselines on two LVCSR tasks and one speech activity detection task.
  • Keywords
    convolutional codes; decoding; neural nets; LVCSR tasks; arbitrary graph structure; convolutional neural networks; decoding time; joint training; linear layer structure; nonconvolutional neural networks; regular networks; speech activity detection task; Acoustics; Error analysis; Hidden Markov models; Joints; Neural networks; Speech; Training; Acoustic Modeling; CNN; MLP; Neural Networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854669
  • Filename
    6854669