• DocumentCode
    3152689
  • Title

    Scalable stacking and learning for building deep architectures

  • Author

    Deng, Li ; Yu, Dong ; Platt, John

  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    2133
  • Lastpage
    2136
  • Abstract
    Deep Neural Networks (DNNs) have shown remarkable success in pattern recognition tasks. However, parallelizing DNN training across computers has been difficult. We present the Deep Stacking Network (DSN), which overcomes the problem of parallelizing learning algorithms for deep architectures. The DSN provides a method of stacking simple processing modules in buiding deep architectures, with a convex learning problem in each module. Additional fine tuning further improves the DSN, while introducing minor non-convexity. Full learning in the DSN is batch-mode, making it amenable to parallel training over many machines and thus be scalable over the potentially huge size of the training data. Experimental results on both the MNIST (image) and TIMIT (speech) classification tasks demonstrate that the DSN learning algorithm developed in this work is not only parallelizable in implementation but it also attains higher classification accuracy than the DNN.
  • Keywords
    convex programming; learning (artificial intelligence); neural net architecture; pattern classification; MNIST classification task; TIMIT classification task; classification accuracy; convex learning problem; deep architecture; deep neural network training parallelization; deep stacking network learning algorithm; image classification task; learning algorithm parallelization; pattern recognition task; scalable stacking; speech classification task; Computer architecture; Error analysis; Speech; Stacking; Training; Tuning; Vectors; DNN; DSN; convexity; deep learning; stacking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288333
  • Filename
    6288333