• DocumentCode
    2791278
  • Title

    Parallel implementation of artificial neural network training

  • Author

    Scanzio, Stefano ; Cumani, Sandro ; Gemello, Roberto ; Mana, Franco ; Laface, P.

  • Author_Institution
    Politec. di Torino, Turin, Italy
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4902
  • Lastpage
    4905
  • Abstract
    In this paper we describe the implementation of a complete ANN training procedure for speech recognition using the block mode back-propagation learning algorithm. We exploit the high performance SIMD architecture of GPU using CUDA and its C-like language interface. We also compare the speed-up obtained implementing the training procedure only taking advantage of the multi-thread capabilities of multi-core processors. Our approach has been tested by training acoustic models for large vocabulary speech recognition tasks, showing a 6 times reduction of the time required to train real-world large size networks with respect to an already optimized implementation using the Intel MKL libraries.
  • Keywords
    learning (artificial intelligence); neural nets; speech recognition; CUDA; GPU; SIMD architecture; artificial neural network training; block mode back-propagation learning algorithm; speech recognition; Acoustic testing; Artificial neural networks; Feedforward systems; Hidden Markov models; Libraries; Matrix converters; Multicore processing; Speech recognition; State estimation; Vocabulary; Artificial Neural Network; CUDA; Fast Training; Focused Attention Back-Propagation; GPU;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495108
  • Filename
    5495108