Title :
Parallel implementation of artificial neural network training
Author :
Scanzio, Stefano ; Cumani, Sandro ; Gemello, Roberto ; Mana, Franco ; Laface, P.
Author_Institution :
Politec. di Torino, Turin, Italy
Abstract :
In this paper we describe the implementation of a complete ANN training procedure for speech recognition using the block mode back-propagation learning algorithm. We exploit the high performance SIMD architecture of GPU using CUDA and its C-like language interface. We also compare the speed-up obtained implementing the training procedure only taking advantage of the multi-thread capabilities of multi-core processors. Our approach has been tested by training acoustic models for large vocabulary speech recognition tasks, showing a 6 times reduction of the time required to train real-world large size networks with respect to an already optimized implementation using the Intel MKL libraries.
Keywords :
learning (artificial intelligence); neural nets; speech recognition; CUDA; GPU; SIMD architecture; artificial neural network training; block mode back-propagation learning algorithm; speech recognition; Acoustic testing; Artificial neural networks; Feedforward systems; Hidden Markov models; Libraries; Matrix converters; Multicore processing; Speech recognition; State estimation; Vocabulary; Artificial Neural Network; CUDA; Fast Training; Focused Attention Back-Propagation; GPU;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495108