Title :
Fast ICA on Modern GPU Architectures
Author :
Plauth, Max ; Feinbube, Frank ; Troger, Peter ; Polze, Andreas
Author_Institution :
Hasso Plattner Inst., Univ. of Potsdam, Potsdam, Germany
Abstract :
Blind Signal Separation is an algorithmic problem class that deals with the restoration of original signal data from a signal mixture. Implementations, such as Fast ICA, are optimized for parallelization on CPU or first-generation GPU hardware. With the advent of modern, compute centered GPU hardware with powerful features such as dynamic parallelism support, these solutions no longer leverage the available hardware performance in the best-possible way. We present an optimized implementation of the FastICA algorithm, which is specifically tailored for next-generation GPU architectures such as Nvidia Kepler. Our proposal achieves a two digit factor of speedup in the prototype implementation, compared to a multithreaded CPU implementation. Our custom matrix multiplication kernels, tailored specifically for the use case, contribute to the speedup by delivering better performance than the state-of-the-art CUBLAS library.
Keywords :
audio signal processing; blind source separation; graphics processing units; independent component analysis; parallel algorithms; parallel architectures; signal restoration; CPU parallelization; CUBLAS library; Nvidia Kepler; audio signal processing; blind signal separation; digit factor; fastICA algorithm; first-generation GPU hardware; matrix multiplication kernels; modern GPU architectures; multithreaded CPU; next-generation GPU architectures; signal data restoration; signal mixture; Computer architecture; Covariance matrices; Graphics processing units; Hardware; Kernel; Matrix decomposition; Signal processing algorithms; CUDA; GPU compute devices; audio signal processing; blind signal separation; independent component analysis; parallel processing;
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2014 15th International Conference on
DOI :
10.1109/PDCAT.2014.19