Title :
Sparse Fast Fourier Transform on GPUs and Multi-core CPUs
Author :
Hu, Jiaxi ; Wang, Zhaosen ; Qiu, Qiyuan ; Xiao, Weijun ; Lilja, David J.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Minnesota, Minneapolis, MN, USA
Abstract :
Given an N-point sequence, finding its k largest components in the frequency domain is a problem of great interest. This problem, which is usually referred to as a sparse Fourier Transform, was recently brought back on stage by a newly proposed algorithm called the sFFT. In this paper, we present a parallel implementation of sFFT on both multi-core CPUs and GPUs using a human voice signal as a case study. Using this example, an estimate of k for the 3dB cutoff points was conducted through concrete experiments. In addition, three optimization strategies are presented in this paper. We demonstrate that the multi-core-based sFFT achieves speedups of up to three times a single-threaded sFFT while a GPU-based version achieves up to ten times speedup. For large scale cases, the GPU-based sFFT also shows its considerable advantages, which is about 40 times speedup compared to the latest out-of-card FFT implementations [2].
Keywords :
fast Fourier transforms; frequency-domain analysis; graphics processing units; multiprocessing systems; speech processing; GPU; N-point sequence; frequency domain; human voice signal; multicore CPU; optimization strategies; single-threaded sFFT; sparse fast Fourier transform; Estimation; Frequency domain analysis; Graphics processing units; Human voice; Instruction sets; Kernel; Libraries; GPUs; Multi-core CPUs; Sparse FFT; performance speedup;
Conference_Titel :
Computer Architecture and High Performance Computing (SBAC-PAD), 2012 IEEE 24th International Symposium on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4673-4790-7
DOI :
10.1109/SBAC-PAD.2012.34