Title :
Design and Implementation of Parallel FFT on CUDA
Author :
Xueqin Zhang ; Kai Shen ; Chengguang Xu ; Kaifang Wang
Author_Institution :
Sch. of Electron. Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
Fast Fourier Transform (FFT) algorithm has an important role in the image processing and scientific computing, and it´s a highly parallel divide-and-conquer algorithm. In this paper, we exploited the Compute Unified Device Architecture CUDA technology and contemporary graphics processing units (GPUs) to achieve higher performance. We focused on two aspects to optimize the ordinary FFT algorithm, multi-threaded parallelism and memory hierarchy. We also proposed parallelism optimization strategies when the data volume occurs and predicted the possible situation when the amount of data increased further.it can be seen from the results that Parallel FFT algorithm is more efficient than the ordinary FFT algorithm.
Keywords :
fast Fourier transforms; graphics processing units; optimisation; parallel algorithms; parallel architectures; CUDA technology; Compute Unified Device Architecture; GPUs; fast Fourier transform; graphics processing units; parallel FFT algorithm; parallelism optimization; Algorithm design and analysis; Discrete Fourier transforms; Educational institutions; Graphics processing units; Instruction sets; Optimization; Parallel processing; CUDA technology; memory hierarchy; thread/thread block;
Conference_Titel :
Dependable, Autonomic and Secure Computing (DASC), 2013 IEEE 11th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4799-3380-8
DOI :
10.1109/DASC.2013.130