Title :
Architecture independent short vector FFTs
Author :
Franchetti, Franz ; Karner, Herbert ; Kral, Stefan ; Ueberhuber, Christoph K.
Author_Institution :
Dept. of Appl. & Numerical Math., Tech. Univ. of Vienna, Austria
Abstract :
This paper introduces an SIMD vectorization for FFTW-the "fastest Fourier transform in the west" proposed by Frigo and Johnson (see Proceedings of the ACM SIGPLAN \´99 , p.169-180, 1999). The new method leads to an architecture independent short vector SIMD FFT vectorization that utilizes the architecture adaptivity of FFTW. It is based on special FFT kernels (up to size 64 and more) that are utilized by FFTW to compute the whole transform. This vectorization supports all features of complex transforms in FFTW (arbitrary size, dimension and stride of the data vector; in-place and out-of-place transforms) and is fully transparent to the user. It is suitable for arbitrary vector sizes of the underlying hardware
Keywords :
fast Fourier transforms; microprocessor chips; parallel architectures; FFT kernels; SIMD FFT vectorization; architecture independent short vector FFT; complex transforms; data vector; fastest Fourier transform; general purpose microprocessors; in-place transforms; out-of-place transforms; vector size; Computer architecture; Flexible printed circuits; Fourier transforms; Instruction sets; Kernel; Mathematics; Microprocessors; Parallel processing; Registers; Runtime;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941115