Title :
Faster FFTs via architecture-cognizance
Author :
Gatlin, Kang Su ; Carter, Larry
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
The Fast Fourier Transform (FFT) is one of the most important algorithms in computational science, accounting for large amounts of computing time. One major problem with modern FFT implementations is that they poorly scale to large problem. As the problem size increases, stride and associativity effects play a larger role. The result is a severe drop-off in performance. We use architecture-cognizance, a method for exploiting the interaction between architecture, compiler, and algorithm, to create a more scalable FFT package based on FFTW. Experiments validate our approach on four architectures: two generations of HPs (PA-8000 and 8500), an IBM POWER2, and a DEC Alpha 21164a. Performance increases of up to 65% are obtained
Keywords :
fast Fourier transforms; optimising compilers; FFTs; ILP; TLB; architecture-cognizance; associativity; cache; compiler optimization; divide-and-conquer; feedback; memory hierarchy; registers; runtime systems; Computer architecture; Computer science; Fast Fourier transforms; Feedback; Flexible printed circuits; Hafnium; Optimizing compilers; Packaging; Power generation; Registers;
Conference_Titel :
Parallel Architectures and Compilation Techniques, 2000. Proceedings. International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7695-0622-4
DOI :
10.1109/PACT.2000.888349