DocumentCode :
2686163
Title :
Faster FFTs via architecture-cognizance
Author :
Gatlin, Kang Su ; Carter, Larry
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
fYear :
2000
fDate :
2000
Firstpage :
249
Lastpage :
260
Abstract :
The Fast Fourier Transform (FFT) is one of the most important algorithms in computational science, accounting for large amounts of computing time. One major problem with modern FFT implementations is that they poorly scale to large problem. As the problem size increases, stride and associativity effects play a larger role. The result is a severe drop-off in performance. We use architecture-cognizance, a method for exploiting the interaction between architecture, compiler, and algorithm, to create a more scalable FFT package based on FFTW. Experiments validate our approach on four architectures: two generations of HPs (PA-8000 and 8500), an IBM POWER2, and a DEC Alpha 21164a. Performance increases of up to 65% are obtained
Keywords :
fast Fourier transforms; optimising compilers; FFTs; ILP; TLB; architecture-cognizance; associativity; cache; compiler optimization; divide-and-conquer; feedback; memory hierarchy; registers; runtime systems; Computer architecture; Computer science; Fast Fourier transforms; Feedback; Flexible printed circuits; Hafnium; Optimizing compilers; Packaging; Power generation; Registers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures and Compilation Techniques, 2000. Proceedings. International Conference on
Conference_Location :
Philadelphia, PA
ISSN :
1089-795X
Print_ISBN :
0-7695-0622-4
Type :
conf
DOI :
10.1109/PACT.2000.888349
Filename :
888349
Link To Document :
بازگشت