DocumentCode
2686163
Title
Faster FFTs via architecture-cognizance
Author
Gatlin, Kang Su ; Carter, Larry
Author_Institution
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
fYear
2000
fDate
2000
Firstpage
249
Lastpage
260
Abstract
The Fast Fourier Transform (FFT) is one of the most important algorithms in computational science, accounting for large amounts of computing time. One major problem with modern FFT implementations is that they poorly scale to large problem. As the problem size increases, stride and associativity effects play a larger role. The result is a severe drop-off in performance. We use architecture-cognizance, a method for exploiting the interaction between architecture, compiler, and algorithm, to create a more scalable FFT package based on FFTW. Experiments validate our approach on four architectures: two generations of HPs (PA-8000 and 8500), an IBM POWER2, and a DEC Alpha 21164a. Performance increases of up to 65% are obtained
Keywords
fast Fourier transforms; optimising compilers; FFTs; ILP; TLB; architecture-cognizance; associativity; cache; compiler optimization; divide-and-conquer; feedback; memory hierarchy; registers; runtime systems; Computer architecture; Computer science; Fast Fourier transforms; Feedback; Flexible printed circuits; Hafnium; Optimizing compilers; Packaging; Power generation; Registers;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architectures and Compilation Techniques, 2000. Proceedings. International Conference on
Conference_Location
Philadelphia, PA
ISSN
1089-795X
Print_ISBN
0-7695-0622-4
Type
conf
DOI
10.1109/PACT.2000.888349
Filename
888349
Link To Document