• DocumentCode
    2686163
  • Title

    Faster FFTs via architecture-cognizance

  • Author

    Gatlin, Kang Su ; Carter, Larry

  • Author_Institution
    Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    249
  • Lastpage
    260
  • Abstract
    The Fast Fourier Transform (FFT) is one of the most important algorithms in computational science, accounting for large amounts of computing time. One major problem with modern FFT implementations is that they poorly scale to large problem. As the problem size increases, stride and associativity effects play a larger role. The result is a severe drop-off in performance. We use architecture-cognizance, a method for exploiting the interaction between architecture, compiler, and algorithm, to create a more scalable FFT package based on FFTW. Experiments validate our approach on four architectures: two generations of HPs (PA-8000 and 8500), an IBM POWER2, and a DEC Alpha 21164a. Performance increases of up to 65% are obtained
  • Keywords
    fast Fourier transforms; optimising compilers; FFTs; ILP; TLB; architecture-cognizance; associativity; cache; compiler optimization; divide-and-conquer; feedback; memory hierarchy; registers; runtime systems; Computer architecture; Computer science; Fast Fourier transforms; Feedback; Flexible printed circuits; Hafnium; Optimizing compilers; Packaging; Power generation; Registers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Architectures and Compilation Techniques, 2000. Proceedings. International Conference on
  • Conference_Location
    Philadelphia, PA
  • ISSN
    1089-795X
  • Print_ISBN
    0-7695-0622-4
  • Type

    conf

  • DOI
    10.1109/PACT.2000.888349
  • Filename
    888349