• DocumentCode
    3426359
  • Title

    FFT Program Generation for Shared Memory: SMP and Multicore

  • Author

    Franchetti, Franz ; Voronenko, Yevgen ; Puschel, Markus

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA
  • fYear
    2006
  • fDate
    Nov. 2006
  • Firstpage
    51
  • Lastpage
    51
  • Abstract
    The chip maker´s response to the approaching end of CPU frequency scaling are multicore systems, which offer the same programming paradigm as traditional shared memory platforms but have different performance characteristics. This situation considerably increases the burden on library developers and strengthens the case for automatic performance tuning frameworks like Spiral, a program generator and optimizer for linear transforms such as the discrete Fourier transform (DFT). We present a shared memory extension of Spiral. The extension within Spiral consists of a rewriting system that manipulates the structure of transform algorithms to achieve load balancing and avoids false sharing, and of a backend to generate multithreaded code. Application to the DFT produces a novel class of algorithms suitable for multicore systems as validated by experimental results: we demonstrate a parallelization speed-up already for sizes that fit into L1 cache and compare favorably to other DFT libraries across all small and midsize DFTs and considered platforms
  • Keywords
    automatic programming; discrete Fourier transforms; mathematics computing; microprocessor chips; parallelising compilers; resource allocation; rewriting systems; shared memory systems; DFT; FFT program generation; SMP; Spiral program generator; automatic performance tuning frameworks; chip multiprocessor; discrete Fourier transform; linear transforms; load balancing; multicore systems; multithreaded code generation; rewriting system; shared memory system; Automatic programming; Discrete Fourier transforms; Discrete transforms; Fourier transforms; Frequency; Libraries; Load management; Multicore processing; Spirals; Tuning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SC 2006 Conference, Proceedings of the ACM/IEEE
  • Conference_Location
    Tampa, FL
  • Print_ISBN
    0-7695-2700-0
  • Electronic_ISBN
    0-7695-2700-0
  • Type

    conf

  • DOI
    10.1109/SC.2006.31
  • Filename
    4090225