• DocumentCode
    2449169
  • Title

    Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory

  • Author

    Wittmann, Markus ; Hager, Georg ; Wellein, Gerhard

  • Author_Institution
    Erlangen Regional Comput. Center, Univ. of Erlangen-Nuremberg, Erlangen, Germany
  • fYear
    2010
  • fDate
    19-23 April 2010
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    New algorithms and optimization techniques are needed to balance the accelerating trend towards bandwidth-starved multicore chips. It is well known that the performance of stencil codes can be improved by temporal blocking, lessening the pressure on the memory interface. We introduce a new pipelined approach that makes explicit use of shared caches in multicore environments and minimizes synchronization and boundary overhead. For clusters of shared-memory nodes we demonstrate how temporal blocking can be employed successfully in a hybrid shared/distributed-memory environment.
  • Keywords
    cache storage; codes; distributed memory systems; pipeline processing; shared memory systems; bandwidth-starved multicore chips; distributed memory; hybrid shared-distributed-memory environment; memory interface; multicore environments; multicore-aware parallel temporal blocking; optimization techniques; pipelined approach; shared caches; shared memory; stencil codes; Acceleration; Bandwidth; Concurrent computing; Distributed computing; Jacobian matrices; Lattices; Multicore processing; Sockets; Testing; Waste materials; multi-core; multi-halo exchange; shared caches; temporal blocking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4244-6533-0
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2010.5470813
  • Filename
    5470813