Title :
Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory
Author :
Wittmann, Markus ; Hager, Georg ; Wellein, Gerhard
Author_Institution :
Erlangen Regional Comput. Center, Univ. of Erlangen-Nuremberg, Erlangen, Germany
Abstract :
New algorithms and optimization techniques are needed to balance the accelerating trend towards bandwidth-starved multicore chips. It is well known that the performance of stencil codes can be improved by temporal blocking, lessening the pressure on the memory interface. We introduce a new pipelined approach that makes explicit use of shared caches in multicore environments and minimizes synchronization and boundary overhead. For clusters of shared-memory nodes we demonstrate how temporal blocking can be employed successfully in a hybrid shared/distributed-memory environment.
Keywords :
cache storage; codes; distributed memory systems; pipeline processing; shared memory systems; bandwidth-starved multicore chips; distributed memory; hybrid shared-distributed-memory environment; memory interface; multicore environments; multicore-aware parallel temporal blocking; optimization techniques; pipelined approach; shared caches; shared memory; stencil codes; Acceleration; Bandwidth; Concurrent computing; Distributed computing; Jacobian matrices; Lattices; Multicore processing; Sockets; Testing; Waste materials; multi-core; multi-halo exchange; shared caches; temporal blocking;
Conference_Titel :
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6533-0
DOI :
10.1109/IPDPSW.2010.5470813