DocumentCode :
3704259
Title :
The Loop-of-Stencil-Reduce Paradigm
Author :
Marco Aldinucci;Marco Danelutto;Maurizio Drocco;Peter Kilpatrick;Guilherme Peretti Pezzi;Massimo Torquati
Author_Institution :
Comput. Sci. Dept., Univ. of Turin, Turin, Italy
Volume :
3
fYear :
2015
Firstpage :
172
Lastpage :
177
Abstract :
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel programming of heterogeneous platforms (multicore+GPUs). Loop-of-Stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, and, crucially, their usage in a loop. It transparently targets (by using OpenCL) combinations of CPU cores and GPUs, and it makes it possible to simplify the deployment of a single stencil computation kernel on different GPUs. The paper discusses the implementation of Loop-of-stencil-reduce within the FastFlow parallel framework, considering a simple iterative data-parallel application as running example (Game of Life) and a highly effective parallel filter for visual data restoration to assess performance. Thanks to the high-level design of the Loop-of-stencil-reduce, it was possible to run the filter seamlessly on a multicore machine, on multi-GPUs, and on both.
Keywords :
"Graphics processing units","Kernel","Skeleton","Games","Parallel programming"
Publisher :
ieee
Conference_Titel :
Trustcom/BigDataSE/ISPA, 2015 IEEE
Type :
conf
DOI :
10.1109/Trustcom.2015.628
Filename :
7345644
Link To Document :
بازگشت