DocumentCode :
1960942
Title :
ELMO: A User-Friendly API to Enable Local Memory in OpenCL Kernels
Author :
Jianbin Fang; Ana Lucia Varbanescu; Jie Shen; Henk Sips
Author_Institution :
Parallel and Distributed Systems Group, Delft University of Technology, Delft, Netherlands
fYear :
2013
fDate :
Feb. 27, 2013 - March 1, 2013
Firstpage :
375
Lastpage :
383
Abstract :
Recent parallel architectures are equipped with local memory, which simplifies hardware design at the cost of increased program complexity, because programmers must manage it explicitly. To ease this extra burden, we introduce ELMO, an easy-to-use API that improves productivity while preserving the high performance of local memory operations. Specifically, ELMO is a generic API that covers different local memory use-cases. We also present prototype implementations of these APIs and apply multiple GPU-inspired optimizations to maximize their performance. Experimental results on the NVIDIA Quadro5000 GPU show that using ELMO on native implementations significantly improves performance: the achieved speedup ranges from 1.3x to 3.7x. Furthermore, with ELMO we still achieve performance comparable to (if not better than) that of hand-tuned applications, while the code is shorter, clearer, and safer.
Keywords :
application program interfaces; graphics processing units; multiprocessing systems; operating system kernels; parallel architectures; performance evaluation; storage management; ELMO; NVIDIA Quadro5000 GPU; OpenCL kernels; explicit management; generic API; local memory operations; local memory use-cases; manycore processors; multicore processors; multiple GPU-inspired optimizations; parallel architectures; performance maximization; program complexity; Bandwidth; Geometry; Indexes; Kernel; Memory management; Optimization; Registers; API; GPUs; Local Memory; OpenCL;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on
Conference_Location :
Belfast
ISSN :
1066-6192
Print_ISBN :
978-1-4673-5321-2
Electronic_ISBN :
1066-6192
Type :
conf
DOI :
10.1109/PDP.2013.61
Filename :
6498578