DocumentCode :
2958030
Title :
Heterogeneous Task Scheduling for Accelerated OpenMP
Author :
Scogland, Thomas R W ; Rountree, Barry ; Feng, Wu-chun ; De Supinski, Bronis R.
Author_Institution :
Dept. of Comput. Sci., Virginia Tech, Blacksburg, VA, USA
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
144
Lastpage :
155
Abstract :
Heterogeneous systems with CPUs and computational accelerators such as GPUs, FPGAs or the upcoming Intel MIC are becoming mainstream. In these systems, peak performance includes the performance of not just the CPUs but also all available accelerators. In spite of this fact, the majority of programming models for heterogeneous computing focus on only one of these. With the development of Accelerated Open MP for GPUs, both from PGI and Cray, we have a clear path to extend traditional Open MP applications incrementally to use GPUs. The extensions are geared toward switching from CPU parallelism to GPU parallelism. However they do not preserve the former while adding the latter. Thus computational potential is wasted since either the CPU cores or the GPU cores are left idle. Our goal is to create a runtime system that can intelligently divide an accelerated Open MP region across all available resources automatically. This paper presents our proof-of-concept runtime system for dynamic task scheduling across CPUs and GPUs. Further, we motivate the addition of this system into the proposed Open MP for Accelerators standard. Finally, we show that this option can produce as much as a two-fold performance improvement over using either the CPU or GPU alone.
Keywords :
graphics processing units; multiprocessing systems; performance evaluation; scheduling; CPU; Cray; FPGA; GPU; Intel MIC; PGI; accelerated Open MP; computational accelerators; heterogeneous computing; heterogeneous task scheduling; Acceleration; Dynamic scheduling; Graphics processing unit; Integrated circuits; Programming; Runtime; Schedules; GPGPU; OpenMP; Programming models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-4673-0975-2
Type :
conf
DOI :
10.1109/IPDPS.2012.23
Filename :
6267831
Link To Document :
بازگشت