DocumentCode :
1954847
Title :
A study of Persistent Threads style GPU programming for GPGPU workloads
Author :
Gupta, Kunal ; Stuart, J.A. ; Owens, John D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, Davis, CA, USA
fYear :
2012
fDate :
13-14 May 2012
Firstpage :
1
Lastpage :
14
Abstract :
In this paper, we characterize and analyze an increasingly popular style of programming for the GPU called Persistent Threads (PT). We present a concise formal definition for this programming style, and discuss the difference between the traditional GPU programming style (nonPT) and PT, why PT is attractive for some high-performance usage scenarios, and when using PT may or may not be appropriate. We identify limitations of the nonPT style and four primary use cases that PT could help address: CPU-GPU synchronization, load balancing/irregular parallelism, producer-consumer locality, and global synchronization. Through micro-kernel benchmarks, we show that the PT approach can achieve up to an order-of-magnitude speedup over nonPT kernels, but can also result in performance loss in many cases. We conclude by discussing the hardware and software fundamentals that will influence the development of Persistent Threads as a programming style in future systems.
Keywords :
graphics processing units; parallel processing; programming; resource allocation; CPU-GPU synchronization; GPGPU workloads; global synchronization; irregular parallelism; load balancing; micro-kernel benchmarks; nonPT style; persistent threads style GPU programming; producer-consumer locality; Abstracts; Argon; Computer aided software engineering; Computers; Graphics processing unit; Programming; Switches;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Innovative Parallel Computing (InPar), 2012
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4673-2632-2
Electronic_ISBN :
978-1-4673-2631-5
Type :
conf
DOI :
10.1109/InPar.2012.6339596
Filename :
6339596