DocumentCode
1954847
Title
A study of Persistent Threads style GPU programming for GPGPU workloads
Author
Gupta, Kunal ; Stuart, J.A. ; Owens, John D.
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, Davis, CA, USA
fYear
2012
fDate
13-14 May 2012
Firstpage
1
Lastpage
14
Abstract
In this paper, we characterize and analyze an increasingly popular style of programming for the GPU called Persistent Threads (PT). We present a concise formal definition for this programming style, and discuss the difference between the traditional GPU programming style (nonPT) and PT, why PT is attractive for some high-performance usage scenarios, and when using PT may or may not be appropriate. We identify limitations of the nonPT style and identify four primary use cases it could be useful in addressing-CPU-GPU synchronization, load balancing/irregular parallelism, producer-consumer locality, and global synchronization. Through micro-kernel benchmarks we show the PT approach can achieve up to an order-of-magnitude speedup over nonPT kernels, but can also result in performance loss in many cases. We conclude by discussing the hardware and software fundamentals that will influence the development of Persistent Threads as a programming style in future systems.
Keywords
graphics processing units; parallel processing; programming; resource allocation; CPU-GPU synchronization; GPGPU workloads; global synchronization; irregular parallelism; load balancing; micro-kernel benchmarks; nonPT style; persistent threads style GPU programming; producer-consumer locality; Abstracts; Argon; Computer aided software engineering; Computers; Graphics processing unit; Programming; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovative Parallel Computing (InPar), 2012
Conference_Location
San Jose, CA
Print_ISBN
978-1-4673-2632-2
Electronic_ISBN
978-1-4673-2631-5
Type
conf
DOI
10.1109/InPar.2012.6339596
Filename
6339596
Link To Document