• DocumentCode
    1954847
  • Title

    A study of Persistent Threads style GPU programming for GPGPU workloads

  • Author

    Gupta, Kunal ; Stuart, J.A. ; Owens, John D.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of California, Davis, Davis, CA, USA
  • fYear
    2012
  • fDate
    13-14 May 2012
  • Firstpage
    1
  • Lastpage
    14
  • Abstract
    In this paper, we characterize and analyze an increasingly popular style of programming for the GPU called Persistent Threads (PT). We present a concise formal definition for this programming style, and discuss the difference between the traditional GPU programming style (nonPT) and PT, why PT is attractive for some high-performance usage scenarios, and when using PT may or may not be appropriate. We identify limitations of the nonPT style and identify four primary use cases it could be useful in addressing-CPU-GPU synchronization, load balancing/irregular parallelism, producer-consumer locality, and global synchronization. Through micro-kernel benchmarks we show the PT approach can achieve up to an order-of-magnitude speedup over nonPT kernels, but can also result in performance loss in many cases. We conclude by discussing the hardware and software fundamentals that will influence the development of Persistent Threads as a programming style in future systems.
  • Keywords
    graphics processing units; parallel processing; programming; resource allocation; CPU-GPU synchronization; GPGPU workloads; global synchronization; irregular parallelism; load balancing; micro-kernel benchmarks; nonPT style; persistent threads style GPU programming; producer-consumer locality; Abstracts; Argon; Computer aided software engineering; Computers; Graphics processing unit; Programming; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Parallel Computing (InPar), 2012
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4673-2632-2
  • Electronic_ISBN
    978-1-4673-2631-5
  • Type

    conf

  • DOI
    10.1109/InPar.2012.6339596
  • Filename
    6339596