• DocumentCode
    2481739
  • Title

    Dynamic warp resizing: Analysis and benefits in high-performance SIMT

  • Author

    Lashgar, Ahmad ; Baniasadi, Amirali ; Khonsari, Ahmad

  • fYear
    2012
  • fDate
    Sept. 30 2012-Oct. 3 2012
  • Firstpage
    502
  • Lastpage
    503
  • Abstract
    Modern GPUs synchronize threads grouped in warps. The number of threads included in each warp (or warp size) affects divergence, synchronization overhead, and the efficiency of memory access coalescing. Small warps reduce the performance penalty associated with branch and memory divergence at the expense of a reduction in memory coalescing. Large warps enhance memory coalescing significantly but also increase branch and memory divergence. Dynamic workload behavior, including branch/memory divergence and coalescing, is an important factor in determining the warp size returning best performance. Based on this observation, we propose Dynamic Warp Resizing (DWR). DWR outperforms static warp size decisions, up to 2.28X.
  • Keywords
    graphics processing units; synchronisation; GPU; dynamic warp resizing; high-performance SIMT; synchronization; Benchmark testing; Computer architecture; Educational institutions; Graphics processing units; Hardware; Instruction sets; Synchronization; Branch divergence; GPU architecture; Memory access coalescing; Performance; Warp size;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design (ICCD), 2012 IEEE 30th International Conference on
  • Conference_Location
    Montreal, QC
  • ISSN
    1063-6404
  • Print_ISBN
    978-1-4673-3051-0
  • Type

    conf

  • DOI
    10.1109/ICCD.2012.6378694
  • Filename
    6378694