• DocumentCode
    1830617
  • Title

    Improve GPGPU Latency Hiding with a Hybrid Recovery Stack and a Window Based Warp Scheduling Policy

  • Author

    Chen, Tianzhou ; Tang, Xingsheng ; Yu, Licheng ; Ma, Jianliang ; Wu, Minghui

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
  • fYear
    2012
  • fDate
    25-27 June 2012
  • Firstpage
    1288
  • Lastpage
    1293
  • Abstract
    Branch divergence usually has a serious impact on the efficiency of a SIMD pipeline. Dynamic Warp Subdivision's branch method, however, exploits branch divergence to hide memory latency by interleaving issue among all branch paths of a warp, but it can suffer from a serious over-subdivision problem. We therefore propose a hybrid stack mechanism that lets the PDOM stack issue any ready sub-warp without losing the stack's logical structure. To maximize the hybrid stack's potential, we also propose a window-based scheduling policy that reinforces memory latency hiding. Experimental results show that, on our 7 selected benchmark programs, combining the window-based scheduling policy with the hybrid stack hardware improves performance by 10% over the baseline configuration using PDOM with loose round-robin scheduling, and by 6.8% over DWS-PC with our window-based scheduling policy.
  • Keywords
    data encapsulation; graphics processing units; parallel architectures; pipeline processing; processor scheduling; tree searching; DWS-PC; GPGPU; PDOM stack; SIMD; baseline configuration; branch divergence phenomenon; dynamic warp subdivision branch method; hybrid recovery stack; logical structure; memory latency hiding; oversubdivision problem; pipeline efficiency; round-robin method; window based warp scheduling policy; Benchmark testing; Dynamic scheduling; Graphics processing unit; Hardware; Pipelines; Processor scheduling; Switches; Architecture; GPGPU; Performance; Scheduling;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
  • Conference_Location
    Liverpool
  • Print_ISBN
    978-1-4673-2164-8
  • Type

    conf

  • DOI
    10.1109/HPCC.2012.190
  • Filename
    6332326