• DocumentCode
    625609
  • Title

    Optimizing and Auto-Tuning Iterative Stencil Loops for GPUs with the In-Plane Method

  • Author

    Wai Teng Tang ; Wen Jun Tan ; Krishnamoorthy, R. ; Yi Wen Wong ; Shyh-hao Kuo ; Goh, Rick Siow Mong ; Turner, Stephen John ; Weng-Fai Wong

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    452
  • Lastpage
    462
  • Abstract
    Stencils represent an important class of computations that are used in many scientific disciplines. Increasingly, many of the stencil computations in scientific applications are being offloaded to GPUs to improve running times. Since a large part of the simulation time is spent inside the stencil kernels, optimizing the kernel is therefore important in the context of achieving greater computation efficiencies and reducing simulation time. In this work, we proposed a novel in-plane method for stencil computations on GPUs and compared its performance with the conventional method implemented in the Nvidia SDK. We also implemented an auto-tuning framework for our method to select the optimal parameters for different GPU architectures. A performance model was developed for our proposed method, and is used to speed up the auto-tuning process. Our results show that a speedup of nearly 2× can be achieved compared to Nvidia´s implementation.
  • Keywords
    graphics processing units; natural sciences computing; optimisation; parallel architectures; GPU architectures; autotuning process framework; in-plane method; iterative stencil loop autotuning; optimal parameters; scientific applications; stencil computations; stencil kernel optimization; Bandwidth; Computer architecture; Graphics processing units; Instruction sets; Kernel; Loading; Registers; GPU; auto-tuning; stencil;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.79
  • Filename
    6569833