• DocumentCode
    2482604
  • Title

    A cross-input adaptive framework for GPU program optimizations

  • Author

    Liu, Yixun ; Zhang, Eddy Z. ; Shen, Xipeng

  • Author_Institution
    Comput. Sci. Dept., Coll. of William & Mary, Williamsburg, VA, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Recent years have seen a trend in using graphic processing units (GPU) as accelerators for general-purpose computing. The inexpensive, single-chip, massively parallel architecture of GPU has evidentially brought factors of speedup to many numerical applications. However, the development of a high-quality GPU application is challenging, due to the large optimization space and complex unpredictable effects of optimizations on GPU program performance. Recently, several studies have attempted to use empirical search to help the optimization. Although those studies have shown promising results, one important factor-program inputs-in the optimization has remained unexplored. In this work, we initiate the exploration in this new dimension. By conducting a series of measurement, we find that the ability to adapt to program inputs is important for some applications to achieve their best performance on GPU. In light of the findings, we develop an input-adaptive optimization framework, namely G-ADAPT, to address the influence by constructing cross-input predictive models for automatically predicting the (near-)optimal configurations for an arbitrary input to a GPU program. The results demonstrate the promise of the framework in serving as a tool to alleviate the productivity bottleneck in GPU programming.
  • Keywords
    optimising compilers; parallel architectures; parallel programming; G-ADAPT; GPU program optimizations; GPU program performance; GPU programming; cross-input adaptive framework; cross-input predictive models; general-purpose computing; graphic processing units; input-adaptive optimization framework; parallel architecture; productivity bottleneck; single-chip architecture; Application software; Computer architecture; Computer graphics; Computer science; Educational institutions; Parallel architectures; Predictive models; Productivity; Program processors; Yarn; CUDA; Cross-input Adaptation; Empirical Search; G-ADAPT; GPU; Program Optimizations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5160988
  • Filename
    5160988