Title :
A cross-input adaptive framework for GPU program optimizations
Author :
Liu, Yixun ; Zhang, Eddy Z. ; Shen, Xipeng
Author_Institution :
Comput. Sci. Dept., Coll. of William & Mary, Williamsburg, VA, USA
Abstract :
Recent years have seen a trend in using graphic processing units (GPU) as accelerators for general-purpose computing. The inexpensive, single-chip, massively parallel architecture of GPU has evidentially brought factors of speedup to many numerical applications. However, the development of a high-quality GPU application is challenging, due to the large optimization space and complex unpredictable effects of optimizations on GPU program performance. Recently, several studies have attempted to use empirical search to help the optimization. Although those studies have shown promising results, one important factor-program inputs-in the optimization has remained unexplored. In this work, we initiate the exploration in this new dimension. By conducting a series of measurement, we find that the ability to adapt to program inputs is important for some applications to achieve their best performance on GPU. In light of the findings, we develop an input-adaptive optimization framework, namely G-ADAPT, to address the influence by constructing cross-input predictive models for automatically predicting the (near-)optimal configurations for an arbitrary input to a GPU program. The results demonstrate the promise of the framework in serving as a tool to alleviate the productivity bottleneck in GPU programming.
Keywords :
optimising compilers; parallel architectures; parallel programming; G-ADAPT; GPU program optimizations; GPU program performance; GPU programming; cross-input adaptive framework; cross-input predictive models; general-purpose computing; graphic processing units; input-adaptive optimization framework; parallel architecture; productivity bottleneck; single-chip architecture; Application software; Computer architecture; Computer graphics; Computer science; Educational institutions; Parallel architectures; Predictive models; Productivity; Program processors; Yarn; CUDA; Cross-input Adaptation; Empirical Search; G-ADAPT; GPU; Program Optimizations;
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2009.5160988