DocumentCode
2482604
Title
A cross-input adaptive framework for GPU program optimizations
Author
Liu, Yixun ; Zhang, Eddy Z. ; Shen, Xipeng
Author_Institution
Comput. Sci. Dept., Coll. of William & Mary, Williamsburg, VA, USA
fYear
2009
fDate
23-29 May 2009
Firstpage
1
Lastpage
10
Abstract
Recent years have seen a trend in using graphic processing units (GPU) as accelerators for general-purpose computing. The inexpensive, single-chip, massively parallel architecture of GPU has evidentially brought factors of speedup to many numerical applications. However, the development of a high-quality GPU application is challenging, due to the large optimization space and complex unpredictable effects of optimizations on GPU program performance. Recently, several studies have attempted to use empirical search to help the optimization. Although those studies have shown promising results, one important factor-program inputs-in the optimization has remained unexplored. In this work, we initiate the exploration in this new dimension. By conducting a series of measurement, we find that the ability to adapt to program inputs is important for some applications to achieve their best performance on GPU. In light of the findings, we develop an input-adaptive optimization framework, namely G-ADAPT, to address the influence by constructing cross-input predictive models for automatically predicting the (near-)optimal configurations for an arbitrary input to a GPU program. The results demonstrate the promise of the framework in serving as a tool to alleviate the productivity bottleneck in GPU programming.
Keywords
optimising compilers; parallel architectures; parallel programming; G-ADAPT; GPU program optimizations; GPU program performance; GPU programming; cross-input adaptive framework; cross-input predictive models; general-purpose computing; graphic processing units; input-adaptive optimization framework; parallel architecture; productivity bottleneck; single-chip architecture; Application software; Computer architecture; Computer graphics; Computer science; Educational institutions; Parallel architectures; Predictive models; Productivity; Program processors; Yarn; CUDA; Cross-input Adaptation; Empirical Search; G-ADAPT; GPU; Program Optimizations;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location
Rome
ISSN
1530-2075
Print_ISBN
978-1-4244-3751-1
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2009.5160988
Filename
5160988
Link To Document