DocumentCode :
3029218
Title :
An Empirically Optimized Radix Sort for GPU
Author :
Huang, Bonan ; Gao, Jinlan ; Li, Xiaoming
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Delaware, Newark, DE, USA
fYear :
2009
fDate :
10-12 Aug. 2009
Firstpage :
234
Lastpage :
241
Abstract :
In this paper, we propose an empirical optimization technique for radix sort, one of the most important sorting routines on GPUs, that generates highly efficient code for a number of representative NVIDIA GPUs with a wide variety of architectural specifications. Our study focuses on the algorithmic parameters of radix sort that can be adapted to different environments and on the GPU architectural factors that affect its performance. The approach is shown to find highly efficient code for different NVIDIA GPUs. Our results show that it effectively accounts for the complex interactions between architectural characteristics, and that the resulting code performs significantly better than two radix sort implementations that have been shown to outperform other GPU sort routines, with a maximal speedup of 33.4%.
Keywords :
computer graphic equipment; computer graphics; optimisation; sorting; GPU architectural factors; GPU sort routines; NVIDIA GPU; algorithmic parameters; empirical optimization technique; empirically optimized radix sort; radix sort implementations; sorting routines; Application software; Central Processing Unit; Concurrent computing; Distributed processing; Frequency; Graphics; High performance computing; Programming profession; Sorting; Tuning; GPU; performance tuning; radix sort;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing with Applications, 2009 IEEE International Symposium on
Conference_Location :
Chengdu
Print_ISBN :
978-0-7695-3747-4
Type :
conf
DOI :
10.1109/ISPA.2009.89
Filename :
5207930