Title :
An Empirically Optimized Radix Sort for GPU
Author :
Huang, Bonan ; Gao, Jinlan ; Li, Xiaoming
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Delaware, Newark, DE, USA
Abstract :
In this paper, we propose an empirical optimization technique for radix sort, one of the most important sorting routines on GPUs. The technique generates highly efficient code for a number of representative NVIDIA GPUs with a wide variety of architectural specifications. Our study focuses on the algorithmic parameters of radix sort that can be adapted to different environments and on the GPU architectural factors that affect its performance. Our results show that this empirical optimization approach is effective at accounting for the complex interactions between architectural characteristics, and that the resulting code performs significantly better than two radix sort implementations that have been shown to outperform other GPU sort routines, with a maximal speedup of 33.4%.
Keywords :
computer graphic equipment; computer graphics; optimisation; sorting; GPU architectural factors; GPU sort routines; NVIDIA GPU; algorithmic parameters; empirical optimization technique; empirically optimized radix sort; radix sort implementations; sorting routines; Application software; Central Processing Unit; Concurrent computing; Distributed processing; Frequency; Graphics; High performance computing; Programming profession; Sorting; Tuning; GPU; performance tuning; radix sort;
Conference_Title :
Parallel and Distributed Processing with Applications, 2009 IEEE International Symposium on
Conference_Location :
Chengdu
Print_ISBN :
978-0-7695-3747-4
DOI :
10.1109/ISPA.2009.89