Title :
Optimized GPU Sorting Algorithms on Special Input Distributions
Author :
Yang, Quan ; Du, Zhihui ; Zhang, Sen
Author_Institution :
Tsinghua Nat. Lab. for Inf. Sci. & Technol., Beijing, China
Abstract :
We present a high performance graphics processing unit (GPU) sorting algorithm ISSD (Improved Sorting considering Special Distributions) implemented with the Compute Unified Device Architecture (CUDA). The ISSD focuses on two aspects to improve parallel sorting performance. One is how to decompose the sorting tasks into independent and balanced subtasks which can then be easily distributed to thousands of threads to realize the concept of “parallel sorting” as well as to efficiently explore the power of GPU. The other one is how to take advantage of special data distributions to further optimize the algorithms and improve their performance. The algorithm is redesigned based on our previous general data distribution version and optimized both on general implementation methods and special input distributions. Experimental results show that for the general data distribution inputs, the ISSD outperforms the existing parallel sorting algorithms by about 10% in performance due to its practical optimization in implementation; and for three special data distribution inputs, the ISSD outperforms the existing algorithms by more than 40% due to its special optimization based on the data distributions. Therefore, the algorithm is viable and efficient when dealing with specific data distributions.
Keywords :
graphics processing units; parallel algorithms; parallel architectures; sorting; CUDA; GPU sorting algorithm; ISSD algorithm; compute unified device architecture; data distribution; general implementation method; graphics processing unit; improved sorting considering special distributions; parallel sorting concept; parallel sorting performance; sorting task; special input distribution; Algorithm design and analysis; Arrays; Distributed databases; Graphics processing units; Instruction sets; Optimization; Sorting; GPU; parallel sorting algorithm; special data distribution;
Conference_Titel :
Distributed Computing and Applications to Business, Engineering & Science (DCABES), 2012 11th International Symposium on
Conference_Location :
Guilin
Print_ISBN :
978-1-4673-2630-8
DOI :
10.1109/DCABES.2012.57