DocumentCode
1919350
Title
A CUDA-MPI Hybrid Bitonic Sorting Algorithm for GPU Clusters
Author
White, Sam ; Verosky, Niels ; Newhall, Tia
Author_Institution
Comput. Sci. Dept., Swarthmore Coll., Swarthmore, PA, USA
fYear
2012
fDate
10-13 Sept. 2012
Firstpage
588
Lastpage
589
Abstract
We present a hybrid CUDA-MPI sorting algorithm that makes use of GPU clusters to sort large data sets. Our algorithm has two phases. In the first phase each node sorts a portion of the data on its GPU using a parallel bitonic sort. In the second phase the sorted subsequences are merged together in parallel using a reduction sorting network implemented in MPI across the cluster nodes. Performance results comparing our sorting algorithm to sequential quick sort yield speed-up values of up to 9.8 for sorting 4GB of data on a 32 node GPU cluster. We anticipate even better speed-up values using our algorithm on larger data sets and larger sized clusters.
Keywords
application program interfaces; data reduction; graphics processing units; parallel architectures; pattern clustering; sorting; CUDA; GPU cluster; MPI; cluster node; data set; hybrid bitonic sorting algorithm; parallel bitonic sort; reduction sorting network; Algorithm design and analysis; Clustering algorithms; Graphics processing unit; Parallel processing; Random access memory; Runtime; Sorting; GPU clusters; hybrid CUDA-MPI; parallel sorting algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
Conference_Location
Pittsburgh, PA
ISSN
1530-2016
Print_ISBN
978-1-4673-2509-7
Type
conf
DOI
10.1109/ICPPW.2012.82
Filename
6337530
Link To Document