• DocumentCode
    1919350
  • Title
    A CUDA-MPI Hybrid Bitonic Sorting Algorithm for GPU Clusters
  • Author
    White, Sam ; Verosky, Niels ; Newhall, Tia
  • Author_Institution
    Comput. Sci. Dept., Swarthmore Coll., Swarthmore, PA, USA
  • fYear
    2012
  • fDate
    10-13 Sept. 2012
  • Firstpage
    588
  • Lastpage
    589
  • Abstract
    We present a hybrid CUDA-MPI sorting algorithm that uses GPU clusters to sort large data sets. The algorithm has two phases. In the first phase, each node sorts a portion of the data on its GPU using a parallel bitonic sort. In the second phase, the sorted subsequences are merged in parallel using a reduction sorting network implemented in MPI across the cluster nodes. Compared with sequential quicksort, our algorithm achieves speed-ups of up to 9.8 when sorting 4 GB of data on a 32-node GPU cluster. We anticipate even better speed-ups on larger data sets and larger clusters.
  • Keywords
    application program interfaces; data reduction; graphics processing units; parallel architectures; pattern clustering; sorting; CUDA; GPU cluster; MPI; cluster node; data set; hybrid bitonic sorting algorithm; parallel bitonic sort; reduction sorting network; Algorithm design and analysis; Clustering algorithms; Graphics processing unit; Parallel processing; Random access memory; Runtime; Sorting; GPU clusters; hybrid CUDA-MPI; parallel sorting algorithm;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
  • Conference_Location
    Pittsburgh, PA
  • ISSN
    1530-2016
  • Print_ISBN
    978-1-4673-2509-7
  • Type
    conf
  • DOI
    10.1109/ICPPW.2012.82
  • Filename
    6337530
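
The two phases described in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' implementation: it runs sequentially in plain Python, with no CUDA or MPI, and the function names (`bitonic_sort`, `merge_sorted`, `hybrid_sort`) are hypothetical. It also assumes that each node's chunk has a power-of-two length and that the reduction step is a binary tree of pairwise merges; the record does not specify the exact network the paper uses.

```python
def bitonic_sort(values):
    """Phase 1 stand-in: sort one node's chunk with a bitonic sorting network.

    On a GPU, every iteration of the inner `for i` loop could run as an
    independent thread; here the iterations simply run one after another.
    """
    data = list(values)
    n = len(data)
    if n & (n - 1):
        raise ValueError("chunk length must be a power of two")
    k = 2
    while k <= n:              # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:          # distance between compared elements
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    # Swap when the pair is out of order for its direction.
                    if (data[i] > data[partner]) == ascending:
                        data[i], data[partner] = data[partner], data[i]
            j //= 2
        k *= 2
    return data


def merge_sorted(left, right):
    """Merge two sorted lists into one sorted list."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            out.append(left[i])
            i += 1
        else:
            out.append(right[j])
            j += 1
    out.extend(left[i:])
    out.extend(right[j:])
    return out


def hybrid_sort(values, num_nodes):
    """Simulate both phases for `num_nodes` hypothetical cluster nodes."""
    if num_nodes < 1 or len(values) % num_nodes:
        raise ValueError("data must divide evenly among the nodes")
    chunk = len(values) // num_nodes
    # Phase 1: each "node" sorts its own portion of the data.
    runs = [bitonic_sort(values[r * chunk:(r + 1) * chunk])
            for r in range(num_nodes)]
    # Phase 2 (assumed tree reduction): at each round, node r absorbs the
    # sorted run of node r + stride; the merges within a round are independent.
    stride = 1
    while stride < num_nodes:
        for r in range(0, num_nodes, 2 * stride):
            if r + stride < num_nodes:
                runs[r] = merge_sorted(runs[r], runs[r + stride])
        stride *= 2
    return runs[0]
```

Calling `hybrid_sort(data, 4)` on a list of eight values splits it into four chunks of two, sorts each chunk, and then merges the sorted runs in two rounds. In a real cluster the merges within a round would execute on different nodes at the same time, with the sorted runs exchanged as messages.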