• DocumentCode
    3536997
  • Title
    Designing Efficient Parallel Prefix Sum Algorithms for GPUs
  • Author
    Capannini, Gabriele
  • Author_Institution
    Inf. Sci. & Technol. Inst., Nat. Res. Council, Pisa, Italy
  • fYear
    2011
  • fDate
    Aug. 31 2011-Sept. 2 2011
  • Firstpage
    189
  • Lastpage
    196
  • Abstract
    This paper presents a novel and efficient method for computing one of the simplest and most useful building blocks of parallel algorithms: the parallel prefix sum operation. Besides its practical relevance, the problem is also of interest in parallel-computation theory. We first describe, step by step, how the prefix sum is computed in parallel on GPUs. Next, we propose a more efficient technique designed specifically for modern graphics processors and similar processors. Our technique performs the computation in a way that minimizes both memory conflicts and memory usage. Finally, we evaluate all the considered solutions, theoretically and empirically, in terms of efficiency, space complexity, and computational time. To conduct the theoretical analysis, we use the K-model, a novel computational model that we proposed in previous work. In the experiments, the proposed solution outperforms the existing ones.
  • Keywords
    computer graphic equipment; coprocessors; parallel algorithms; GPU; graphics processors; parallel computation theory; parallel prefix sum algorithms; Algorithm design and analysis; Arrays; Complexity theory; Computational modeling; Graphics processing unit; Memory management; performance
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Information Technology (CIT), 2011 IEEE 11th International Conference on
  • Conference_Location
    Pafos
  • Print_ISBN
    978-1-4577-0383-6
  • Electronic_ISBN
    978-0-7695-4388-8
  • Type
    conf
  • DOI
    10.1109/CIT.2011.11
  • Filename
    6036747