• DocumentCode
    167364
  • Title

    A Novel Computational Model for GPUs with Application to I/O Optimal Sorting Algorithms

  • Author

    Koike, A. ; Sadakane, K.

  • Author_Institution
    Principles of Inf. Res. Div., Nat. Inst. of Inf., Tokyo, Japan
  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    614
  • Lastpage
    623
  • Abstract
    We propose a novel computational model for GPU. Known parallel computational models such as the PRAM model are not appropriate for evaluating GPU algorithms. Our model, called AGPU, abstracts the essence of current GPU architectures such as global and shared memory, memory coalescing and bank conflicts. We can therefore evaluate asymptotic behavior of GPU algorithms more accurately than known models and we can develop algorithms that are efficient on many real architectures. As a showcase, we first analyze known comparison-based sorting algorithms using the AGPU model and show that they are not I/O optimal, that is, the number of global memory accesses is more than necessary. Then we propose a new algorithm which uses an asymptotically optimal number of global memory accesses and whose time complexity is also nearly optimal.
  • Keywords
    computational complexity; graphics processing units; input-output programs; parallel architectures; shared memory systems; sorting; AGPU computational model; GPU architectures; I/O optimal sorting algorithms; bank conflicts; comparison-based sorting algorithms; global memory accesses; memory coalescing; shared memory; time complexity; Algorithm design and analysis; Complexity theory; Computational modeling; Computer architecture; Graphics processing units; Instruction sets; Synchronization; GPGPU; GPU; parallel computational models; sorting algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    978-1-4799-4117-9
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2014.72
  • Filename
    6969442