• DocumentCode
    2891096
  • Title

    Designing Efficient Many-Core Parallel Algorithms for All-Pairs Shortest-Paths Using CUDA

  • Author

    Tran, Quoc-Nam

  • Author_Institution
    Lamar Univ., Beaumont, TX, USA
  • fYear
    2010
  • fDate
    12-14 April 2010
  • Firstpage
    7
  • Lastpage
    12
  • Abstract
    Finding the all-pairs shortest-paths on a large graph is a fundamental problem in many practical applications such as bioinformatics, internet node traffic and network routing. In this paper, we present the designs of two efficient parallel algorithms for many-core GPUs using CUDA. Our algorithms expose substantial fine-grained parallelism while maintaining minimal global communication. By using the global scope of the GPU´s global memory, coalescing the global memory reads and writes, and avoiding on-chip shared memory bank conflicts, we are able to achieve a large performance benefit with a speed-up of 2,500x on a desktop computer in comparison with a single core program. Our algorithms are scalable, which can handle graphs with size larger than the memory available on the GPUs and when multiple GPUs are added into the system.
  • Keywords
    computer graphic equipment; coprocessors; graph theory; parallel algorithms; shared memory systems; CUDA; all-pairs shortest-paths; desktop computer; global memory; large graph; many-core GPU; many-core parallel algorithms; on-chip shared memory bank conflicts; substantial fine-grained parallelism; Graph Algorithms; Multi-core; Multi-threaded Algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-6270-4
  • Type

    conf

  • DOI
    10.1109/ITNG.2010.230
  • Filename
    5501465