• DocumentCode
    3205765
  • Title

    Deadlock-Free Oblivious Routing for Arbitrary Topologies

  • Author

    Domke, Jens ; Hoefler, Torsten ; Nagel, Wolfgang E.

  • Author_Institution
    Center for Inf. Services & High Performance Comput., Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    616
  • Lastpage
    627
  • Abstract
    Efficient deadlock-free routing strategies are crucial to the performance of large-scale computing systems. There are many methods but it remains a challenge to achieve lowest latency and highest bandwidth for irregular or unstructured high-performance networks. % We investigate a novel routing strategy based on the single-source-shortest-path routing algorithm and extend it to use virtual channels to guarantee deadlock-freedom. We show that this algorithm achieves minimal latency and high bandwidth with only a low number of virtual channels and can be implemented in practice. % We demonstrate that the problem of finding the minimal number of virtual channels needed to route a general network deadlock-free is NP-complete and we propose different heuristics to solve the problem. We implement all proposed algorithms in the Open Subnet Manager of InfiniBand and compare the number of needed virtual channels and the bandwidths of multiple real and artificial network topologies which are established in practice. % Our approach allows to use the existing virtual channels more effectively to guarantee deadlock-freedom and increase the effective bandwidth of up to a factor of two. Application benchmarks show an improvement of up to 95%. Our routing scheme is not limited to InfiniBand but can be deployed on existing InfiniBand installations to increase network performance transparently without modifications to the user applications.
  • Keywords
    computational complexity; computer networks; graph theory; telecommunication network routing; telecommunication network topology; NP-complete; arbitrary topologies; artificial network topologies; deadlock-free oblivious routing; large-scale computing systems; single-source-shortest-path routing algorithm; unstructured high-performance networks; virtual channels; Bandwidth; Generators; Network topology; Polynomials; Routing; System recovery; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
  • Conference_Location
    Anchorage, AK
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-372-8
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.65
  • Filename
    6012874