DocumentCode
3205765
Title
Deadlock-Free Oblivious Routing for Arbitrary Topologies
Author
Domke, Jens ; Hoefler, Torsten ; Nagel, Wolfgang E.
Author_Institution
Center for Inf. Services & High Performance Comput., Tech. Univ. Dresden, Dresden, Germany
fYear
2011
fDate
16-20 May 2011
Firstpage
616
Lastpage
627
Abstract
Efficient deadlock-free routing strategies are crucial to the performance of large-scale computing systems. There are many methods but it remains a challenge to achieve lowest latency and highest bandwidth for irregular or unstructured high-performance networks. % We investigate a novel routing strategy based on the single-source-shortest-path routing algorithm and extend it to use virtual channels to guarantee deadlock-freedom. We show that this algorithm achieves minimal latency and high bandwidth with only a low number of virtual channels and can be implemented in practice. % We demonstrate that the problem of finding the minimal number of virtual channels needed to route a general network deadlock-free is NP-complete and we propose different heuristics to solve the problem. We implement all proposed algorithms in the Open Subnet Manager of InfiniBand and compare the number of needed virtual channels and the bandwidths of multiple real and artificial network topologies which are established in practice. % Our approach allows to use the existing virtual channels more effectively to guarantee deadlock-freedom and increase the effective bandwidth of up to a factor of two. Application benchmarks show an improvement of up to 95%. Our routing scheme is not limited to InfiniBand but can be deployed on existing InfiniBand installations to increase network performance transparently without modifications to the user applications.
Keywords
computational complexity; computer networks; graph theory; telecommunication network routing; telecommunication network topology; NP-complete; arbitrary topologies; artificial network topologies; deadlock-free oblivious routing; large-scale computing systems; single-source-shortest-path routing algorithm; unstructured high-performance networks; virtual channels; Bandwidth; Generators; Network topology; Polynomials; Routing; System recovery; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location
Anchorage, AK
ISSN
1530-2075
Print_ISBN
978-1-61284-372-8
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2011.65
Filename
6012874
Link To Document