DocumentCode :
3205099
Title :
vFtree - A Fat-Tree Routing Algorithm Using Virtual Lanes to Alleviate Congestion
Author :
Guay, Wei Lin ; Bogdanski, Bartosz ; Reinemo, Sven-Arne ; Lysne, Olav ; Skeie, Tor
Author_Institution :
Simula Res. Lab., Lysaker, Norway
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
197
Lastpage :
208
Abstract :
It is a well known fact that multiple virtual lanes can improve performance in interconnection networks, but this knowledge has had little impact on real clusters. Currently, a large number of clusters using InfiniBand is based on fat-tree topologies that can be routed deadlock-free using only one virtual lane. Consequently, all the remaining virtual lanes are left unused. In this paper we suggest an enhancement to the fat-tree algorithm that utilizes virtual lanes to improve performance when hot-spots are present. Even though the bisection bandwidth in a fat-tree is constant, hot-spots are still possible and they will degrade performance for flows not contributing to them due to head-of-line blocking. Such a situation may be alleviated through adaptive routing or congestion control, however, these methods are not yet readily available in InfiniBand technology. To remedy this problem, we have implemented an enhanced fat-tree algorithm in OpenSM that distributes traffic across all available virtual lanes without any configuration needed. We evaluated the performance of the algorithm on a small cluster and did a large-scale evaluation through simulations. In a congested environment, results show that we are able to achieve throughput increases up to 38% on a small cluster and from 221% to 757% depending on the hot-spot scenario for a 648-port simulated cluster.
Keywords :
multiprocessor interconnection networks; telecommunication congestion control; telecommunication network routing; workstation clusters; InfiniBand; OpenSM; adaptive routing; congestion alleviation; congestion control; fat-tree routing algorithm; interconnection networks; vFtree; virtual lanes; Bandwidth; Hardware; Network topology; Proposals; Routing; System recovery; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location :
Anchorage, AK
ISSN :
1530-2075
Print_ISBN :
978-1-61284-372-8
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2011.28
Filename :
6012837
Link To Document :
بازگشت