• DocumentCode
    2959737
  • Title

    Exploring the Scope of the InfiniBand Congestion Control Mechanism

  • Author

    Gran, Ernst Gunnar ; Reinemo, Sven-Arne ; Lysne, Olav ; Skeie, Tor ; Zahavi, Eitan ; Shainer, Gilad

  • Author_Institution
    Simula Res. Lab., Fornebu, Norway
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    1131
  • Lastpage
    1143
  • Abstract
    In a loss less interconnection network, network congestion needs to be detected and resolved to ensure high performance and good utilization of network resources at high network load. If no countermeasure is taken, congestion at a node in the network will stimulate the growth of a congestion tree that not only affects contributors to congestion, but also other traffic flows in the network. Left untouched, the congestion tree will block traffic flows, lead to underutilization of network resources and result in a severe drop in network performance. The InfiniBand standard specifies a congestion control (CC) mechanism to detect and resolve congestion before a congestion tree is able to grow and, by that, hamper the network performance. The InfiniBand CC mechanism includes a rich set of parameters that can be tuned in order to achieve effective CC. Even though it has been shown that the CC mechanism, properly tuned, is able to improve both throughput and fairness in an interconnection network, it has been questioned whether the mechanism is fast enough to keep up with dynamic network traffic, and if a given set of parameter values for a topology is robust when it comes to different traffic patterns, or if the parameters need to be tuned depending on the applications in use. In this paper we address both these questions. Using the three-stage fat-tree topology from the Sun Data center InfiniBand Switch 648 as a basis, and a simulator tuned against CC capable InfiniBand hardware, we conduct a systematic study of the efficiency of the InfiniBand CC mechanism as the network traffic becomes increasingly more dynamic. Our studies show that the InfiniBand CC, even when using a single set of parameter values, performs very well as the traffic patterns becomes increasingly more dynamic, outperforming a network without CC in all cases. Our results show throughput increases varying from a few percent, to a seventeen-fold increase.
  • Keywords
    computer centres; multiprocessor interconnection networks; network topology; telecommunication congestion control; telecommunication traffic; InfiniBand CC mechanism; InfiniBand Switch 648; InfiniBand congestion control mechanism; InfiniBand hardware; InfiniBand standard; Sun Data center; congestion tree; dynamic network traffic; interconnection network; network congestion; network fairness; network load; network performance; network resource; network throughput; network topology; network traffic flow; parameter value; three-stage fat-tree topology; traffic pattern; Multiprocessor interconnection; Network topology; Peer to peer computing; Standards; Switches; Vegetation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-0975-2
  • Type

    conf

  • DOI
    10.1109/IPDPS.2012.104
  • Filename
    6267917