• DocumentCode
    2766635
  • Title

    Efficient On-Demand Connection Management Mechanisms with PGAS Models over InfiniBand

  • Author

    Vishnu, Abhinav ; Krishnan, Manoj

  • fYear
    2010
  • fDate
    17-20 May 2010
  • Firstpage
    175
  • Lastpage
    184
  • Abstract
    In the last decade or so, clusters have observed a tremendous rise in popularity due to the excellent price to performance ratio. A variety of Interconnects have been proposed during this period, with InfiniBand leading the way due to its high performance and open standard. At the same time, multiple programming models have emerged in order to meet the requirements of various applications and their programming models. To support requirements of multiple programming models, InfiniBand provides multiple transport semantics, ranging from unreliable connectionless to reliable connected characteristics. Among them, the reliable connection (RC) semantics is being widely used due to its high performance and support for novel features like Remote Direct Memory Acesss (RDMA), hardware atomics and Network Fault Tolerance. However, the pair wise connection oriented nature of the RC transport semantics limits its scalability and usage at the increasing processor counts. In this paper, we design and implement on-demand connection management approaches in the context of Partitioned Global Address Space (PGAS) programming models, which provided shared memory abstraction and one-sided communication semantics, leading to the development of multiple languages (UPC, X10, Chapel) and libraries (Global Arrays, MPI-RMA). Using Global Arrays as the research vehicle, we implement this approach with Aggregate Remote Memory Copy Interface (ARMCI), the runtime system of Global Arrays. We evaluate our approach, ARMCI-On Demand Connection Management (ARMCI-ODCM) using various micro benchmarks and benchmarks (LU Factorization, Random-Access and Lennard Jones simulation) and application (Subsurface transport over multiple phases (STOMP)). With the performance evaluation for up to 4096 processors, we are able to have a multi-fold reduction in connection memory with a negligible degradation in performance. Using STOMP at 4096 processors, reduces the overall connection memory by 66 times with no per- - formance degradation. To the best of our knowledge, this is the first design, implementation and evaluation of on-demand connection management with InfiniBand using PGAS models.
  • Keywords
    Aggregates; Context modeling; Degradation; Electronics packaging; Fault tolerance; Hardware; Libraries; Memory management; Scalability; Vehicles; InfiniBand; PGAS;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2010 10th IEEE/ACM International Conference on
  • Conference_Location
    Melbourne, Australia
  • Print_ISBN
    978-1-4244-6987-1
  • Type

    conf

  • DOI
    10.1109/CCGRID.2010.58
  • Filename
    5493482