• DocumentCode
    1674270
  • Title

    Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL

  • Author

    Kandalla, K. ; Subramoni, H. ; Vienne, J. ; Raikar, S. Pai ; Tomko, K. ; Sur, S. ; Panda, D.K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2011
  • Firstpage
    27
  • Lastpage
    34
  • Abstract
    The upcoming MPI-3.0 standard is expected to include non-blocking collective operations. Non-blocking collectives offer a new MPI interface, using which an application can decouple the initiation and completion of collective operations. However, to be effective, the MPI library should provide a high performance and scalable implementation. One of the major challenges in designing an effective non-blocking collective operation is to ensure progress of the operation while processors are busy in application-level computation. The recently introduced Mellanox ConnectX-2 InfiniBand adapters offer a task offload interface (CORE-Direct) that enables communication progress without requiring CPU cycles. In this paper, we present the design of a non-blocking broadcast operation (MPI_Ibcast) using the CORE-Direct offload interface. Our experimental evaluations show that our implementation delivers near perfect overlap, without penalizing the latency of the MPI_Ibcast operation. Since existing MPI implementations do not provide non-blocking collective communication, scientific applications have been modified to implement collectives on top of MPI point-to-point operations to achieve overlap. HPL is an example of an application use case scenario for non-blocking collectives. We have explored the benefits of our proposed network offload based MPI_Ibcast implementation with HPL and we observe that HPL can achieve its peak throughput with significantly smaller problem sizes, which also leads to an improvement in its run-time by up to 78%, with 512 processors. We also observe that our proposed designs can minimize the impact of system noise on applications.
  • Keywords
    message passing; CORE-direct offload interface; InfiniBand cluster; MPI interface; MPI library; MPI point-to-point operation; MPI-3.0 standard; Mellanox ConnectX-2 InfiniBand; collective offload; communication progress; message passing interface; nonblocking broadcast; nonblocking collective operation; Algorithm design and analysis; Benchmark testing; Libraries; Noise; Program processors; Size measurement; Throughput; High Performance Linpack; InfiniBand; MPI; Non-Blocking Collective Communication;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Interconnects (HOTI), 2011 IEEE 19th Annual Symposium on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-1-4577-1563-1
  • Electronic_ISBN
    978-0-7695-4537-0
  • Type

    conf

  • DOI
    10.1109/HOTI.2011.14
  • Filename
    6041531