• DocumentCode
    2626275
  • Title

    Design and Implementation of an RDMA Gateway for Heterogeneous Clusters

  • Author

    Kim, Shin Gyu ; Han, Hyuck ; Jung, Hyungsoo ; Yeom, Heon Y.

  • Author_Institution
    Seoul Nat. Univ., Seoul
  • fYear
    2007
  • fDate
    21-23 Nov. 2007
  • Firstpage
    1003
  • Lastpage
    1009
  • Abstract
    Building high-performance clusters using one of the two leading network technologies, Myrinet and InfiniBand, has been thought as a de facto way to achieve several teraflops computing power. Meanwhile, maintaining both types of clusters, it appears, may have created an another challenge for the MPI programming system, the most popular parallel programming library that has been successfully used on both networks. The belief that extending cluster resources across two different types of networks may increase computing parallelism has driven many researchers to tackle this challenge with various viewpoints. We approach this challenge with a different perspective, application transparency, which is accomplishing the goal without any modification of legacy MPI applications. We, therefore, focus on the design of an RDMA gateway that can relay messages very fast, and this design focus turns out to be a better way to preserve the application transparency. RDMA gateway (RG), our prototyped system, has a very efficient memory management mechanism that prevents RG from showing irregular spikes of a memory usage under a heavy load condition. Experimental results show that running parallel applications over heterogeneous clusters can be very promising with low performance overhead.
  • Keywords
    application program interfaces; file organisation; message passing; InfiniBand; MPI programming system; Myrinet; RDMA gateway; heterogeneous clusters; memory management mechanism; remote direct memory access; teraflops computing power; Buildings; Computer networks; Concurrent computing; Libraries; Memory management; Parallel processing; Parallel programming; Prototypes; Relays; Roentgenium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Convergence Information Technology, 2007. International Conference on
  • Conference_Location
    Gyeongju
  • Print_ISBN
    0-7695-3038-9
  • Type

    conf

  • DOI
    10.1109/ICCIT.2007.350
  • Filename
    4420390