• DocumentCode
    3507077
  • Title

    Operating Two InfiniBand Grid Clusters over 28 km Distance

  • Author

    Richling, Sabine ; Hau, Steffen ; Kredel, Heinz ; Kruse, Hans-Günther

  • Author_Institution
    IT-Center, Univ. of Heidelberg, Heidelberg, Germany
  • fYear
    2010
  • fDate
    4-6 Nov. 2010
  • Firstpage
    16
  • Lastpage
    23
  • Abstract
    This paper considers an InfiniBand connection between two bwGRiD clusters over a distance of 28 km in day-to-day production use. We discuss the hardware setup, in which InfiniBand messages are converted and transported over a fiber optic connection. The two clusters can be operated as a single system image; the batch system enforces that all nodes for a job are allocated on one side of the cluster. This optimizes MPI performance, which would not be sufficient for communication between nodes on opposite sides of the 28 km connection. We report on the successful solution of all technical and organizational integration hurdles. Using a simple performance model, we discuss the anticipated costs of doubling the communication performance.
  • Keywords
    application program interfaces; grid computing; message passing; optical fibre LAN; workstation clusters; Ethernet; InfiniBand connection; InfiniBand grid clusters; MPI performance; batch system; fiber optic connection; long-distance InfiniBand; operating clusters; performance model
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2010 International Conference on
  • Conference_Location
    Fukuoka
  • Print_ISBN
    978-1-4244-8538-3
  • Electronic_ISBN
    978-0-7695-4237-9
  • Type

    conf

  • DOI
    10.1109/3PGCIC.2010.8
  • Filename
    5662746