• DocumentCode
    1996294
  • Title

    Building Scalable PGAS Communication Subsystem on Blue Gene/Q

  • Author

    Vishnu, Abhinav ; Kerbyson, Darren J. ; Barker, Kevin ; van Dam, Hubertus

  • Author_Institution
    Performance & Archit. Lab., Pacific Northwest Nat. Lab., Richland, WA, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    825
  • Lastpage
    833
  • Abstract
    This paper presents a design of scalable Partitioned Global Address Space (PGAS) communication subsystems on the recently proposed Blue Gene/Q architecture. The proposed design provides an in-depth modeling of the communication infrastructure using the Parallel Active Messaging Interface (PAMI). The communication infrastructure is used to design time-space efficient communication protocols for frequently used data-types (contiguous, uniformly non-contiguous) with Remote Direct Memory Access (RDMA) get/put primitives. The proposed design accelerates load balance counters by using asynchronous threads, which are required due to the missing network hardware support for generic Atomic Memory Operations (AMOs). Under the proposed design, the synchronization traffic is reduced by tracking conflicting memory accesses in distributed memory with a slight increase in space complexity. An evaluation with simple communication benchmarks shows an adjacent-node get latency of 2.89 us and a peak bandwidth of 1775 MB/s, resulting in 99% communication efficiency. The evaluation shows a reduction in execution time of up to 30% for the NWChem self-consistent field calculation on 4096 processes using the proposed asynchronous thread based design.
  • Keywords
    computational complexity; file organisation; parallel architectures; parallel machines; protocols; synchronisation; system-on-chip; AMO; Blue Gene/Q architecture; NWChem self consistent field calculation; PAMI; RDMA; asynchronous thread based design; communication infrastructure; conflicting memory accesses; generic atomic memory operations; get-put primitives; network hardware support; parallel active messaging interface; partitioned global address space; remote direct memory access; scalable PGAS communication subsystem; space complexity; synchronization traffic; time-space efficient communication protocols; Complexity theory; Context; Data transfer; Electronics packaging; Memory management; Message systems; Protocols; Blue Gene/Q; Communication; PGAS; RDMA;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.262
  • Filename
    6650961