DocumentCode :
1996294
Title :
Building Scalable PGAS Communication Subsystem on Blue Gene/Q
Author :
Vishnu, Abhinav ; Kerbyson, Darren J. ; Barker, Kevin ; van Dam, Hubertus
Author_Institution :
Performance & Archit. Lab., Pacific Northwest Nat. Lab., Richland, WA, USA
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
825
Lastpage :
833
Abstract :
This paper presents a design of scalable Partitioned Global Address Space (PGAS) communication subsystems on the recently proposed Blue Gene/Q architecture. The proposed design provides in-depth modeling of the communication infrastructure using the Parallel Active Messaging Interface (PAMI). The communication infrastructure is used to design time- and space-efficient communication protocols for frequently used data types (contiguous, uniformly non-contiguous) with Remote Direct Memory Access (RDMA) get/put primitives. The proposed design accelerates load-balance counters by using asynchronous threads, which are required because the network hardware lacks support for generic Atomic Memory Operations (AMOs). Under the proposed design, synchronization traffic is reduced by tracking conflicting memory accesses in distributed memory, at the cost of a slight increase in space complexity. An evaluation with simple communication benchmarks shows an adjacent-node get latency of 2.89 µs and a peak bandwidth of 1775 MB/s, resulting in 99% communication efficiency. The evaluation also shows a reduction in execution time of up to 30% for an NWChem self-consistent field calculation on 4096 processes using the proposed asynchronous-thread-based design.
Keywords :
computational complexity; file organisation; parallel architectures; parallel machines; protocols; synchronisation; system-on-chip; AMO; Blue Gene/Q architecture; NWChem self consistent field calculation; PAMI; RDMA; asynchronous thread based design; communication infrastructure; conflicting memory accesses; generic atomic memory operations; get-put primitives; network hardware support; parallel active messaging interface; partitioned global address space; remote direct memory access; scalable PGAS communication subsystem; space complexity; synchronization traffic; time-space efficient communication protocols; Complexity theory; Context; Data transfer; Electronics packaging; Memory management; Message systems; Protocols; Blue Gene/Q; Communication; PGAS; RDMA;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.262
Filename :
6650961