DocumentCode
3429372
Title
Looking under the hood of the IBM Blue Gene/Q network
Author
Dong Chen ; Eisley, N. ; Heidelberger, P. ; Kumar, Sudhakar ; Mamidala, A. ; Petrini, Fabrizio ; Senger, R. ; Sugawara, Yoko ; Walkup, R. ; Choudhury, Alamgir ; Sabharwal, Yogish ; Singhal, Sharad ; Steinmacher-Burow, Burkhard ; Parker, Jeffrey J.
Author_Institution
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
1
Lastpage
12
Abstract
This paper explores the performance and optimization of the IBM Blue Gene/Q (BG/Q) five-dimensional torus network on up to 16K nodes. The BG/Q hardware supports multiple dynamic routing algorithms, and different traffic patterns may require different algorithms to achieve the best performance. Between 85% and 95% of peak network performance is achieved for all-to-all traffic, while over 85% of peak is obtained for challenging bisection pairings. A new software-controlled algorithm is developed for bisection traffic that selects which hardware algorithm to employ and achieves better performance than any individual hardware algorithm. The benefit of dynamic routing is shown for a highly non-uniform "transpose" traffic pattern. To evaluate memory and network performance, the HPCC Random Access benchmark was tuned for BG/Q and achieved 858 Giga Updates per Second (GUPS) on 16K nodes. To further accelerate message processing, the message libraries on BG/Q enable the offloading of messaging overhead onto dedicated communication threads. Several applications, including Algebraic Multigrid (AMG), exhibit gains of 3% to 20% using communication threads.
Keywords
parallel processing; AMG; BG/Q hardware; HPCC random access benchmark; IBM Blue Gene/Q network; algebraic multigrid; bisection pairings; bisection traffic; five dimensional torus network; hardware algorithm; multiple dynamic routing algorithm; nonuniform transpose traffic pattern; software-controlled algorithm; Bandwidth; Hardware; Heuristic algorithms; Message systems; Routing; Software; Software algorithms; Blue Gene; GUPS; interconnection network; network performance; network routing;
fLanguage
English
Publisher
IEEE
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location
Salt Lake City, UT
ISSN
2167-4329
Print_ISBN
978-1-4673-0805-2
Type
conf
DOI
10.1109/SC.2012.72
Filename
6468517