DocumentCode :
2856044
Title :
Characterizing the Performance of “Big Memory” on Blue Gene Linux
Author :
Yoshii, Kazutomo ; Iskra, Kamil ; Naik, Harish ; Beckman, Pete ; Broekema, P. Chris
Author_Institution :
Math. & Comput. Sci. Div., Argonne Nat. Lab., Argonne, IL, USA
fYear :
2009
fDate :
22-25 Sept. 2009
Firstpage :
65
Lastpage :
72
Abstract :
Efficient use of Linux for high-performance applications on Blue Gene/P (BG/P) compute nodes is challenging because of severe performance hits resulting from translation lookaside buffer (TLB) misses and a hard-to-program torus network DMA controller. To address these difficulties, we present the design and implementation of ¿Big Memory¿- an alternative, transparent memory space for computational processes. Big Memory uses extremely large memory pages available on PowerPC CPUs to create a TLB-miss-free, flat memory area that can be used for application code and data and is easier to use for DMA operations. One of our singlenode memory benchmarks shows that the performance gap between regular PowerPC Linux with 4KB pages and IBM BG/P compute node kernel (CNK) is about 68% in the worst case. Big Memory narrows the worst case performance gap to just 0.04%. We verify this result on 1024 nodes of Blue Gene/P using the NAS Parallel Benchmarks and find the performance under Linux with Big Memory to fluctuate within 0.7% of CNK. Originally intended exclusively for compute node tasks, our new memory subsystem turns out to dramatically improve the performance of certain I/O node applications as well. We demonstrate this performance using the central processor of the LOw Frequency ARray (LOFAR) radio telescope as an example.
Keywords :
Linux; file organisation; operating system kernels; parallel programming; BG/P compute node kernel; Blue Gene Linux; DMA controller; LOFAR radio telescope processor; NAS parallel benchmarks; PowerPC Linux; big memory performance; low frequency array; torus network; translation lookaside buffer; Computer architecture; Computer networks; Computer science; Concurrent computing; Hardware; High performance computing; Kernel; Linux; Operating systems; Radio astronomy; Blue Gene; I/O node; LOFAR; Linux; NAS Parallel Benchmarks; OS kernel; TLB; compute node; memory performance; petascale;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Workshops, 2009. ICPPW '09. International Conference on
Conference_Location :
Vienna
ISSN :
1530-2016
Print_ISBN :
978-1-4244-4923-1
Electronic_ISBN :
1530-2016
Type :
conf
DOI :
10.1109/ICPPW.2009.35
Filename :
5365704
Link To Document :
بازگشت