DocumentCode :
154079
Title :
Improving the Performance of the Extreme-Scale Simulator
Author :
Engelmann, Christian ; Naughton, Thomas
Author_Institution :
Comput. Sci. & Math. Div., Oak Ridge Nat. Lab., Oak Ridge, TN, USA
fYear :
2014
fDate :
1-3 Oct. 2014
Firstpage :
198
Lastpage :
207
Abstract :
Investigating the performance of parallel applications at scale on future high-performance computing (HPC) architectures and the performance impact of different architecture choices is an important component of HPC hardware/software co-design. The Extreme-scale Simulator (xSim) is a simulation-based toolkit for investigating the performance of parallel applications at scale. xSim scales to millions of simulated Message Passing Interface (MPI) processes. The overhead introduced by a simulation tool is an important performance and productivity aspect. This paper documents two improvements to xSim: (1) a new deadlock resolution protocol to reduce the parallel discrete event simulation management overhead and (2) a new simulated MPI message matching algorithm to reduce the oversubscription management overhead. The results clearly show a significant performance improvement, such as by reducing the simulation overhead for running the NAS Parallel Benchmark suite inside the simulator from 1,020% to 238% for the conjugate gradient (CG) benchmark and from 102% to 0% for the embarrassingly parallel (EP) and benchmark, as well as, from 37,511% to 13,808% for CG and from 3,332% to 204% for EP with accurate process failure simulation.
Keywords :
conjugate gradient methods; hardware-software codesign; message passing; parallel processing; software architecture; HPC architectures; MPI process; NAS parallel benchmark suite; conjugate gradient benchmark; extreme-scale simulator; hardware/software codesign; high-performance computing; message passing interface; parallel applications; simulation-based toolkit; xSim; Benchmark testing; Clocks; Computational modeling; Computer architecture; Context; Protocols; System recovery; High-performance Computing; Message Passing Interface; Parallel Discrete Event Simulation; Performance Prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Simulation and Real Time Applications (DS-RT), 2014 IEEE/ACM 18th International Symposium on
Conference_Location :
Toulouse
ISSN :
1550-6525
Print_ISBN :
978-1-4799-6143-6
Type :
conf
DOI :
10.1109/DS-RT.2014.32
Filename :
6957193
Link To Document :
بازگشت