Title :
Characterizing the Influence of System Noise on Large-Scale Applications by Simulation
Author :
Hoefler, Torsten ; Schneider, Timo ; Lumsdaine, Andrew
Author_Institution :
Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
This paper presents an in-depth analysis of the impact of system noise on large-scale parallel application performance in realistic settings. Our analytical model shows that not only collective operations but also point-to-point communications influence the application´s sensitivity to noise. We present a simulation toolchain that injects noise delays from traces gathered on common large-scale architectures into a LogGPS simulation and allows new insights into the scaling of applications in noisy environments. We investigate collective operations with up to 1 million processes and three applications (Sweep3D, AMG, and POP) with up to 32,000 processes.We show that the scale at which noise becomes a bottleneck is system-specific and depends on the structure of the noise. Simulations with different network speeds show that a 10x faster network does not improve application scalability. We quantify noise and conclude that our tools can be utilized to tune the noise signatures of a specific system.
Keywords :
multiprocessing systems; parallel architectures; AMG; LogGPS simulation; POP; Sweep3D; in-depth analysis; large-scale architectures; large-scale parallel application performance; noise delays; noise signatures; point-to-point communications; system noise; Analytical models; Benchmark testing; Delay; Noise; Noise measurement; Receivers; Synchronization;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7557-5
Electronic_ISBN :
978-1-4244-7558-2