Title :
Simulating billion-task parallel programs
Author :
Perumalla, Kalyan S. ; Park, Alfred J.
Author_Institution :
Oak Ridge Nat. Lab., Oak Ridge, TN, USA
Abstract :
In simulating large parallel systems, bottom-up approaches exercise detailed hardware models with effects from simplified software models or traces, whereas top-down approaches evaluate the timing and functionality of detailed software models over coarse hardware models. Here, we focus on the top-down approach and significantly advance the scale of the simulated parallel programs. Via the direct execution technique combined with parallel discrete event simulation, we stretch the limits of the top-down approach by simulating parallel programs with hundreds of millions of tasks. Although the scaling issues and solutions presented here are generally applicable, we focus on message passing interface (MPI) programs. Using a timing-validated benchmark application, a proof-of-concept scaling level is achieved to over 0.22 billion virtual MPI processes on 216,000 cores of a Cray XTS supercomputer, representing one of the largest direct execution simulations to date, combined with a multiplexing ratio of 1024 simulated tasks per real task.
Keywords :
discrete event simulation; message passing; parallel programming; timing; Cray XTS supercomputer; MPI programs; billion-task parallel program simulation; bottom-up approaches; direct execution technique; hardware models; large parallel system simulation; message passing interface programs; multiplexing ratio; parallel discrete event simulation; proof-of-concept scaling level; software models; timing-validated benchmark application; top-down approaches; virtual MPI processes; Bandwidth; Computational modeling; Hardware; Message systems; Multiplexing; Receivers; Semantics;
Conference_Titel :
Performance Evaluation of Computer and Telecommunication Systems (SPECTS 2014), International Symposium on
Conference_Location :
Monterey, CA
DOI :
10.1109/SPECTS.2014.6879997