Title :
Execution-driven parallel simulation of PGAS applications on heterogeneous tiled architectures
Author :
Roloff, Sascha ; Schafhauser, David ; Hannig, Frank ; Teich, Jurgen
Author_Institution :
Dept. of Comput. Sci., Friedrich-Alexander-Univ. Erlangen-Nurnberg, Erlangen, Germany
Abstract :
We present a parallel execution-driven simulator for the efficient simulation of heterogeneous tile-based multi-core architectures. Here, the architecture is composed of several tiles connected via a network-on-chip and each tile contains local memory as well as several possibly different types of compute resources. Partitioned Global Address Space (PGAS) is a programming model matching very well the needs for programming of such modern multi-core architectures. In order to provide performance estimations for parallel software and enable architecture design space exploration, fast functional and timing simulation techniques are required. Thus, we present a simulator that meets this requirement by combining a fast direct-execution simulation approach with different parallelization strategies. Here, we propose four novel parallel discrete-event simulation techniques, which map thread-level parallelism within the applications to core-level parallelism on the target architecture and back to thread-level parallelism on the host machine. In order to achieve this, the correct synchronization and activation of the host threads is necessary being the main focus of this paper. Experiments with parallel real-world applications are used to compare the different techniques against each other and demonstrate that 10.4 times faster simulations than a sequential simulation can be achieved on a 12-core Intel Xeon processor.
Keywords :
discrete event simulation; multiprocessing systems; network-on-chip; parallel architectures; parallel programming; Intel Xeon processor; PGAS application; PGAS programming model; architecture design space exploration; compute resource; core-level parallelism; direct-execution simulation approach; execution-driven parallel simulation; heterogeneous tile-based multicore architecture; memory resource; network-on-chip; parallel discrete-event simulation techniques; parallel software; parallelization strategy; partitioned global address space; sequential simulation; thread-level parallelism; timing simulation techniques; Adaptation models; Computational modeling; Heart beat; Multicore processing; Parallel processing; Synchronization;
Conference_Titel :
Design Automation Conference (DAC), 2015 52nd ACM/EDAC/IEEE
Conference_Location :
San Francisco, CA
DOI :
10.1145/2744769.2744840