• DocumentCode
    726319
  • Title

    Execution-driven parallel simulation of PGAS applications on heterogeneous tiled architectures

  • Author

    Roloff, Sascha ; Schafhauser, David ; Hannig, Frank ; Teich, Jurgen

  • Author_Institution
    Dept. of Comput. Sci., Friedrich-Alexander-Univ. Erlangen-Nurnberg, Erlangen, Germany
  • fYear
    2015
  • fDate
    8-12 June 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We present a parallel execution-driven simulator for the efficient simulation of heterogeneous tile-based multi-core architectures. Here, the architecture is composed of several tiles connected via a network-on-chip and each tile contains local memory as well as several possibly different types of compute resources. Partitioned Global Address Space (PGAS) is a programming model matching very well the needs for programming of such modern multi-core architectures. In order to provide performance estimations for parallel software and enable architecture design space exploration, fast functional and timing simulation techniques are required. Thus, we present a simulator that meets this requirement by combining a fast direct-execution simulation approach with different parallelization strategies. Here, we propose four novel parallel discrete-event simulation techniques, which map thread-level parallelism within the applications to core-level parallelism on the target architecture and back to thread-level parallelism on the host machine. In order to achieve this, the correct synchronization and activation of the host threads is necessary being the main focus of this paper. Experiments with parallel real-world applications are used to compare the different techniques against each other and demonstrate that 10.4 times faster simulations than a sequential simulation can be achieved on a 12-core Intel Xeon processor.
  • Keywords
    discrete event simulation; multiprocessing systems; network-on-chip; parallel architectures; parallel programming; Intel Xeon processor; PGAS application; PGAS programming model; architecture design space exploration; compute resource; core-level parallelism; direct-execution simulation approach; execution-driven parallel simulation; heterogeneous tile-based multicore architecture; memory resource; network-on-chip; parallel discrete-event simulation techniques; parallel software; parallelization strategy; partitioned global address space; sequential simulation; thread-level parallelism; timing simulation techniques; Adaptation models; Computational modeling; Heart beat; Multicore processing; Parallel processing; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design Automation Conference (DAC), 2015 52nd ACM/EDAC/IEEE
  • Conference_Location
    San Francisco, CA
  • Type

    conf

  • DOI
    10.1145/2744769.2744840
  • Filename
    7167229