Title :
Dynamically Adjusting Core Frequencies to Accelerate Time Warp Simulations in Many-Core Processors
Author :
Child, Ryan ; Wilsey, Philip
Author_Institution :
Sch. of Electron. & Comput. Syst., Univ. of Cincinnati Cincinnati, Cincinnati, OH, USA
Abstract :
Time Warp synchronized parallel discrete event simulators are organized to operate asynchronously and aggressively without explicit synchronization between the concurrently executing simulators. In place of an explicit synchronization mechanism, the concurrent simulators maintain a common virtual clock model and implement a rollback/recovery mechanism to restore causal order when out-of-order events are detected. When the critical path of execution of the simulation is balanced across these parallel simulators, this can result in a highly effective, lightweight synchronization mechanism. However, imbalances in the workload across the parallel simulators can result in excessive rollback at some nodes and ultimately result in an overall slowing of the simulation as prematurely computed and transmitted events are processed. On small shared memory multi-core systems, a lowest time-stamp first scheduling policy can effectively balance the workload. However, on larger many-core chips, conventional load balancing and workload migration will once again become necessary. Fortunately, emerging many-core chips contain some interesting features that can potentially be exploited to improve the performance of parallel simulations. For example, the Intel Single-chip Cloud Computer (SCC) provides mechanisms that a running application can use to adjust the frequency/voltage of different regions (called islands) of the chip. These islands are network and processing core centric and thus, in a Time Warp simulation, one can increase the frequency of the cores executing threads on the critical path (those experiencing infrequent rollback) and decrease the frequency of the cores executing threads off the critical path (those experiencing excessive rollback). This paper investigates the run-time control and adjustment of core frequency in an AMD Phenom II X6 multi-core processor to explore and demonstrate that the dynamic run-time control of core frequency can sometimes improve the perfo- mance of a Time Warp synchronized parallel simulation.
Keywords :
parallel processing; shared memory systems; time warp simulation; AMD Phenom II X6 multicore processor; Intel single-chip cloud computer; SCC; concurrent simulators; core frequencies; explicit synchronization mechanism; lightweight synchronization mechanism; load balancing; lowest time-stamp first scheduling policy; many-core processors; parallel discrete event simulators; rollback/recovery mechanism; shared memory multi-core systems; time warp simulations; virtual clock model; workload migration; Acceleration; Clocks; Frequency control; Frequency measurement; Multicore processing; Program processors; Synchronization; many-core processors; parallel simulation; run time tuning; time warp synchronization;
Conference_Titel :
Principles of Advanced and Distributed Simulation (PADS), 2012 ACM/IEEE/SCS 26th Workshop on
Conference_Location :
Zhangjiajie
Print_ISBN :
978-1-4673-1797-9
DOI :
10.1109/PADS.2012.15