Title :
Procrustes1: Power Constrained Performance Improvement Using Extended Maximize-Then-Swap Algorithm
Author :
Guangshuo Liu ; Jinpyo Park ; Marculescu, Diana
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
This paper proposes an efficient algorithm that maximizes performance under power constraints and is applicable in the general context of traditional dynamic voltage/frequency (V/P) scaling, or core heterogeneity and emerging dynamic micro-architectural adaptation. Performance maximization in these scenarios can be essentially viewed as mapping application threads to appropriate core states that have various power/performance characteristics. Such problems are formulated as a generic 0-1 integer linear program (ILP). The proposed algorithm is an iterative heuristic-based solution. Compared with an optimal solution generated by commercial ILP solver, the proposed algorithm produces results less than 1% away from optimum on average, with more than two orders of magnitude improvement in runtime. The algorithm can be brought online for hundred-core heterogeneous systems as it scales to systems comprised of 256 cores with less than 1 ms in overhead in worst cases. The intrinsic history awareness also provides flexibility to control cost induced by switching V/F pairs, migrating threads across cores, or tuning on/off micro-architectural resources.
Keywords :
integer programming; linear programming; mathematics computing; multiprocessing systems; performance evaluation; power aware computing; commercial ILP solver; core heterogeneity; dynamic microarchitectural adaptation; dynamic voltage-frequency scaling; extended maximize-then-swap algorithm; generic 0-1 integer linear program; hundred-core heterogeneous systems; iterative heuristic-based solution; magnitude improvement; on-off microarchitectural resource tuning; performance maximization; power constrained performance improvement; power-performance characteristics; Algorithm design and analysis; Heuristic algorithms; Instruction sets; Multicore processing; Power demand; Runtime; Throughput; DVFS; Dynamic adaptation; Maximize-then-swap; dynamic adaptation; dynamic voltage/frequency scaling (DVFS); heterogeneous many-core; maximize-then-swap (MTS); performance maximization;
Journal_Title :
Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on
DOI :
10.1109/TCAD.2015.2421911