DocumentCode :
1799885
Title :
GPUMech: GPU Performance Modeling Technique Based on Interval Analysis
Author :
Jen-Cheng Huang ; Joo Hwan Lee ; Hyesoon Kim ; Lee, Hsien-Hsin S.
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2014
fDate :
13-17 Dec. 2014
Firstpage :
268
Lastpage :
279
Abstract :
GPU has become a first-order computing plat-form. Nonetheless, not many performance modeling techniques have been developed for architecture studies. Several GPU analytical performance models have been proposed, but they mostly target application optimizations rather than the study of different architecture design options. Interval analysis is a relatively accurate performance modeling technique, which traverses the instruction trace and uses functional simulators, e.g., Cache simulator, to track the stall events that cause performance loss. It shows hundred times of speedup compared to detailed timing simulations and better accuracy compared to pure analytical models. However, previous techniques are limited to CPUs and not applicable to multithreaded architectures. In this work, we propose GPU Mech, an interval analysis-based performance modeling technique for GPU architectures. GPU Mech models multithreading and resource contentions caused by memory divergence. We compare GPU Mech with a detailed timing simulator and show that on average, GPU Mechhas 13.2% error for modeling the round-robin scheduling policy and 14.0% error for modeling the greedy-then-oldest policy while achieving a 97x faster simulation speed. In addition, GPU Mech generates CPI stacks, which help hardware/software developers to visualize performance bottlenecks of a kernel.
Keywords :
graphics processing units; multi-threading; performance evaluation; CPI stacks; GPU architectures; GPU performance modeling technique; GPUMech; greedy-then-oldest policy; interval analysis; memory divergence; multithreading; resource contentions; round-robin scheduling policy; timing simulator; Analytical models; Computer architecture; Graphics processing units; Kernel; Multithreading; Timing; Vectors; GPGPU; interval analysis; performance modeling; simulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Microarchitecture (MICRO), 2014 47th Annual IEEE/ACM International Symposium on
Conference_Location :
Cambridge
ISSN :
1072-4451
Type :
conf
DOI :
10.1109/MICRO.2014.59
Filename :
7011394
Link To Document :
بازگشت