Title :
GPGPU-MiniBench: Accelerating GPGPU Micro-Architecture Simulation
Author :
Zhibin Yu ; Eeckhout, Lieven ; Goswami, Nilanjan ; Tao Li ; John, Lizy K. ; Hai Jin ; Chengzhong Xu ; Junmin Wu
Author_Institution :
Cloud Comput. Center, Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
Graphics processing units (GPU), due to their massive computational power with up to thousands of concurrent threads and general-purpose GPU (GPGPU) programming models such as CUDA and OpenCL, have opened up new opportunities for speeding up general-purpose parallel applications. Unfortunately, pre-silicon architectural simulation of modern-day GPGPU architectures and workloads is extremely time-consuming. This paper addresses the GPGPU simulation challenge by proposing a framework, called GPGPU-MiniBench, for generating miniature, yet representative GPGPU workloads. GPGPU-MiniBench first summarizes the inherent execution behavior of existing GPGPU workloads in a profile. The central component in the profile is the Divergence Flow Statistics Graph (DFSG), which characterizes the dynamic control flow behavior including loops and branches of a GPGPU kernel. GPGPU-MiniBench generates a synthetic miniature GPGPU kernel that exhibits similar execution characteristics as the original workload, yet its execution time is much shorter thereby dramatically speeding up architectural simulation. Our experimental results show that GPGPU-MiniBench can speed up GPGPU architectural simulation by a factor of 49× on average and up to 589×, with an average IPC error of 4.7 percent across a broad set of GPGPU benchmarks from the CUDA SDK, Rodinia and Parboil benchmark suites. We also demonstrate the usefulness of GPGPU-MiniBench for driving GPU architecture exploration.
Keywords :
graph theory; graphics processing units; statistical analysis; DFSG; GPGPU micro-architecture simulation; GPGPU-minibench; divergence flow statistics graph; dynamic control flow; general-purpose parallel applications; graphics processing units; synthetic miniature GPGPU kernel; Acceleration; Benchmark testing; Computational modeling; Computer architecture; Graphics processing units; Instruction sets; Kernel; GPGPU; Performance evaluation; computer architecture; general-purpose processing unit (GPGPU); simulation acceleration; synthetic workload generation; workload characterization; workload synthesis;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.2015.2395427