DocumentCode :
2124315
Title :
FastLanes: An FPGA accelerated GPU microarchitecture simulator
Author :
Kuan Fang ; Yufei Ni ; Jiayuan He ; Zonghui Li ; Shuai Mu ; Yangdong Deng
Author_Institution :
Inst. of Microelectron., Tsinghua Univ., Beijing, China
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
241
Lastpage :
248
Abstract :
Graphic Processing Units (GPUs) have emerged as a new general purpose computing platform that attracts significant research efforts. Currently, GPU architecture research resorts to time-consuming software simulations to evaluate microarchitecture innovations. In this paper, we propose FastLanes, an FPGA based simulator for a generic GPU microarchitecture, to enable hardware-accelerated simulation. FastLanes consists of a function model and a timing model, both implemented on FPGA. The functional model implements the full functionality of a multiprocessor of GPU and emulates multiple multiprocessors via time-division multiplexing. We develop a hybrid implementation strategy in which certain GPU logic is directly mapped to FPGA while the other logic is simulated by reusing the same FPGA logic. A corresponding context shifting mechanism is proposed to store execution states of threads from FPGA to external on-board memory, and vice versa. Such a mechanism makes it possible to simulate hundreds of GPU cores on a single FPGA evaluation board. Driven by the functional simulation results, the timing model considers the detailed configuration of GPU microarchitecture to derive the performance evaluation. A compiler tool-chain is also developed to allow the execution of NVIDIA GPU binary on FastLanes. Experimental results prove that FastLanes outperforms its software equivalent by up to 2 orders of magnitude.
Keywords :
field programmable gate arrays; graphics processing units; logic simulation; multiprocessing systems; time division multiplexing; FPGA accelerated GPU microarchitecture simulator; FastLanes; GPU logic; NVIDIA GPU binary; compiler tool-chain; external on-board memory; graphic processing units; hardware-accelerated simulation; threads execution states; time-division multiplexing; Decision support systems; FPGA; GPU; Graphics Processing Unit; SIMD; simulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design (ICCD), 2013 IEEE 31st International Conference on
Conference_Location :
Asheville, NC
Type :
conf
DOI :
10.1109/ICCD.2013.6657049
Filename :
6657049
Link To Document :
بازگشت