مرکز منطقه ای اطلاع رساني علوم و فناوري - Scalable Multi-cache Simulation Using GPUs

DocumentCode :

3089469

Title :

Scalable Multi-cache Simulation Using GPUs

Author :

Moeng, Michael ; Cho, Sangyeun ; Melhem, Rami

Author_Institution :

Dept. of Comput. Sci., Univ. of Pittsburgh, Pittsburgh, PA, USA

fYear :

2011

fDate :

25-27 July 2011

Firstpage :

159

Lastpage :

167

Abstract :

Software simulation is the primary tool used for evaluation of processor design. Simulation offers better accuracy than analytical models and is an important evaluation step before actually fabricating a chip. Unfortunately, simulator speeds are slow -- a conventional cycle-accurate simulator will be unable to keep up with increasing core counts in modern processor design. Parallel simulation is one method for improving simulation speeds. Two major areas of parallel simulation research are multithreaded simulators and FPGAs as simulation accelerators. Multithreaded simulators can only extract coarse-grained parallelism and must sacrifice accuracy in order to scale well. FPGA-based simulators can extract fine-grained parallelism, but are expensive and difficult to program. We propose using GPUs for architectural simulation, which can take advantage of a high degree of fine-grained parallelism. In addition, they are inexpensive and easier to program compared to FPGAs. To demonstrate our ideas, we implement a trace-driven many-cache simulator using NVIDIA´s CUDA toolkit. GPU-accelerated cache simulation displays remarkable scaling with number of simulated caches when compared to serial CPU-only simulation.

Keywords :

coprocessors; multi-threading; CUDA toolkit; FPGA-based simulator; GPU-accelerated cache simulation display; NVIDIA; analytical model; chip fabrication; coarse-grained parallelism; cycle-accurate simulator; fine-grained parallelism; important evaluation step; multithreaded simulator; parallel simulation; processor design; scalable multicache simulation; serial CPU-only simulation; simulation accelerator; software simulation; trace-driven many-cache simulator; Computational modeling; Field programmable gate arrays; Graphics processing unit; Instruction sets; Kernel; Synchronization; CUDA; Cache; General-Purpose GPU; Parallel Architecture; Simulation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE 19th International Symposium on

Conference_Location :

Singapore

ISSN :

1526-7539

Print_ISBN :

978-1-4577-0468-0

Type :

conf

DOI :

10.1109/MASCOTS.2011.24

Filename :

6005357

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3089469