Title :
RACB: Resource Aware Cache Bypass on GPUs
Author :
Hongwen Dai ; Kartsaklis, C. ; Chao Li ; Janjusic, T. ; Huiyang Zhou
Author_Institution :
Dept. of Electr. & Comput. Eng., North Carolina State Univ., Raleigh, NC, USA
Abstract :
Caches are universally used in computing systems to hide long off-chip memory access latencies. Unlike CPUs, massive threads running simultaneously on GPUs bring a tremendous pressure on memory hierarchy. As a result, the limitation of cache resources becomes a bottleneck for a GPU to exploit thread-level parallelism (TLP) and memory-level parallelism (MLP) and achieve high performance. In this paper, we propose a mechanism to bypass L1D and L2 cache based on the availability of cache resources. Our proposed mechanism is based on the observation that a huge number of stalls coming from limited cache resources prohibit GPUs from providing a higher throughput. So we propose Resource Aware Cache Bypass (RACB) with minor hardware changes to eliminate such stalls to improve performance. We examine the effectiveness of this approach when applied to L1D and L2 cache separately as well as together. Evaluation results with NVIDIA Computing SDK show that RACB generally improves performance the most when applied to both L1D and L2 cache, which is up to 88.05% and on an average of 16.73%, additionally, energy is saved up to 22.35% and on an average of 5.88% with minor hardware overheads.
Keywords :
cache storage; graphics processing units; multi-threading; software tools; CPU; GPU; L1D cache; L2 cache; MLP; NVIDIA Computing SDK; RACB; TLP; cache resources; computing systems; long off-chip memory access latencies; memory hierarchy; memory-level parallelism; resource aware cache bypass; thread-level parallelism; Benchmark testing; Graphics processing units; Hardware; Instruction sets; Memory management; Parallel processing; GPU; TLP; bypass; cache resource limitation; hardware;
Conference_Titel :
Computer Architecture and High Performance Computing Workshop (SBAC-PADW), 2014 International Symposium on
Conference_Location :
Paris
DOI :
10.1109/SBAC-PADW.2014.14