DocumentCode
3697101
Title
Hardware-Based and Hybrid L1 Data Cache Bypassing to Improve GPU Performance
Author
Yijie Huangfu;Wei Zhang
Author_Institution
Dept. of Electr. &
fYear
2015
Firstpage
972
Lastpage
976
Abstract
Intelligent GPU cache bypassing can improve the efficiency of using GPU memory bandwidth, which can benefit GPU performance. In this paper, we study a pure hardware-based GPU cache bypassing method that can be applied to GPU applications without having to recompile the programs. Moreover, we introduce a hybrid method that can exploit profiling information to further enhance the hardware-based bypassing. Our experimental results show that the hardware-based cache bypassing can improve performance for most benchmarks, and the hybrid method can achieve performance comparable to the state-of-the-art compiler-based bypassing with much less profiling cost.
Keywords
"Graphics processing units","Benchmark testing","Instruction sets","Cache memory","Kernel","Runtime"
Publisher
IEEE
Conference_Title
High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conference on Embedded Software and Systems (ICESS), 2015 IEEE 17th International Conference on
Type
conf
DOI
10.1109/HPCC-CSS-ICESS.2015.248
Filename
7336296