• DocumentCode
    3697101
  • Title

    Hardware-Based and Hybrid L1 Data Cache Bypassing to Improve GPU Performance

  • Author

    Yijie Huangfu;Wei Zhang

  • Author_Institution
    Dept. of Electr. &
  • fYear
    2015
  • Firstpage
    972
  • Lastpage
    976
  • Abstract
    Intelligent GPU cache bypassing can improve the efficiency of using GPU memory bandwidth, which can benefit GPU performance. In this paper, we study a pure hardware-based GPU cache bypassing method that can be applied to GPU applications without having to recompile the programs. Moreover, we introduce a hybrid method that can exploit profiling information to further enhance the hardware-based bypassing. Our experimental results show that the hardware-based cache bypassing can improve performance for most benchmarks, and the hybrid method can achieve performance comparable to the state-of-the-art compiler-based bypassing with much less profiling cost.
  • Keywords
    "Graphics processing units","Benchmark testing","Instruction sets","Cache memory","Kernel","Runtime"
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen on Embedded Software and Systems (ICESS), 2015 IEEE 17th International Conference on
  • Type

    conf

  • DOI
    10.1109/HPCC-CSS-ICESS.2015.248
  • Filename
    7336296