• DocumentCode
    3757983
  • Title

    Extensions over OpenCL for Latency Reduction and Critical Applications

  • Author

    Grigore Lupescu;Emil-Ioan Slusanschi;Nicolae Tapus

  • Author_Institution
    Comput. Sci. &
  • fYear
    2015
  • Firstpage
    379
  • Lastpage
    385
  • Abstract
    Hardware and software stack complexity make programming GPGPUs difficult and limit application portability. This article first discusses challenges imposed by the current hardware and software model in GPGPU systems which relies heavily on the HOST device (CPU). We then identify system bottlenecks both in the hardware design and in the software stack and present two ideas to extend the HOST and DEVICE side of the OpenCL API with the aim to improve latency and device safety. As a first goal we target HOST side latency reduction using user synchronization directives. Our second goal was to improve on DEVICE side latency and add safety through a software layer which manages kernel execution. For both HOST and DEVICE side latency reduction we present concrete performance results.
  • Keywords
    "Kernel","Hardware","Computer architecture","Runtime","Program processors","Message systems"
  • Publisher
    ieee
  • Conference_Titel
    Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2015 17th International Symposium on
  • Type

    conf

  • DOI
    10.1109/SYNASC.2015.64
  • Filename
    7426108