• DocumentCode
    3691882
  • Title

    The Approximate Discrete Radon Transform: A Case Study in Auto-Tuning of OpenCL Implementations

  • Author

    Bücker;Ralf Seidler; Neuhäuser;Tobias Beier

  • Author_Institution
    Carl Zeiss Jena GmbH, Jena, Germany
  • fYear
    2015
  • Firstpage
    219
  • Lastpage
    226
  • Abstract
    The Open Computing Language (OpenCL) is designed to provide a platform-independent specification for programming heterogenous computing systems. The performance of an OpenCL program, however, is not easily transferrable from one platform to another. Auto-tuning is among the techniques that address this situation by automating the performance optimization of OpenCL programs via systematically applying program transformations. We introduce a novel auto-tuning framework to generate OpenCL programs and report on a case study computing an approximate discrete Radon transform. Experiments on four different graphics processing units indicate that, for a wide range of problem sizes and input parameters, the execution times of the auto-tuned OpenCL programs are smaller than those of three hand-tuned CUDA implementations.
  • Keywords
    "Kernel","Radon","Transforms","Graphics processing units","Runtime","Arrays"
  • Publisher
    ieee
  • Conference_Titel
    Embedded Multicore/Many-core Systems-on-Chip (MCSoC), 2015 IEEE 9th International Symposium on
  • Type

    conf

  • DOI
    10.1109/MCSoC.2015.38
  • Filename
    7328208