Title :
Optimised OpenCL workgroup synthesis for hybrid ARM-FPGA devices
Author :
Mohammad Hosseinabady;Jose Luis Nunez-Yanez
Author_Institution :
Department of Electrical and Electronic Engineering University of Bristol, UK
Abstract :
This paper presents a workgroup synthesis mechanism to compile an OpenCL kernel to FPGA-based accelerators embedded in a multi-core CPU system-on-a-chip (SoC). The OpenCL kernels considered in this paper exhibit regular data access patterns. Coping with the limited amount of internal memory in embedded FPGAs, the workgroup synthesis utilises a novel data access pattern formulation to describe the parallelism already provided by the OpenCL kernels. To provide an OpenCL framework prototype to validate the proposed technique, a source-to-source compiler that transforms the OpenCL kernel into C/C++ code is developed. Then vendor-specific high-level synthesis tools are used to convert the C/C++ code into the FPGA bitstream. Results based on popular real applications show up to 89.8% improvement in the execution time compared to other commercial FPGA OpenCL implementations.
Keywords :
"Field programmable gate arrays","Kernel","Parallel processing","Hardware","Arrays","Optimization","System-on-chip"
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2015 25th International Conference on
DOI :
10.1109/FPL.2015.7294016