Title :
Fine-tuning loop-level parallelism for increasing performance of DSP applications on FPGAs
Author :
Özer, Emre ; Nisbet, Andy P. ; Gregg, David
Author_Institution :
Dept. of Comput. Sci., Trinity Coll., Dublin, Ireland
Abstract :
This paper discusses the balance between loop-level parallelism and clock rate for enhancing the performance of DSP applications fully implemented on FPGAs. Loop-level parallelism reduces the total cycle count of an application at the cost of increased routing complexity, which often results in lower clock rates. We analyze loops that can be fully parallelized and show that controlling the number of parallel iterations of a loop can achieve better performance than fully parallelizing it. We have implemented loop parallelism in our compilation framework and fine-tuned it to enhance the performance of DSP applications that target the Xilinx Virtex-II FPGA chip. Our experimental results show that it is possible to reach a performance equilibrium point where the total number of cycles and the overall clock frequency can be adjusted to maximize the overall performance of an application.
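The trade-off described in the abstract can be illustrated with a small sketch. It is not the authors' compilation framework or cost model: the function names, the cycles-per-iteration figure, the 150 MHz base clock, and the quadratic routing penalty are all hypothetical values chosen only to show how an intermediate degree of parallelism can beat a fully parallel loop when execution time is taken as cycles divided by clock frequency.

```python
import math

def total_cycles(n_iters, p, cycles_per_iter=4):
    """Cycle count when p loop iterations execute in parallel.

    cycles_per_iter is a hypothetical per-iteration cost.
    """
    return math.ceil(n_iters / p) * cycles_per_iter

def clock_mhz(p, f_max=150.0, penalty=0.01):
    """Hypothetical clock model (assumption, not measured data):
    routing complexity grows with p and lowers the achievable clock."""
    return f_max / (1.0 + penalty * (p - 1) ** 2)

def exec_time_us(n_iters, p):
    """Execution time in microseconds: cycles divided by frequency in MHz."""
    return total_cycles(n_iters, p) / clock_mhz(p)

def best_parallelism(n_iters, candidates):
    """Pick the candidate degree of parallelism with the lowest execution time."""
    return min(candidates, key=lambda p: exec_time_us(n_iters, p))

if __name__ == "__main__":
    n = 64  # loop trip count (hypothetical)
    for p in [1, 2, 4, 8, 16, 32, 64]:
        print(p, total_cycles(n, p), round(clock_mhz(p), 1), round(exec_time_us(n, p), 3))
    print("best p:", best_parallelism(n, [1, 2, 4, 8, 16, 32, 64]))
```

Under these assumed numbers the lowest execution time falls at 8 parallel iterations rather than at the fully parallel 64, which is the kind of equilibrium point the abstract refers to. With a different penalty model the optimum would move, so the sketch shows only the shape of the search.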
Keywords :
digital signal processing chips; field programmable gate arrays; DSP applications; Xilinx Virtex-II FPGA chip; clock frequency; fine tuning loop level parallelism; parallel iterations; parallel loops; routing complexity; Concurrent computing; Digital signal processing; Field programmable gate arrays
Conference_Title :
Field-Programmable Custom Computing Machines, 2004. FCCM 2004. 12th Annual IEEE Symposium on
Print_ISBN :
0-7695-2230-0
DOI :
10.1109/FCCM.2004.32