DocumentCode :
3585583
Title :
Improve memory access for achieving both performance and energy efficiencies on heterogeneous systems
Author :
Hongyuan Ding ; Miaoqing Huang
Author_Institution :
Dept. of Comput. Sci. & Comput. Eng., Univ. of Arkansas, Fayetteville, AR, USA
fYear :
2014
Firstpage :
91
Lastpage :
98
Abstract :
Hardware accelerators are capable of achieving significant performance improvement for many applications. In this work we demonstrate that it is critical to provide sufficient memory access bandwidth for accelerators to improve the performance and reduce energy consumption. We use the scale-invariant feature transform (SIFT) algorithm as a case study in which three bottleneck stages are accelerated on hardware logic. Based on different memory access patterns of SIFT algorithms, two different approaches are designed to accelerate different functions in SIFT on the Xilinx Zynq-7045 device. In the first approach, convolution is accelerated by designing fully customized hardware accelerator. On top of it, three interfacing methods are analyzed. In the second approach, a distributed multi-processor hardware system with its programming model is built to handle inconsecutive memory accesses. Furthermore, the last level cache (LLC) on the host processor is shared by all slaves to achieve better performance. Experiment results on the Zynq-7045 device show that the hybrid design in which two approaches are combined can achieve ~10 times and better improvement for both performance improvement and energy reduction compared with the pure software implementation for the convolution stage and the SIFT algorithm, respectively.
Keywords :
convolution; energy conservation; energy consumption; integrated memory circuits; scaling phenomena; LLC; SIFT algorithm; Xilinx Zynq-7045 device; convolution; distributed multiprocessor hardware system; energy consumption reduction; energy efficiency; hardware accelerator; hardware logic; heterogeneous system; last level cache; memory access bandwidth; scale-invariant feature transform algorithm; Acceleration; Algorithm design and analysis; Convolution; Hardware; Performance evaluation; Registers; Software algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Technology (FPT), 2014 International Conference on
Print_ISBN :
978-1-4799-6244-0
Type :
conf
DOI :
10.1109/FPT.2014.7082759
Filename :
7082759
Link To Document :
بازگشت