مرکز منطقه ای اطلاع رساني علوم و فناوري - Design of an accelerator-rich architecture by integrating multiple heterogeneous coarse grain reconfigurable arrays over a network-on-chip

DocumentCode :

1772611

Title :

Design of an accelerator-rich architecture by integrating multiple heterogeneous coarse grain reconfigurable arrays over a network-on-chip

Author :

Hussain, Waqar ; Airoldi, Roberto ; Hoffmann, Henry ; Ahonen, Tapani ; Nurmi, Jari

Author_Institution :

Dept. of Electron. & Commun. Eng., Tampere Univ. of Technol., Tampere, Finland

fYear :

2014

fDate :

18-20 June 2014

Firstpage :

131

Lastpage :

138

Abstract :

This paper presents an accelerator-rich system-on-chip (SoC) architecture integrating many heterogeneous Coarse Grain Reconfigurable Arrays (CGRA) connected through a Network-on-Chip (NoC). The architecture is designed to maximize the reconfigurable processing capacity for the execution of massively parallel algorithms. The central node of the NoC contains a Reduced Instruction Set Computer (RISC) core that manages distribution of computing functions and data within the SoC while the other nodes contain CGRAs of application-specific sizes. Prior approaches coupled only a few accelerators with a RISC core using special instructions and/or a direct memory access device. In contrast, our design couples a RISC core to many CGRAs through the NoC. This approach provides for independent and simultaneous execution of multiple computing kernels. Furthermore, the proposed architecture mitigates power dissipation as CGRA sizes are tailored for the individual application kernels. We present a proof-of-concept design with a total of 408 reconfigurable processing elements. This instance and its sub-systems are customized and tested for different computationally-intensive signal processing algorithms. The overall single-chip computing system is synthesized for a Field Programmable Gate Array device. We present comparison to and evaluation against some of the existing multicore systems in terms of multiple performance metrics.

Keywords :

field programmable gate arrays; network-on-chip; reduced instruction set computing; CGRA; FPGA; NoC; RISC core; SoC; accelerator-rich architecture design; application-specific sizes; central node; computationally-intensive signal processing algorithms; direct memory access device; field programmable gate array device; massively parallel algorithms; multicore systems; multiple computing kernels; multiple heterogeneous coarse grain reconfigurable arrays; multiple performance metrics; network-on-chip; proof-of-concept design; reconfigurable processing capacity; reduced instruction set computer core; single-chip computing system; system-on-chip architecture; Acceleration; Data transfer; Kernel; Multicore processing; Reduced instruction set computing; Switches;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Application-specific Systems, Architectures and Processors (ASAP), 2014 IEEE 25th International Conference on

Conference_Location :

Zurich

Type :

conf

DOI :

10.1109/ASAP.2014.6868647

Filename :

6868647

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1772611