Accelerator-rich CMPs: From concept to real hardware

Author

Yu-Ting Chen ; Cong, J. ; Ghodrat, Mohammad Ali ; Huang, Meng ; Chunyue Liu ; Bingjun Xiao ; Yi Zou

Author_Institution

Comput. Sci. Dept., Univ. of California, Los Angeles, Los Angeles, CA, USA

fYear

2013

fDate

6-9 Oct. 2013

Firstpage

169

Lastpage

176

Abstract

Application-specific accelerators provide 10-100× improvement in power efficiency over general-purpose processors. The accelerator-rich architectures are especially promising. This work discusses a prototype of accelerator-rich CMPs (PARC). During our development of PARC in real hardware, we encountered a set of technical challenges and proposed corresponding solutions. First, we provided system IPs that serve a sea of accelerators to transfer data between userspace and accelerator memories without cache overhead. Second, we designed a dedicated interconnect between accelerators and memories to enable memory sharing. Third, we implemented an accelerator manager to virtualize accelerator resources for users. Finally, we developed an automated flow with a number of IP templates and customizable interfaces to a C-based synthesis flow to enable rapid design and update of PARC. We implemented PARC in a Virtex-6 FPGA chip with integration of platform-specific peripherals and booting of unmodified Linux. Experimental results show that PARC can fully exploit the energy benefits of accelerators at little system overhead.

Keywords

cache storage; computer architecture; logic circuits; logic design; multiprocessing systems; multiprocessor interconnection networks; virtualisation; C-based synthesis flow; IP template; PARC design; Virtex-6 FPGA chip; accelerator manager; accelerator memory; accelerator resource virtualization; accelerator-rich CMP; accelerator-rich architecture; application-specific accelerator; cache overhead; data transfer; dedicated interconnect design; general-purpose processors; memory sharing; platform-specific peripherals; power efficiency; system IP; system overhead; technical challenges; unmodified Linux booting; userspace memory; Acceleration; Data transfer; Field programmable gate arrays; Hardware; Program processors; Resource management; System-on-chip; FPGA; computer architecture; customizable computing; design automation; prototyping;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Design (ICCD), 2013 IEEE 31st International Conference on

Conference_Location

Asheville, NC

Type

conf

DOI

10.1109/ICCD.2013.6657039

Filename

6657039