DocumentCode :
2124012
Title :
Accelerator-rich CMPs: From concept to real hardware
Author :
Yu-Ting Chen ; Cong, J. ; Ghodrat, Mohammad Ali ; Huang, Meng ; Chunyue Liu ; Bingjun Xiao ; Yi Zou
Author_Institution :
Comput. Sci. Dept., Univ. of California, Los Angeles, Los Angeles, CA, USA
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
169
Lastpage :
176
Abstract :
Application-specific accelerators provide 10-100× improvement in power efficiency over general-purpose processors. The accelerator-rich architectures are especially promising. This work discusses a prototype of accelerator-rich CMPs (PARC). During our development of PARC in real hardware, we encountered a set of technical challenges and proposed corresponding solutions. First, we provided system IPs that serve a sea of accelerators to transfer data between userspace and accelerator memories without cache overhead. Second, we designed a dedicated interconnect between accelerators and memories to enable memory sharing. Third, we implemented an accelerator manager to virtualize accelerator resources for users. Finally, we developed an automated flow with a number of IP templates and customizable interfaces to a C-based synthesis flow to enable rapid design and update of PARC. We implemented PARC in a Virtex-6 FPGA chip with integration of platform-specific peripherals and booting of unmodified Linux. Experimental results show that PARC can fully exploit the energy benefits of accelerators at little system overhead.
Keywords :
cache storage; computer architecture; logic circuits; logic design; multiprocessing systems; multiprocessor interconnection networks; virtualisation; C-based synthesis flow; IP template; PARC design; Virtex-6 FPGA chip; accelerator manager; accelerator memory; accelerator resource virtualization; accelerator-rich CMP; accelerator-rich architecture; application-specific accelerator; cache overhead; data transfer; dedicated interconnect design; general-purpose processors; memory sharing; platform-specific peripherals; power efficiency; system IP; system overhead; technical challenges; unmodified Linux booting; userspace memory; Acceleration; Data transfer; Field programmable gate arrays; Hardware; Program processors; Resource management; System-on-chip; FPGA; computer architecture; customizable computing; design automation; prototyping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design (ICCD), 2013 IEEE 31st International Conference on
Conference_Location :
Asheville, NC
Type :
conf
DOI :
10.1109/ICCD.2013.6657039
Filename :
6657039
Link To Document :
بازگشت