• DocumentCode
    2124012
  • Title

    Accelerator-rich CMPs: From concept to real hardware

  • Author

    Yu-Ting Chen ; Cong, J. ; Ghodrat, Mohammad Ali ; Huang, Meng ; Chunyue Liu ; Bingjun Xiao ; Yi Zou

  • Author_Institution
    Comput. Sci. Dept., Univ. of California, Los Angeles, Los Angeles, CA, USA
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    169
  • Lastpage
    176
  • Abstract
    Application-specific accelerators provide 10-100× improvement in power efficiency over general-purpose processors. The accelerator-rich architectures are especially promising. This work discusses a prototype of accelerator-rich CMPs (PARC). During our development of PARC in real hardware, we encountered a set of technical challenges and proposed corresponding solutions. First, we provided system IPs that serve a sea of accelerators to transfer data between userspace and accelerator memories without cache overhead. Second, we designed a dedicated interconnect between accelerators and memories to enable memory sharing. Third, we implemented an accelerator manager to virtualize accelerator resources for users. Finally, we developed an automated flow with a number of IP templates and customizable interfaces to a C-based synthesis flow to enable rapid design and update of PARC. We implemented PARC in a Virtex-6 FPGA chip with integration of platform-specific peripherals and booting of unmodified Linux. Experimental results show that PARC can fully exploit the energy benefits of accelerators at little system overhead.
  • Keywords
    cache storage; computer architecture; logic circuits; logic design; multiprocessing systems; multiprocessor interconnection networks; virtualisation; C-based synthesis flow; IP template; PARC design; Virtex-6 FPGA chip; accelerator manager; accelerator memory; accelerator resource virtualization; accelerator-rich CMP; accelerator-rich architecture; application-specific accelerator; cache overhead; data transfer; dedicated interconnect design; general-purpose processors; memory sharing; platform-specific peripherals; power efficiency; system IP; system overhead; technical challenges; unmodified Linux booting; userspace memory; Acceleration; Data transfer; Field programmable gate arrays; Hardware; Program processors; Resource management; System-on-chip; FPGA; computer architecture; customizable computing; design automation; prototyping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design (ICCD), 2013 IEEE 31st International Conference on
  • Conference_Location
    Asheville, NC
  • Type

    conf

  • DOI
    10.1109/ICCD.2013.6657039
  • Filename
    6657039