DocumentCode :
2235864
Title :
MARC: A Many-Core Approach to Reconfigurable Computing
Author :
Lebedev, Ilia ; Cheng, Shaoyi ; Doupnik, Austin ; Martin, James ; Fletcher, Christopher ; Burke, Daniel ; Lin, Mingjie ; Wawrzynek, John
Author_Institution :
Dept. of EECS, Univ. of California at Berkeley, Berkeley, CA, USA
fYear :
2010
fDate :
13-15 Dec. 2010
Firstpage :
7
Lastpage :
12
Abstract :
We present a Many-core Approach to Reconfigurable Computing (MARC), enabling efficient high-performance computing for applications expressed using parallel programming models such as OpenCL. The MARC system exploits abundant special FPGA resources such as distributed block memories and DSP blocks to implement complete single-chip high efficiency many-core micro architectures. The key benefits of MARC are that it (i) allows programmers to easily express parallelism through the API defined in a high-level programming language, (ii) supports coarse-grain multithreading and dataflow-style fine-grain threading while permitting bit-level resource control, and (iii) greatly reduces the effort required to re-purpose the hardware system for different algorithms or different applications. A MARC prototype machine with 48 processing nodes was implemented using a Virtex-5 (XCV5LX155T-2) FPGA for a well known Bayesian network inference problem. We compare the runtime of the MARC machine against a manually optimized implementation. With fully synthesized application-specific processing cores, our MARC machine comes within a factor of 3 of the performance of a fully optimized FPGA solution but with a considerable reduction in development effort and a significant increase in retarget ability.
Keywords :
application program interfaces; belief networks; data flow computing; field programmable gate arrays; high level languages; inference mechanisms; multi-threading; multiprocessing programs; multiprocessing systems; optimising compilers; reconfigurable architectures; API; Bayesian network inference problem; MARC prototype machine; Virtex-5 FPGA; bit-level resource control; coarse-grain multithreading; dataflow-style fine-grain threading; high-level programming language; high-performance computing; optimising compiler; parallel programming model; reconfigurable computing; single-chip high efficiency many-core microarchitecture; Compiler; FPGA; Many-Core; Performance; Reconfigurable Computing; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reconfigurable Computing and FPGAs (ReConFig), 2010 International Conference on
Conference_Location :
Quintana Roo
Print_ISBN :
978-1-4244-9523-8
Electronic_ISBN :
978-0-7695-4314-7
Type :
conf
DOI :
10.1109/ReConFig.2010.49
Filename :
5695273
Link To Document :
بازگشت