Title :
Extracting memory-level parallelism through reconfigurable hardware traces
Author :
Mingjie Lin ; Shaoyi Cheng ; Wawrzynek, J.
Abstract :
This paper proposes a new FPGA-based embedded computer architecture that focuses on constructing an application-specific memory access network capable of extracting the maximum amount of memory-level parallelism on a per-application basis. Specifically, by performing dynamic memory analysis and exploiting two capabilities of modern FPGA devices, abundant distributed block RAMs and programmability, the proposed reconfigurable architecture synthesizes highly efficient accelerators that enable parallelized memory accesses and therefore accomplish effective data orchestration by maximally extracting the target application's instruction-, loop-, and memory-level parallelism. To validate the proposed architecture, we implemented a baseline embedded processor platform, a conventional CPU+accelerator with a centralized single memory, and a prototype based on Xilinx MicroBlaze technology. Our experimental results show that, on average over 5 benchmark applications from SPEC2006 and MiBench [1], the proposed architecture achieves an 8.6 times speedup compared to the baseline embedded processor platform and a 1.7 times speedup compared to a conventional CPU+accelerator platform. More interestingly, the proposed platform achieves a reduction of more than 40% in energy-delay product compared to a conventional CPU+accelerator with a centralized memory.
Keywords :
embedded systems; field programmable gate arrays; memory architecture; microprocessor chips; random-access storage; reconfigurable architectures; FPGA devices; FPGA-based embedded computer architecture; MiBench; SPEC2006; Xilinx MicroBlaze technology; application-specific memory access network; baseline embedded processor platform; centralized memory; centralized single memory; conventional CPU+accelerator; data orchestration; distributed block RAM; dynamic memory analysis; energy-delay product; memory-level parallelism; parallelized memory accesses; programmability; reconfigurable architecture; reconfigurable hardware traces; Acceleration; Coherence; Computer architecture; Field programmable gate arrays; Hardware; Parallel processing; Performance evaluation
Conference_Title :
Reconfigurable Computing and FPGAs (ReConFig), 2013 International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4799-2078-5
DOI :
10.1109/ReConFig.2013.6732290