DocumentCode
3669001
Title
Scavenger: Automating the construction of application-optimized memory hierarchies
Author
Hsin-Jung Yang;Kermin Fleming;Michael Adler;Felix Winterstein;Joel Emer
Author_Institution
CSAIL, Massachusetts Institute of Technology, USA
fYear
2015
Firstpage
1
Lastpage
8
Abstract
High-level abstractions separate algorithm design from platform implementation, allowing programmers to focus on algorithms while building increasingly complex systems. This separation also provides system programmers and compilers an opportunity to optimize platform services for each application. In FPGAs, this platform-level malleability extends to the memory system: unlike general-purpose processors, in which memory hardware is fixed at design time, the capacity, associativity, and topology of FPGA memory systems may all be tuned to improve application performance. Since application kernels often use few memory resources, substantial memory capacity may be available to the platform for use on behalf of the user program. In this work, we perform an initial exploration of methods for automating the construction of these application-specific memory hierarchies. Although exploiting spare resources can be beneficial, naïvely consuming all memory resources may cause frequency degradation. To relieve timing pressure in large BRAM structures, we provide microarchitectural techniques to trade memory latency for design frequency. We demonstrate, by examining both hand-assembled and HLS-compiled benchmarks, that our application-optimized memory system can improve pre-existing application runtime by 25% on average.
Keywords
"Field programmable gate arrays","Program processors","System-on-chip","Microarchitecture","Memory management","Random access memory","Kernel"
Publisher
ieee
Conference_Titel
Field Programmable Logic and Applications (FPL), 2015 25th International Conference on
Type
conf
DOI
10.1109/FPL.2015.7294018
Filename
7294018
Link To Document