DocumentCode :
1927500
Title :
FPGA implementation of a configurable cache/scratchpad memory with virtualized user-level RDMA capability
Author :
Kalokerinos, George ; Papaefstathiou, Vassilis ; Nikiforos, George ; Kavadias, Stamatis ; Katevenis, Manolis ; Pnevmatikatos, Dionisios ; Yang, Xiaojun
Author_Institution :
Inst. of Comput. Sci., FORTH, Heraklion, Greece
fYear :
2009
fDate :
20-23 July 2009
Firstpage :
149
Lastpage :
156
Abstract :
We report on the hardware implementation of a local memory system for individual processors inside future chip multiprocessors (CMP). It intends to support both implicit communication, via caches, and explicit communication, via directly accessible local (ldquoscratchpadrdquo) memories and remote DMA (RDMA). We provide run-time configurability of the SRAM blocks near each processor, so that part of them operates as 2nd level (local) cache, while the rest operates as scratchpad. We also strive to merge the communication subsystems required by the cache and scratchpad into one integrated Network Interface (NI) and Cache Controller (CC), in order to economize on circuits. The processor communicates with the NI in user-level, through virtualized command areas in scratchpad; through a similar mechanism, the NI also provides efficient support for synchronization, using two hardware primitives: counters, and queues. We describe the block diagram, the hardware cost, and the latencies of our FPGA-based prototype implementation, which integrates four MicroBlaze processors, each with 64 KBytes of local SRAM, a crossbar NoC, and a DRAM controller on a Xilinx-5 FPGA. One-way, end-to-end, user-level communication completes within about 30 clock cycles for short transfer sizes.
Keywords :
SRAM chips; cache storage; counting circuits; field programmable gate arrays; memory architecture; microprocessor chips; network interfaces; virtual storage; FPGA implementation; MicroBlaze processors; cache controller; chip multiprocessors; configurable cache; counters; network interface; queues; remote direct memory access; run-time SRAM configurability; scratchpad memory; scratchpad virtualized command; user-level RDMA capability; Communication system control; Costs; Counting circuits; Delay; Field programmable gate arrays; Hardware; Network interfaces; Random access memory; Runtime; Synchronization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Architectures, Modeling, and Simulation, 2009. SAMOS '09. International Symposium on
Conference_Location :
Samos
Print_ISBN :
978-1-4244-4502-8
Type :
conf
DOI :
10.1109/ICSAMOS.2009.5289226
Filename :
5289226
Link To Document :
بازگشت