DocumentCode
2343423
Title
DMA++: on the fly data realignment for on-chip memories
Author
Vujic, Nikola ; Gonzalez, Marc ; Cabarcas, Felipe ; Ramirez, Alex ; Martorell, Xavier ; Ayguade, Eduard
Author_Institution
Barcelona Supercomputing Center, Barcelona, Spain
fYear
2010
fDate
9-14 Jan. 2010
Firstpage
1
Lastpage
12
Abstract
Multimedia extensions based on Single-Instruction Multiple-Data (SIMD) units are widespread. They are used both in processors and accelerators (e.g., the Cell SPEs), since some time ago. SIMD units have usually big memory alignment constraints in order to meet power requirements and design simplicity. This increases the complexity of the code generated by the compiler, as in the general case, the compiler cannot be sure of the proper alignment of data. For that, the ISA provides either unaligned memory load and store instructions, or a special set of instructions to perform the realignments in software. In this paper, we propose a hardware realignment unit that takes advantage of the DMA transfers needed in accelerators with local memories. While the data is being transferred, it is realigned on the fly by our realignment unit, and stored with the proper alignment in the accelerator memory. The accelerator can then access the data with no special instructions. Finally, the data is realigned properly also when put back to main memory. Our experiments with four applications show that with our approach, the bandwidth of the DMA transfers is not penalized. And the performance of the synthetic benchmarks shows that aligned code is 1.5 to 2 times better with respect using unaligned code.
Keywords
Bandwidth; Computer architecture; Digital signal processing; Hardware; Instruction sets; Memory management; Multicore processing; Multimedia computing; Random access memory; Software performance;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computer Architecture (HPCA), 2010 IEEE 16th International Symposium on
Conference_Location
Bangalore, India
ISSN
1530-0897
Print_ISBN
978-1-4244-5658-1
Type
conf
DOI
10.1109/HPCA.2010.5463057
Filename
5463057
Link To Document