DocumentCode
2757788
Title
Software Programmable Data Allocation in Multi-bank Memory of SIMD Processors
Author
Wang, Jian ; Sohl, Joar ; Kraigher, Olof ; Liu, Dake
Author_Institution
Dept. of Electr. Eng., Linkoping Univ., Linköping, Sweden
fYear
2010
fDate
1-3 Sept. 2010
Firstpage
28
Lastpage
33
Abstract
The host-SIMD style heterogeneous multi-processor architecture offers high computing performance and user friendly programmability. It explores both task level parallelism and data level parallelism by the on-chip multiple SIMD coprocessors. For embedded DSP applications with predictable computing feature, this architecture can be further optimized for performance, implementation cost and power consumption. The optimization could be done by improving the SIMD processing efficiency and reducing redundant memory accesses and data shuffle operations. This paper introduces one effective approach by designing a software programmable multi-bank memory system for SIMD processors. Both the hardware architecture and software programming model are described in this paper, with an implementation example of the BLAS syrk routine. The proposed memory system offers high SIMD data access flexibility by using lookup table based address generators, and applying data permutations on both DMA controller interface and SIMD data access. The evaluation results show that the SIMD processor with this memory system can achieve high execution efficiency, with only 10% to 30% overhead. The proposed memory system also saves the implementation cost on SIMD local registers, in our system, each SIMD core has only 8 128-bit vector registers.
Keywords
coprocessors; digital signal processing chips; embedded systems; multiprocessing systems; parallel processing; BLAS syrk routine; SIMD processors; data level parallelism; data permutations; digital signal processing; embedded DSP applications; heterogeneous multiprocessor architecture; lookup table based address generators; multibank memory; on-chip coprocessors; software programmable data allocation; task level parallelism; Coprocessors; Digital signal processing; Program processors; Registers; Table lookup; SIMD processor; conflict-free memory access; data allocation; multi-bank memory;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital System Design: Architectures, Methods and Tools (DSD), 2010 13th Euromicro Conference on
Conference_Location
Lille
Print_ISBN
978-1-4244-7839-2
Type
conf
DOI
10.1109/DSD.2010.26
Filename
5615613
Link To Document