• DocumentCode
    2757788
  • Title

    Software Programmable Data Allocation in Multi-bank Memory of SIMD Processors

  • Author

    Wang, Jian ; Sohl, Joar ; Kraigher, Olof ; Liu, Dake

  • Author_Institution
    Dept. of Electr. Eng., Linkoping Univ., Linköping, Sweden
  • fYear
    2010
  • fDate
    1-3 Sept. 2010
  • Firstpage
    28
  • Lastpage
    33
  • Abstract
    The host-SIMD style heterogeneous multi-processor architecture offers high computing performance and user friendly programmability. It explores both task level parallelism and data level parallelism by the on-chip multiple SIMD coprocessors. For embedded DSP applications with predictable computing feature, this architecture can be further optimized for performance, implementation cost and power consumption. The optimization could be done by improving the SIMD processing efficiency and reducing redundant memory accesses and data shuffle operations. This paper introduces one effective approach by designing a software programmable multi-bank memory system for SIMD processors. Both the hardware architecture and software programming model are described in this paper, with an implementation example of the BLAS syrk routine. The proposed memory system offers high SIMD data access flexibility by using lookup table based address generators, and applying data permutations on both DMA controller interface and SIMD data access. The evaluation results show that the SIMD processor with this memory system can achieve high execution efficiency, with only 10% to 30% overhead. The proposed memory system also saves the implementation cost on SIMD local registers, in our system, each SIMD core has only 8 128-bit vector registers.
  • Keywords
    coprocessors; digital signal processing chips; embedded systems; multiprocessing systems; parallel processing; BLAS syrk routine; SIMD processors; data level parallelism; data permutations; digital signal processing; embedded DSP applications; heterogeneous multiprocessor architecture; lookup table based address generators; multibank memory; on-chip coprocessors; software programmable data allocation; task level parallelism; Coprocessors; Digital signal processing; Program processors; Registers; Table lookup; SIMD processor; conflict-free memory access; data allocation; multi-bank memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital System Design: Architectures, Methods and Tools (DSD), 2010 13th Euromicro Conference on
  • Conference_Location
    Lille
  • Print_ISBN
    978-1-4244-7839-2
  • Type

    conf

  • DOI
    10.1109/DSD.2010.26
  • Filename
    5615613