• DocumentCode
    2873818
  • Title

    AIFSP: An Adaptive Instruction Flow Stream Processor

  • Author

    Wang, Yaohua ; Chen, Shuming ; Wan, Jianghua ; Zhang, Kai ; Chen, Shenggang

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Defence Technol., Changsha, China
  • fYear
    2011
  • fDate
    4-6 July 2011
  • Firstpage
    272
  • Lastpage
    277
  • Abstract
    Stream processor is efficient for media applications as it exploits the features of media processing, such as data parallelism, producer-consumer locality and so on. However, the loosely coupled structure between host and stream processor makes the communication between scalar and SIMD part costly and scheduling across kernels less flexible. Besides, the kernel loading time adds additional cost. When the stream length becomes shorter the performance degradation caused by these factors is unacceptable. In addition to the loosely coupled structure, lack of efficient support for chained scalar and SIMD kernels makes the case worse. To overcome these shortcomings of stream processor, we propose a target architecture named AIFSP, which merges the host and stream processor together into a tightly coupled structure with both scalar and SIMD part. The whole processor can run in a single or dual instruction flow mode, adaptive to the characteristic of applications. When running in a single instruction flow mode, costless communication between scalar and SIMD part, flexible scheduling across kernels and zero kernel loading time can be achieved, the speedup for short streams can reach 2.6x, while in dual instruction flow mode, the scalar and SIMD kernels can run concurrently on scalar and SIMD part of AIFSP, thus kernel overlapping is realized and about 20% performance improving can be attained when SIMD width is set to 8, with the increase of SIMD width, the performance gain will be larger.
  • Keywords
    parallel processing; processor scheduling; AIFSP; SIMD kernels; adaptive instruction flow stream processor; data parallelism; media applications; producer-consumer locality; single instruction flow mode; zero kernel loading time; Kernel; Loading; Media; Microcontrollers; Performance gain; Process control; Streaming media; Enhanced Scalar Processor; Kernel Overlapping; Stream Length Effect; Stream Processor;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    VLSI (ISVLSI), 2011 IEEE Computer Society Annual Symposium on
  • Conference_Location
    Chennai
  • ISSN
    2159-3469
  • Print_ISBN
    978-1-4577-0803-9
  • Electronic_ISBN
    2159-3469
  • Type

    conf

  • DOI
    10.1109/ISVLSI.2011.62
  • Filename
    5992518