DocumentCode
2873818
Title
AIFSP: An Adaptive Instruction Flow Stream Processor
Author
Wang, Yaohua ; Chen, Shuming ; Wan, Jianghua ; Zhang, Kai ; Chen, Shenggang
Author_Institution
Sch. of Comput., Nat. Univ. of Defence Technol., Changsha, China
fYear
2011
fDate
4-6 July 2011
Firstpage
272
Lastpage
277
Abstract
Stream processor is efficient for media applications as it exploits the features of media processing, such as data parallelism, producer-consumer locality and so on. However, the loosely coupled structure between host and stream processor makes the communication between scalar and SIMD part costly and scheduling across kernels less flexible. Besides, the kernel loading time adds additional cost. When the stream length becomes shorter the performance degradation caused by these factors is unacceptable. In addition to the loosely coupled structure, lack of efficient support for chained scalar and SIMD kernels makes the case worse. To overcome these shortcomings of stream processor, we propose a target architecture named AIFSP, which merges the host and stream processor together into a tightly coupled structure with both scalar and SIMD part. The whole processor can run in a single or dual instruction flow mode, adaptive to the characteristic of applications. When running in a single instruction flow mode, costless communication between scalar and SIMD part, flexible scheduling across kernels and zero kernel loading time can be achieved, the speedup for short streams can reach 2.6x, while in dual instruction flow mode, the scalar and SIMD kernels can run concurrently on scalar and SIMD part of AIFSP, thus kernel overlapping is realized and about 20% performance improving can be attained when SIMD width is set to 8, with the increase of SIMD width, the performance gain will be larger.
Keywords
parallel processing; processor scheduling; AIFSP; SIMD kernels; adaptive instruction flow stream processor; data parallelism; media applications; producer-consumer locality; single instruction flow mode; zero kernel loading time; Kernel; Loading; Media; Microcontrollers; Performance gain; Process control; Streaming media; Enhanced Scalar Processor; Kernel Overlapping; Stream Length Effect; Stream Processor;
fLanguage
English
Publisher
ieee
Conference_Titel
VLSI (ISVLSI), 2011 IEEE Computer Society Annual Symposium on
Conference_Location
Chennai
ISSN
2159-3469
Print_ISBN
978-1-4577-0803-9
Electronic_ISBN
2159-3469
Type
conf
DOI
10.1109/ISVLSI.2011.62
Filename
5992518
Link To Document