DocumentCode
2908736
Title
Scalar Processing Overhead on SIMD-Only Architectures
Author
Azevedo, A. ; Juurlink, Ben
Author_Institution
Comput. Eng. Group, Delft Univ. of Technol., Delft, Netherlands
fYear
2009
fDate
7-9 July 2009
Firstpage
183
Lastpage
190
Abstract
The Cell processor consists of a general-purpose core and eight cores with a complete SIMD instruction set. Although originally designed for multimedia and gaming, it is currently being used for a much broader range of applications.In this paper we evaluate if the Cell SPEs could benefit significantly from a scalar processing unit using two methodologies. In the first methodology the scalar processing overhead is eliminated by replacing all scalar data types by the quadword data type. This methodology is feasible only for relatively small kernels. In the second methodology SPE performance is compared to the performance of a similarly configured PPU, which supports scalar operations. Experimental results show that the scalar processing overhead ranges from 19% to 57% for small kernels and from 12% to 39% for large kernels. Solutions to eliminate this overhead are also discussed.
Keywords
computer architecture; instruction sets; microprocessor chips; Cell SPE; Cell processor; SIMD instruction set; SIMD-only architectures; computer architecture; general-purpose core; scalar processing overhead; scalar processing unit; single-instruction multiple-data; synergistic processing elements; Acceleration; Application software; Computer architecture; Computer science; Hardware; Kernel; Mathematics; Multicore processing; Parallel processing; Supercomputers; Computer architecture; Datapath; SIMD overhead; SIMD processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Application-specific Systems, Architectures and Processors, 2009. ASAP 2009. 20th IEEE International Conference on
Conference_Location
Boston, MA
ISSN
2160-0511
Print_ISBN
978-0-7695-3732-0
Electronic_ISBN
2160-0511
Type
conf
DOI
10.1109/ASAP.2009.12
Filename
5200027
Link To Document