Title :
Scalar Processing Overhead on SIMD-Only Architectures
Author :
Azevedo, A. ; Juurlink, Ben
Author_Institution :
Comput. Eng. Group, Delft Univ. of Technol., Delft, Netherlands
Abstract :
The Cell processor consists of a general-purpose core and eight cores with a complete SIMD instruction set. Although originally designed for multimedia and gaming, it is currently being used for a much broader range of applications.In this paper we evaluate if the Cell SPEs could benefit significantly from a scalar processing unit using two methodologies. In the first methodology the scalar processing overhead is eliminated by replacing all scalar data types by the quadword data type. This methodology is feasible only for relatively small kernels. In the second methodology SPE performance is compared to the performance of a similarly configured PPU, which supports scalar operations. Experimental results show that the scalar processing overhead ranges from 19% to 57% for small kernels and from 12% to 39% for large kernels. Solutions to eliminate this overhead are also discussed.
Keywords :
computer architecture; instruction sets; microprocessor chips; Cell SPE; Cell processor; SIMD instruction set; SIMD-only architectures; computer architecture; general-purpose core; scalar processing overhead; scalar processing unit; single-instruction multiple-data; synergistic processing elements; Acceleration; Application software; Computer architecture; Computer science; Hardware; Kernel; Mathematics; Multicore processing; Parallel processing; Supercomputers; Computer architecture; Datapath; SIMD overhead; SIMD processing;
Conference_Titel :
Application-specific Systems, Architectures and Processors, 2009. ASAP 2009. 20th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-0-7695-3732-0
Electronic_ISBN :
2160-0511
DOI :
10.1109/ASAP.2009.12