Title :
Refining instruction set architecture for high-performance multimedia processing in constrained environments
Author :
Lee, Ruby B. ; Fiskiran, Murat A. ; Shi, Zhijie ; Yang, Xiao
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., NJ, USA
Abstract :
Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modem microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, special-purpose functional units that are not practical for constrained environments such as handheld multimedia information appliances. For such environments, low-power and low-cost are as important as the high performance required for real-time multimedia processing and the general-purpose programmability required to support an ever growing range of applications. In this paper, we introduce PLX, a concise ISA that selects the most useful features from the first two generations of multimedia instructions added to microprocessors, and explores new ISA features for high-performance yet low-cost multimedia processing with small footprint processors. PLX is unique in that it is designed from scratch as a fully subword-parallel architecture with novel features like datapath scalability from 32-bit to 128-bit words, and a new definition of predication for reducing conditional branches. We illustrate the use of PLX´s architectural features with four frequently used multimedia kernels: discrete cosine transform, pixel padding, clip test and median filter. Our performance results show that a 64-bit PLX implementation achieves significant speedups compared to a basic 64-bit RISC processor and to IA-32 processors with MMX and SSE multimedia extensions. PLX´s datapath scalability feature often provides an additional 2x speedup in a cost-effective way.
Keywords :
computer architecture; computer graphics; discrete cosine transforms; instruction sets; low-power electronics; median filters; microprocessor chips; multimedia computing; operating system kernels; reduced instruction set computing; video coding; 32 to 128 bit; 64 bit; MMX extensions; PLX; RISC processors; SSE extensions; clip test; conditional branch reduction predication; constrained environments; datapath scalability; discrete cosine transforms; handheld multimedia information appliances; high-performance real-time multimedia processing; instruction set architecture subword-parallel instructions; low-cost applications; low-power applications; median filter; microprocessor ISA; multimedia kernels; pixel padding; programmability; small footprint processors; software multimedia processing; useful multimedia instruction features; Acceleration; Computer architecture; Discrete cosine transforms; Home appliances; Instruction sets; Kernel; Microprocessors; Modems; Scalability; Testing;
Conference_Titel :
Application-Specific Systems, Architectures and Processors, 2002. Proceedings. The IEEE International Conference on
Print_ISBN :
0-7695-1712-9
DOI :
10.1109/ASAP.2002.1030724