Title :
Efficient IDCT implementations on VLIW processors
Author :
Bagni, Daniele ; Borneo, Antonio ; Celetto, Luca
Author_Institution :
Adv. Syst. Technol., STMicroelectron., Agrate Brianza, Italy
Abstract :
In this paper we describe two efficient software implementations of bi-dimensional IDCT (Inverse Discrete Cosine Transform). Instead of using a traditional separation into eight horizontal and vertical mono-dimensional IDCT stages, we apply a novel approach to directly represent the bi-dimensional IDCT into only eight mono-dimensional units followed by a network of addition and subtraction operations. We have then optimized this method in pure ANSI-C for 32-bit architecture VLIW (Very Long Instruction Word) processors. By arranging the network structure in a proper way to exploit sub-word parallelism and by defining totally new multimedia instructions, we have implemented a second version that is 23% more efficient than the previous one. Our fixed-point arithmetic IDCT implementations are fully compliant with the IEEE 1180 standard, as required by most of the video compression standards.
Keywords :
discrete cosine transforms; fixed point arithmetic; instruction sets; inverse transforms; parallel architectures; video coding; ANSI-C; IEEE 1180 standard; VLIW processor; bidimensional IDCT; fixed point arithmetic IDCT implementation; inverse discrete cosine transform; monodimensional unit; multimedia instruction; very long instruction word; video compression standard; Algorithm design and analysis; Discrete cosine transforms; Program processors; Signal processing algorithms; Standards; VLIW;
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse