Title :
Row-column decomposition based 2D transform optimization on subword parallel processors
Author :
Sihvo, Tero ; Niittylahti, Jarkko
Author_Institution :
Inst. of Digital & Comput. Syst., Tampere Univ. of Technol., Finland
Abstract :
This paper discusses the row-column decomposition based 2D block transform implementations, in which the matrix transpose plays a crucial role. A subword parallel VLIW processor architecture supporting simultaneous data processing and matrix transpose provides the required functionality for fast and flexible transform implementations. In addition, new instructions are proposed to further speed up the transforms in the H.264/AVC. With the proposed architectural optimizations, a speed-up by 2.7 is achieved for the 2D DCT/IDCT and a speed-up by over two for the transforms in the H.264/AVC, when compared to the sequential implementations.
Keywords :
discrete cosine transforms; parallel architectures; transform coding; video codecs; video coding; 2D transform optimization; DCT/IDCT; H.264/AVC; VLIW processor architecture; data processing; matrix transpose; row-column decomposition; subword parallel processors; Automatic voltage control; Concurrent computing; Decoding; Discrete cosine transforms; Discrete transforms; Matrix decomposition; Quantization; Registers; Transform coding; Video coding;
Conference_Titel :
Signals, Circuits and Systems, 2005. ISSCS 2005. International Symposium on
Print_ISBN :
0-7803-9029-6
DOI :
10.1109/ISSCS.2005.1509860