Title :
Asynchronous transpose-matrix architectures
Author :
Tierno, Jose A. ; Kudva, Prabhakar
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
The matrix transposition operation is a necessary step in several image/video compression and decompression algorithms, in particular the discrete cosine transform (DCT) and its inverse (IDCT), and some distributed arithmetic applications. These algorithms have to be performed at high data-rates, and with a minimum of power dissipation for portable applications. The authors describe how the clocked solution is usually implemented, and present two new asynchronous architectures that perform matrix transposition. These architectures, one based on two phase signaling, one based on four phase signaling, have better characteristics than the clocked solution in terms of latency and power, at no cost in area or throughput. They discuss the characteristics of these three architectures and evaluate the relative advantages of each one
Keywords :
computer architecture; data compression; discrete cosine transforms; pipeline processing; video coding; asynchronous transpose-matrix architectures; clocked solution; discrete cosine transform; distributed arithmetic applications; four phase signaling; high data-rates; image compression algorithm; image decompression algorithm; inverse discrete cosine transform; latency; matrix transposition operation; minimum power dissipation; portable applications; power; two phase signaling; video compression algorithm; video decompression algorithm; Circuits; Clocks; Costs; Delay; Digital signal processing; Discrete cosine transforms; Discrete transforms; Pipelines; Power dissipation; Throughput;
Conference_Titel :
Computer Design: VLSI in Computers and Processors, 1997. ICCD '97. Proceedings., 1997 IEEE International Conference on
Conference_Location :
Austin, TX
Print_ISBN :
0-8186-8206-X
DOI :
10.1109/ICCD.1997.628904