DocumentCode :
1410243
Title :
A Row-Parallel 8 ,\\times, 8 2-D DCT Architecture Using Algebraic Integer-Based Exact Computation
Author :
Madanayake, Arjuna ; Cintra, Renato J. ; Onen, Denis ; Dimitrov, Vassil S. ; Rajapaksha, Nilanka ; Bruton, L.T. ; Edirisuriya, Amila
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Akron, Akron, OH, USA
Volume :
22
Issue :
6
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
915
Lastpage :
929
Abstract :
An algebraic integer (AI)-based time-multiplexed row-parallel architecture and two final reconstruction step (FRS) algorithms are proposed for the implementation of bivariate AI encoded 2-D discrete cosine transform (DCT). The architecture directly realizes an error-free 2-D DCT without using FRSs between row-column transforms, leading to an 8 × 8 2-D DCT that is entirely free of quantization errors in AI basis. As a result, the user-selectable accuracy for each of the coefficients in the FRS facilitates each of the 64 coefficients to have its precision set independently of others, avoiding the leakage of quantization noise between channels as is the case for published DCT designs. The proposed FRS uses two approaches based on: 1) optimized Dempster-Macleod multipliers, and 2) expansion factor scaling. This architecture enables low-noise high-dynamic range applications in digital video processing that requires full control of the finite-precision computation of the 2-D DCT. The proposed architectures and FRS techniques are experimentally verified and validated using hardware implementations that are physically realized and verified on field-programmable gate array (FPGA) chip. Six designs, for 4-bit and 8-bit input word sizes, using the two proposed FRS schemes, have been designed, simulated, physically implemented, and measured. The maximum clock rate and block rate achieved among 8-bit input designs are 307.787 MHz and 38.47 MHz, respectively, implying a pixel rate of 8 × 307.787≈2.462 GHz if eventually embedded in a real- time video-processing system. The equivalent frame rate is about 1187.35Hz for the image size of 1920 × 1080. All implementations are functional on a Xilinx Virtex-6 XC6VLX240T FPGA device.
Keywords :
clocks; discrete cosine transforms; encoding; field programmable gate arrays; microprocessor chips; multiplying circuits; parallel architectures; quantisation (signal); 2D DCT architecture; Dempster-Macleod multipliers; FPGA chip; Xilinx Virtex-6 XC6VLX240T FPGA device; algebraic integer; bivariate AI encoding; clock rate; digital video processing; discrete cosine transform; exact computation; expansion factor scaling; field programmable gate array; final reconstruction step; frequency 307.787 MHz; frequency 38.47 MHz; quantization errors; quantization noise; row-column transforms; time-multiplexed row-parallel architecture; user-selectable accuracy; Artificial intelligence; Computer architecture; Discrete cosine transforms; Encoding; Noise; Quantization; Very large scale integration; Algebraic integer quantization; FPGA design; discrete cosine transform (DCT);
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2011.2181232
Filename :
6114256
Link To Document :
بازگشت