مرکز منطقه ای اطلاع رساني علوم و فناوري - Performance trade-off of DCT architectures in Xilinx FPGAs

DocumentCode :

1837255

Title :

Performance trade-off of DCT architectures in Xilinx FPGAs

Author :

Kumar, Dhiraj ; Parhi, Keshab K.

Author_Institution :

Dept. of Electr. & Comput. Eng., Minnesota Univ., Minneapolis, MN, USA

Volume :

fYear :

1999

fDate :

24-27 Oct. 1999

Firstpage :

579

Abstract :

This paper presents the performance trade-offs among a class of three different 8-point 1-D discrete cosine transform (DCT) architectures implemented in Xilinx FPGAs. The architectures chosen for implementation are the distributed arithmetic (DA) architecture (with digit sizes 1 and 3), digit-serial flow graph (DSFG) architectures (with digit sizes 2, 3 and 6) and systolic architecture. The experiments show that the DA is the best architecture in terms of area, speed and latency while the systolic architecture is the worst. The 3-bit serial DA architecture has 85% better area*delay product, 24% lower clock energy per computation and 90% less latency as compared to the systolic design and it provides a HDTV throughput of 111 MPix/sec. The performance of the DSFG designs vary with digit-size. The 2-bit and 3-bit DSFG architectures have a poorer performance as compared to the DA architectures in that throughput range. However 6-bit DSFG comes quite close to 3-bit DA architecture. It has 83% better area*delay product, 13% lower clock energy per computation and 80% less latency as compared to the systolic architecture and provides a throughput of 163 MPix/sec. Another interesting observation is the dominance of route delays which form approximately 50% of the critical path delays.

Keywords :

delays; digital signal processing chips; discrete cosine transforms; distributed arithmetic; field programmable gate arrays; high definition television; signal flow graphs; systolic arrays; 2 bit; 3 bit; 6 bit; 8-point 1D DCT architectures; DCT architectures; HDTV throughput; Xilinx FPGA; area; area*delay product; clock energy per computation; critical path delays; digit sizes; digit-serial flow graph architecture; discrete cosine transform; experiments; latency; performance trade-off; route delays; serial distributed arithmetic architecture; speed; systolic architecture; Clocks; Computer architecture; Discrete cosine transforms; Distributed computing; Field programmable gate arrays; Fixed-point arithmetic; Flow graphs; Pipeline processing; Read only memory; Very large scale integration;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signals, Systems, and Computers, 1999. Conference Record of the Thirty-Third Asilomar Conference on

Conference_Location :

Pacific Grove, CA, USA

ISSN :

1058-6393

Print_ISBN :

0-7803-5700-0

Type :

conf

DOI :

10.1109/ACSSC.1999.832396

Filename :

832396

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1837255