DocumentCode
1189309
Title
A VLIW Vector Media Coprocessor With Cascaded SIMD ALUs
Author
Wada, Takahisa ; Ishiwata, Shunichi ; Kimura, Katsuyuki ; Nakanishi, Keiri ; Sumiyoshi, Masato ; Miyamori, Takashi ; Nakagawa, Masaki
Author_Institution
Center for Semicond. R&D, Toshiba Corp., Kawasaki, Japan
Volume
17
Issue
9
fYear
2009
Firstpage
1285
Lastpage
1296
Abstract
High-definition video applications, such as digital TV and digital video cameras, require high processing performance for high-quality visual images in addition to a complex video CODEC. Pre-/postprocessing to improve video quality is becoming much more important because requirements for pre-/postprocessing vary among applications and processing algorithms have not been stabilized. Therefore, a new processor architecture that has a highly parallel datapath is needed. In this paper, we introduce a VLIW vector media coprocessor, ldquovector coprocessor (VCP),rdquo that includes three asymmetric execution pipelines with cascaded SIMD ALUs. To improve performance efficiency, we reduce the area ratio of the control circuit while increasing the ratio of the arithmetic circuit. The total gate count of VCP is 1268 kgates and its maximum operating frequency is 300 MHz at 90-nm CMOS process. Some of the processing kernels in an adaptive prefilter that is applied to preprocessing for video encoding are evaluated. In the case of the edgeness and the sum of absolute differences, the performance is 183 giga operations per second. VCP offers enough performance for HD video processing and good cost-performance while all processing pipeline units operate effectively.
Keywords
CMOS digital integrated circuits; digital television; high definition video; instruction sets; parallel architectures; pipeline processing; vector processor systems; video coding; ALU; VLIW; adaptive prefilter; arithmetic circuit; asymmetric execution pipelines; complex video CODEC; control circuit; digital TV; digital video cameras; frequency 300 MHz; high-definition video encoding; high-quality visual images; image processing kernels; multiple data stream; parallel datapath; processing kernels; processor architecture; single instruction stream; size 90 nm; vector media coprocessor; very long instruction word; video encoding; video quality; Single instruction stream, multiple data stream (SIMD); vector coprocessor (VCP); very long instruction word (VLIW);
fLanguage
English
Journal_Title
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-8210
Type
jour
DOI
10.1109/TVLSI.2008.2003006
Filename
4799236
Link To Document