Title :
Scalable array architecture design for full search block matching
Author :
Chang, Shifan ; Hwang, Juin-Haur ; Jen, Chein-Wei
Author_Institution :
Dept. of Electron. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fDate :
8/1/1995 12:00:00 AM
Abstract :
Block matching is a widely used motion estimation algorithm in current video systems. Among typical searching strategies, the full search scheme provides better precision and regular data flow as well as higher parallelism, a characteristic that is advantageous for VLSI implementation. However, the huge computation load incurred by full search results in high cost, especially in high pixel rate applications. Since the block matching algorithm is used in a wide range of pixel rates, an architecture that is cascadable and offers variable computing power is promising. An approach based on dependence graph (DG) is proposed to analyze the operation sequence and data flow of full search block matching. The approach employs a transformation on DGs, called slice and tile, to produce different forms of DGs. Through this technique, most existing architectures can be represented in graphs for analysis, and new architectures can be found. A new architecture is presented that features cascadable modules of processing elements (PEs) with simple interconnection. Therefore, flexibility in computing power is available. The other advantages include variable sizes of the search area and 100% PE utilization. These characteristics offer great flexibility and efficiency for different applications. This architecture is implemented with thirty-two PEs in one chip that consists of 102 K transistors
Keywords :
VLSI; data flow graphs; digital signal processing chips; image matching; motion estimation; parallel architectures; search problems; video signal processing; VLSI; chip; data flow; dependence graph; full search block matching; motion estimation algorithm; operation sequence; pixel rates; processing elements; scalable array architecture design; transistors; variable computing power; variable search area size; video systems; Algorithm design and analysis; Computer architecture; Costs; Flow graphs; Hardware; Integrated circuit interconnections; Motion estimation; Parallel processing; Tiles; Very large scale integration;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on