DocumentCode
465043
Title
Memory Access Reduced Software Implementation of H.264/AVC Sub-pixel Motion Estimation Using Differential Data Encoding
Author
Choi, Hyojin ; Lee, Wonchul ; Sung, Wonyong
Author_Institution
Sch. of Electr. Eng., Seoul Nat. Univ.
fYear
2007
fDate
27-30 May 2007
Firstpage
2898
Lastpage
2901
Abstract
We studied an efficient software implementation of H.264/AVC sub-pixel motion estimation (ME) algorithm on a VLIW-SIMD digital signal processor, TMS320C6416. The sub-pixel ME algorithm demands large memory accesses while the required arithmetic operations are fairly simple. Although the CPU clock cycles for arithmetic operations can be reduced much by employing sub-word operations and applying software pipelining techniques, the limited memory bandwidth of the architecture restricts the overall performance. Moreover, aggressive VLIW-SIMD optimization results in the degradation of the performance by causing excessive CPU stalls during memory accesses. In this paper, we relieved the memory bandwidth requirements for creating quarter-pixel images by reducing the precision of image data, from 8 bits to 4 bits. As a result, the amount of memory accesses is much reduced at the cost of some increase of the arithmetic operations, which contributes to the balance of arithmetic and memory access operations. The experimental result shows that the memory stall cycles are decreased by 80% and the speed-up of 260% is obtained. The bit rate of the encoded video stream is increased slightly, about 2% on the average, due to the effects of quantization.
Keywords
digital arithmetic; digital signal processing chips; encoding; motion estimation; quantisation (signal); video coding; CPU clock cycles; H.264/AVC sub-pixel motion estimation; TMS320C6416; VLIW-SIMD digital signal processor; advanced video coding; arithmetic operations; differential data encoding; memory access reduced software implementation; memory bandwidth requirements; software pipelining; sub-word operations; Arithmetic; Automatic voltage control; Bandwidth; Clocks; Digital signal processors; Encoding; Motion estimation; Signal processing algorithms; Software algorithms; Software performance;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2007. ISCAS 2007. IEEE International Symposium on
Conference_Location
New Orleans, LA
Print_ISBN
1-4244-0920-9
Electronic_ISBN
1-4244-0921-7
Type
conf
DOI
10.1109/ISCAS.2007.377855
Filename
4253284
Link To Document