Title :
Fast parallel implementation of H.264 /AVC transform exploiting SIMD instructions
Author :
Jianhong, Ye ; Jilin, Liu
Author_Institution :
ZheJiang Univ. of Sci. & Electron. Eng., Hangzhou
fDate :
Nov. 28 2007-Dec. 1 2007
Abstract :
This paper presents a fast parallel implementation of H.264 transform (FPU) using SIMD instructions. The block size of transform used by H.264 is 4 x 4, which is quite different to the transform of the previous video compression documentations. The 4 x 4 transform is computed in integer arithmetic and can avoid the mismatch of inverse transform. The transform is very low computational, because it just uses additions and shifts, without multiplications. However, it is hard to using SIMD instructions to optimize the transform because the 4 x 4 block can not use the 16 -byte register efficiently. This paper gives out a solution to use SIMD instructions efficiently and designs a fast transform of H.264. Simulation result reveals that the FPIT using SIMD instructions increases up to 13 -15 times comparing to the same data by using fast transform in butterfly structure [2][3]onP4 2.0 GHZ.
Keywords :
data compression; parallel processing; video coding; FPIT; H.264 /AVC transform; SIMD; inverse transform; parallel implementation; Arithmetic; Automatic voltage control; Discrete cosine transforms; Discrete transforms; IEC standards; ISO standards; MPEG 4 Standard; Streaming media; Video coding; Video compression; FPIT; H.264; SIMD instructions; SSE2;
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2007. ISPACS 2007. International Symposium on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-1447-5
Electronic_ISBN :
978-1-4244-1447-5
DOI :
10.1109/ISPACS.2007.4446026