Title :
Implementation of HEVC decoder on x86 processors with SIMD optimization
Author :
Leju Yan ; Yizhou Duan ; Jun Sun ; Zongming Guo
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Abstract :
High Efficient Video Coding (HEVC) is the next generation video coding standard in progress. Based on the traditional hybrid coding framework, HEVC implements enhanced tools to improve compression efficiency at the cost of far more computational payload than the capacity of real-time video applications. In this paper, we focus on the software implementation of a real-time HEVC decoder over modern Intel x86 processors. First, we identify the most time-consuming modules of HM 4.0 decoder, represented by motion compensation, adaptive loopfilter, deblocking filter and integer transform. Then the single-execution-multiple-data (SIMD) methods are proposed to optimize the computational performance of these modules. Experimental results show that the optimized decoder is more than 4 times faster than the HM 4.0 decoder, with decoding speed of over 40 frames per second for 1920×1080 resolution videos on Intel i5-2400 processor.
Keywords :
adaptive filters; data compression; decoding; motion compensation; parallel processing; transforms; video coding; HM 4.0 decoder; Intel i5-2400 processor; Intel x86 processors; SIMD method; SIMD optimization; adaptive loop-filter; compression efficiency improvement; deblocking filter; decoding speed; high-efficient video coding; integer transform; motion compensation; next generation video coding standard; real-time HEVC decoder; real-time video applications; single-execution-multiple-data method; software implementation; Decoding; Encoding; Interpolation; Program processors; Registers; Transforms; Video coding; HEVC; MMX/SSE technologies; SIMD optimization;
Conference_Titel :
Visual Communications and Image Processing (VCIP), 2012 IEEE
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-4405-0
Electronic_ISBN :
978-1-4673-4406-7
DOI :
10.1109/VCIP.2012.6410845