DocumentCode :
1754940
Title :
Novel Efficient HEVC Decoding Solution on General-Purpose Processors
Author :
Yizhou Duan ; Jun Sun ; Leju Yan ; Keji Chen ; Zongming Guo
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Volume :
16
Issue :
7
fYear :
2014
fDate :
Nov. 2014
Firstpage :
1915
Lastpage :
1928
Abstract :
Although the emerging video coding standard High Efficiency Video Coding (HEVC) successfully doubles the compression efficiency of H.264/AVC, its growing computational complexity makes real-time decoding of high-definition HEVC videos a very challenging issue for the existing personal computers and mobile devices. In this paper, a systematical, efficient HEVC decoding solution on general processors is provided, consisting of structure-level, data-level, and task-level approaches. First, a redesigned overall structure of a HEVC decoder with data redundancy reduction mechanism is introduced, which cuts down basic data operation cost and achieves an average decoding speedup of 2.37 × compared to the HM 10.0 decoder. On this basis, novel single-instruction multiple-data (SIMD) algorithms such as low-complexity motion compensation, transpose-free transform, symmetric deblocking filter, and parallel-index sample adaptive offset are developed, which further parallelize the data operations of each decoding task and bring another 2.67 × decoding speedup. Finally, a frame-based task-level parallel framework is employed with a flexible entry scheme to efficiently support the simultaneous processing of multiple decoding tasks for different HEVC parallel strategies. The overall solution achieves decoding fps of 40-75 for 4k HEVC videos on the Intel i7-2600 3.4 GHz quad-core processor (4-thread decoding) and 35-55 for 720p videos on the ARM Cortex-A9 1.2 GHz duo-core processor (2-thread decoding). This proposal is the recommended cross-platform HEVC decoding solution of Intel, AMD, and Cisco, and has provided HEVC service to over 1500 million people in China via the Xunlei Kankan video client.
Keywords :
data compression; decoding; general purpose computers; parallel processing; real-time systems; video codecs; video coding; 2-thread decoding; 4-thread decoding; AMD; ARM Cortex-A9 duo-core processor; Cisco; H.264-AVC; HM 10.0 decoder; Intel i7-2600; SIMD algorithms; Xunlei Kankan video client; compression efficiency; computational complexity; cross-platform decoding solution; data operations; data redundancy reduction mechanism; data-level approaches; general-purpose processors; high efficiency video coding; high-definition HEVC videos; low-complexity motion compensation; mobile devices; parallel strategies; parallel-index sample adaptive offset; personal computers; quad-core processor; real-time decoding; single-instruction multiple-data algorithms; structure-level approaches; symmetric deblocking filter; task-level approaches; transpose-free transform; video coding standard; Algorithm design and analysis; Decoding; Encoding; Program processors; Standards; Transforms; Video coding; Decoding; SIMD; high efficiency video coding (HEVC); video codecs;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2014.2337834
Filename :
6851926
Link To Document :
بازگشت