DocumentCode :
3501305
Title :
A 16-pixel parallel architecture with block-level/mode-level co-reordering approach for intra prediction in 4k×2k H.264/AVC video encoder
Author :
Ren, Huailu ; Fan, Yibo ; Chen, Xinhua ; Zeng, Xiaoyang
Author_Institution :
Coll. of Inf. Sci. & Eng., Shandong Univ. of Sci. & Technol., Qingdao, China
fYear :
2012
fDate :
Jan. 30 2012-Feb. 2 2012
Firstpage :
801
Lastpage :
806
Abstract :
Intra prediction is the most important technology in H.264/AVC intra frame encoder. But there is extremely complicated data dependency and an immense amount of computation in intra prediction process. In order to meet the requirements of real-time coding and avoid hardware waste, this paper presents a parallel and high efficient H.264/AVC intra prediction architecture which targets high-resolution (e.g. 4k×2k) video encoding applications. In this architecture, the optimized intra 4×4 prediction engine can process sixteen pixels in parallel at a slightly higher hardware cost (compared to the previous four-pixel parallel architecture). The intra 16×16 prediction engine works in parallel with intra 4×4 prediction engine. It reuses the adder-tree of Sum of Absolute Transformed Difference (SATD) generator. Moreover, in order to reduce the data-dependency in intra 4×4 reconstruction loop, a block-level and mode-level co-reordering strategy is proposed. Therefore, the performance bottleneck of H.264/AVC intra encoding can be alleviated to a great extent. The proposed architecture supports full-mode intra prediction for H.264/AVC baseline, main and extended profiles. It takes only 163 cycles to complete the intra prediction process of one macroblock (MB). This design is synthesized with a SMIC 0.13μm CMOS cell library. The result shows that it takes 61k gates and can run at 215MHz, supporting real-time encoding of 4k×2k@40fps video sequences.
Keywords :
CMOS integrated circuits; video codecs; video coding; 16-pixel parallel architecture; CMOS cell library; H.264/AVC intra frame encoder; H.264/AVC video encoder; SATD generator; SMIC; block-level-mode-level co-reordering approach; encoding; four-pixel parallel architecture; intra 4x4 prediction engine; intra prediction process; sum of absolute transformed difference; video sequences; Computer architecture; Discrete cosine transforms; Engines; Equations; Generators; Hardware; Image coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design Automation Conference (ASP-DAC), 2012 17th Asia and South Pacific
Conference_Location :
Sydney, NSW
ISSN :
2153-6961
Print_ISBN :
978-1-4673-0770-3
Type :
conf
DOI :
10.1109/ASPDAC.2012.6165065
Filename :
6165065
Link To Document :
بازگشت