DocumentCode
2277023
Title
Perceptual video coding: Challenges and approaches
Author
Chen, Zhenzhong ; Lin, Weisi ; Ngan, King Ngi
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2010
fDate
19-23 July 2010
Firstpage
784
Lastpage
789
Abstract
Investigation on the human perception can play an important role in video signal processing. Recently, there has been great interest in incorporating the human perception in video coding systems to enhance the perceptual quality of the represented visual signal. However, the limited understanding of the human visual system and high complexity of computational models of human visual system make it a challenging task. Furthermore, the hybrid video coding structure brings difficulties to integrate computational models with coding components to fulfill the requirements. In this paper, we review the physiological characteristics of human perception and address the most relevant aspects to video coding applications. Moreover, we discuss the computational models and metrics which guide the design and implementation of the video coding system, as well as the recent advances in perceptual video coding. To introduce this overview with the latest technologies and most promising directions in perceptual video coding, we focus on three key areas. Specifically, we cover 1) visual attention and sensitivity modeling, with which we concentrate on the computational models of bottom-up and top-down attention, contrast sensitivity functions and masking effects, and fovea based manipulations; 2) perceptual quality optimization for constrained video coding, with which we discuss how to achieve maximum perceptual quality whilst satisfying various constraints; and 3) the impact of the human perception on advanced video applications, including emerging immersive multimedia services, and compression of high dynamic range video content and 3D video. For each aspect, we discuss the major challenges, highlight significant approaches, and outline future research directions.
Keywords
data compression; sensitivity analysis; video coding; 3D video; bottom-up attention; contrast sensitivity functions; fovea based manipulations; high dynamic range video content; human perception; human visual system; hybrid video coding structure; immersive multimedia services; masking effects; perceptual quality optimization; perceptual video coding; sensitivity modeling; top-down attention; video compression; video signal processing; visual attention; visual signal; Computational modeling; Encoding; Humans; Retina; Sensitivity; Video coding; Visualization; Perceptual video coding; human visual system; quality optimization; visual perception;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2010 IEEE International Conference on
Conference_Location
Suntec City
ISSN
1945-7871
Print_ISBN
978-1-4244-7491-2
Type
conf
DOI
10.1109/ICME.2010.5582549
Filename
5582549
Link To Document