• DocumentCode
    2277023
  • Title

    Perceptual video coding: Challenges and approaches

  • Author

    Chen, Zhenzhong ; Lin, Weisi ; Ngan, King Ngi

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2010
  • fDate
    19-23 July 2010
  • Firstpage
    784
  • Lastpage
    789
  • Abstract
    Investigating human perception can play an important role in video signal processing. Recently, there has been great interest in incorporating human perception into video coding systems to enhance the perceptual quality of the represented visual signal. However, the limited understanding of the human visual system and the high complexity of its computational models make this a challenging task. Furthermore, the hybrid video coding structure makes it difficult to integrate computational models with coding components so as to fulfill the requirements. In this paper, we review the physiological characteristics of human perception and address the aspects most relevant to video coding applications. Moreover, we discuss the computational models and metrics which guide the design and implementation of video coding systems, as well as recent advances in perceptual video coding. To present the latest technologies and most promising directions in perceptual video coding, we focus on three key areas. Specifically, we cover 1) visual attention and sensitivity modeling, where we concentrate on computational models of bottom-up and top-down attention, contrast sensitivity functions and masking effects, and fovea-based manipulations; 2) perceptual quality optimization for constrained video coding, where we discuss how to achieve maximum perceptual quality whilst satisfying various constraints; and 3) the impact of human perception on advanced video applications, including emerging immersive multimedia services and the compression of high dynamic range video content and 3D video. For each aspect, we discuss the major challenges, highlight significant approaches, and outline future research directions.
  • Keywords
    data compression; sensitivity analysis; video coding; 3D video; bottom-up attention; contrast sensitivity functions; fovea based manipulations; high dynamic range video content; human perception; human visual system; hybrid video coding structure; immersive multimedia services; masking effects; perceptual quality optimization; perceptual video coding; sensitivity modeling; top-down attention; video compression; video signal processing; visual attention; visual signal; Computational modeling; Encoding; Humans; Retina; Sensitivity; Video coding; Visualization; Perceptual video coding; human visual system; quality optimization; visual perception
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo (ICME), 2010 IEEE International Conference on
  • Conference_Location
    Suntec City
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-7491-2
  • Type

    conf

  • DOI
    10.1109/ICME.2010.5582549
  • Filename
    5582549