• DocumentCode
    431579
  • Title

    Perceptually adaptive rate-distortion optimization for variable block size motion alignment in 3D wavelet coding

  • Author

    Sun, Y. ; Pan, F. ; Kassim, A.A.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore
  • Volume
    2
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    A novel content adaptive rate-distortion optimization scheme has been proposed. The scheme can effectively distinguish texture regions, edge regions and flat regions using a directional field technique. Since the human visual system (HVS) perceives distortions more easily near edges and in flat regions, distortion reduction is more important in those regions than the bits it consumes to code the motion information. Adaptive rate-distortion optimization is carried out by adjusting the Lagrangian multiplier so that small values are assigned to edge and flat regions and large values to the random texture region. The proposed scheme has been tested in the scalable video coding (SVC) reference codec by Microsoft Research Asia (MSRA) (Xu, J. et al., ISO/EEC JTC/WG11 M10569, S05, 2004). Experimental results show that the accuracy of motion alignment in visually important regions is greatly improved in the temporal transform step of 3D wavelet coding and the scheme effectively preserves details in the most perceptually prominent regions for all bitstream layers, with no loss in PSNR.
  • Keywords
    distortion; image motion analysis; image texture; optimisation; rate distortion theory; transform coding; video coding; visual perception; wavelet transforms; 3D wavelet coding; Lagrangian multiplier; PSNR; directional field technique; distortion reduction; edge regions; flat regions; human visual system; perceptually adaptive rate-distortion optimization; reference codec; scalable video coding; texture regions; variable block size motion alignment; Asia; Codecs; Humans; ISO; Lagrangian functions; Rate-distortion; Static VAr compensators; Testing; Video coding; Visual system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1415558
  • Filename
    1415558