• DocumentCode
    4585
  • Title

    Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding

  • Author

    Yun Zhang ; Kwong, Sam ; Xu Wang ; Hui Yuan ; Zhaoqing Pan ; Long Xu

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • Volume
    24
  • Issue
    7
  • fYear
    2015
  • fDate
    Jul-15
  • Firstpage
    2225
  • Lastpage
    2238
  • Abstract
    In this paper, we propose a machine learning-based fast coding unit (CU) depth decision method for High Efficiency Video Coding (HEVC), which optimizes the complexity allocation at CU level with given rate-distortion (RD) cost constraints. First, we analyze quad-tree CU depth decision process in HEVC and model it as a three-level of hierarchical binary decision problem. Second, a flexible CU depth decision structure is presented, which allows the performances of each CU depth decision be smoothly transferred between the coding complexity and RD performance. Then, a three-output joint classifier consists of multiple binary classifiers with different parameters is designed to control the risk of false prediction. Finally, a sophisticated RD-complexity model is derived to determine the optimal parameters for the joint classifier, which is capable of minimizing the complexity in each CU depth at given RD degradation constraints. Comparative experiments over various sequences show that the proposed CU depth decision algorithm can reduce the computational complexity from 28.82% to 70.93%, and 51.45% on average when compared with the original HEVC test model. The Bjøntegaard delta peak signal-to-noise ratio and Bjøntegaard delta bit rate are -0.061 dB and 1.98% on average, which is negligible. The overall performance of the proposed algorithm outperforms those of the state-of-the-art schemes.
  • Keywords
    binary decision diagrams; communication complexity; decision theory; decision trees; image classification; learning (artificial intelligence); quadtrees; rate distortion theory; video coding; Bjøntegaard delta bit rate; Bjøntegaard delta peak signal-to-noise ratio; HEVC; RD cost constraint; RD-complexity model; coding unit depth decision method; computational complexity; flexible complexity allocation; hierarchical binary decision problem; high efficiency video coding; machine learning; multiple binary classifier; quadtree CU depth decision process; rate-distortion cost constraint; three-output joint classifier; Classification algorithms; Complexity theory; Image coding; Joints; Prediction algorithms; Support vector machines; Video coding; Coding Unit; High Efficiency Video Coding; High efficiency video coding; Machine Learning, Support Vector Machine; coding unit; machine learning; support vector machine;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2015.2417498
  • Filename
    7070704