• DocumentCode
    3319675
  • Title

    Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

  • Author

    Xiangwen Wang ; Li Song ; Min Chen ; Junjie Yang

  • Author_Institution
    Inst. of Electron. & Inf. Eng., Shanghai Univ. of Electr. Power, Shanghai, China
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on parallel realization architecture design of VBSME in HEVC. Firstly, an efficient parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the parallel realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA´s C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.
  • Keywords
    computational complexity; graphics processing units; interpolation; motion estimation; video coding; CPU plus GPU platform; CTU; H.264-AVC; HEVC; HM; MC; ME; ME compression performance improvement; ME with variable block sizes; NVIDIA C2050 GPU; VBSME; border padding processes; coding tree unit; computational complexity; fast prediction unit partition mode decision algorithm; fractional-pixel image interpolation; motion compensation; variable block size motion estimation parallelization; Abstracts; Graphics processing units; Image reconstruction; Indexes; Interpolation; Laboratories; Video coding; GPU; HEVC; Motion Estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Type

    conf

  • DOI
    10.1109/ICMEW.2013.6618412
  • Filename
    6618412