• DocumentCode
    1097299
  • Title

    Fast Block Motion Estimation With 8-Bit Partial Sums Using SIMD Architectures

  • Author

    Duanmu, Chunjiang J. ; Ahmad, M. Omair ; Swamy, M.N.S.

  • Author_Institution
    Concordia Univ., Montreal
  • Volume
    17
  • Issue
    8
  • fYear
    2007
  • Firstpage
    1041
  • Lastpage
    1053
  • Abstract
    In order to take advantage of the byte-type data parallelism in the existing single-instruction multiple-data (SIMD) technique, this paper introduces the concept of 8-bit partial sums, obtained by a 4-bit right-shift operation on the sum of the 16 luminance values in a column of a 16 x 16 block of a video frame. Since these partial sums are of only eight bits, eight of them can be processed concurrently in a single 64-bit SIMD register. A method of employing these partial sums in order to speed up a given block motion-estimation algorithm is then proposed. The notion of the 8-bit partial sums is extended to the four-level case. It is shown that there are 15 possible methods of utilizing these multilevel 8-bit partial sums to accelerate a block motion-estimation algorithm without any loss of accuracy of the algorithm. Each of these 15 methods is used in the full-search algorithm to determine the one that provides the lowest computational complexity. This method is adopted as the chosen scheme to accelerate various block motion-estimation algorithms. Extensive simulations are carried out on eight video sequences showing that substantial speed-up can be achieved when the chosen scheme is incorporated with the various motion-estimation algorithms. The simulation results also demonstrate that the implementation on SIMD architectures can further accelerate the execution of the proposed scheme by more than 93% percent.
  • Keywords
    image sequences; motion estimation; parallel architectures; video coding; 4-bit right-shift operation; 8-bit partial sums; SIMD architectures; byte-type data parallelism; computational complexity; fast block motion estimation; video sequences; Acceleration; Computational complexity; Computational modeling; Discrete cosine transforms; Entropy; MPEG 4 Standard; Motion estimation; Parallel processing; Video coding; Video sequences; 8-bit partial sums; Block motion estimation; single-instruction multiple-data (SIMD); video coding;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2007.898645
  • Filename
    4291627