• DocumentCode
    1474481
  • Title

    Spatial-Random-Access-Enabled Video Coding for Interactive Virtual Pan/Tilt/Zoom Functionality

  • Author

    Mavlankar, Aditya ; Girod, Bernd

  • Author_Institution
    Stanford Univ., Stanford, CA, USA
  • Volume
    21
  • Issue
    5
  • fYear
    2011
  • fDate
    5/1/2011 12:00:00 AM
  • Firstpage
    577
  • Lastpage
    588
  • Abstract
    High-spatial-resolution videos offer the possibility of viewing an arbitrary region-of-interest (RoI) interactively. Zoom functionality enables watching high-resolution content even on displays of lower spatial resolution. If arbitrary regions corresponding to arbitrary zoom factors can be served to the user, the transmission and/or decoding of the entire high-spatial-resolution video can be avoided. Moreover, if the video content can be encoded such that arbitrary RoIs corresponding to different zoom factors can be simply extracted from the compressed bitstream, we can avoid dedicated video encoding for each user. We propose such a video coding scheme that is vital in allowing the system to scale to large numbers of remote users as well as to encode and store the content for subsequent repeated playback. Apart from generating a multi-resolution representation, our coding scheme uses P slices from H.264/AVC. We study the tradeoff in the choice of slice size. A larger slice size enables higher coding efficiency for representing the entire scene but increases the number of pixels that have to be transmitted. The optimal slice size achieves the best tradeoff and minimizes the expected transmission bitrate. Experimental results confirm the optimality of our predicted slice size for various test cases. Furthermore, we propose an improvement based on background extraction and long-term memory motion-compensated prediction. Experiments indicate up to 85% bitrate reduction while retaining efficient random access capability.
  • Keywords
    data compression; decoding; image representation; image resolution; interactive video; motion compensation; video coding; arbitrary region-of-interest; dedicated video encoding; high-resolution content; high-spatial-resolution video decoding; interactive virtual pan-tilt-zoom functionality; long-term memory motion-compensated prediction; low spatial resolution; multiresolution representation; optimal slice size; spatial-random-access-enabled video coding; transmission bitrate; zoom factors; zoom functionality; Automatic voltage control; Encoding; Pixel; Random variables; Spatial resolution; Streaming media; Video coding; Interactive video streaming; pan/tilt/zoom; region-of-interest;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2011.2129170
  • Filename
    5733400