Title :
View Scalable Multiview Video Coding Using 3-D Warping With Depth Map
Author :
Shimizu, Shinya ; Kitahara, Masaki ; Kimata, Hideaki ; Kamikura, Kazuto ; Yashima, Yoshiyuki
Author_Institution :
NTT Corp., Yokosuka
Abstract :
Multiview video coding demands high compression rates as well as view scalability, which enables the video to be displayed on a multitude of different terminals. In order to achieve view scalability, it is necessary to limit the inter-view prediction structure. In this paper, we propose a new multiview video coding scheme that can improve the compression efficiency under such a limited inter-view prediction structure. All views are divided into two groups in the proposed scheme: base view and enhancement views. The proposed scheme first estimates a view-dependent geometry of the base view. It then uses a video encoder to encode the video of base view. The view-dependent geometry is also encoded by the video encoder. The scheme then generates prediction images of enhancement views from the decoded video and the view-dependent geometry by using image-based rendering techniques, and it makes residual signals for each enhancement view. Finally, it encodes residual signals by the conventional video encoder as if they were regular video signals. We implement one encoder that employs this scheme by using a depth map as the view-dependent geometry and 3-D warping as the view generation method. In order to increase the coding efficiency, we adopt the following three modifications: (1) object-based interpolation on 3-D warping; (2) depth estimation with consideration of rate-distortion costs; and (3) quarter-pel accuracy depth representation. Experiments show that the proposed scheme offers about 30% higher compression efficiency than the conventional scheme, even though one depth map video is added to the original multiview video.
Keywords :
data compression; image enhancement; video coding; 3-D Warping; base view; coding efficiency; compression efficiency; compression rates; enhancement views; image-based rendering techniques; interview prediction structure; object-based interpolation; scalable multiview video coding; video encoder; view scalability; view-dependent geometry; 3-D warping; depth map; depth map, , ,; multi-view video coding; multiview video coding; view scalability;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2007.903773