Title :
3D geometry representation using multiview coding of image tiles
Author :
Yu Gao ; Gene Cheung ; Maugey, Thomas ; Frossard, Pascal ; Jie Liang
Abstract :
Compression of dynamic 3D geometry obtained from depth sensors is challenging, because noise and temporal inconsistency inherent in acquisition of depth data means there is no one-to-one correspondence between sets of 3D points in consecutive time instants. In this paper, instead of coding 3D points (or meshes) directly, we propose to represent an object´s 3D geometry as a collection of tile images. Specifically, we first place a set of image tiles around an object. Then, we project the object´s 3D geometry onto the tiles that are interpreted as 2D depth images, which we subsequently encode using a modified multiview image codec tuned for piecewise smooth signals. The crux of the tile image framework is the “optimal” placement of image tiles - one that yields the best tradeoff in rate and distortion. We show that if only planar and cylindrical tiles are considered, then the optimal placement problem for K tiles can be mapped to a tractable piece-wise linear approximation problem. We propose an efficient dynamic programming algorithm to find an optimal solution to the piecewise linear approximation problem. Experimental results show that optimal tiling outperforms naïve tiling by up to 35% in rate reduction, and graph transform can further exploit the smoothness of the tile images for coding gain.
Keywords :
data acquisition; dynamic programming; geometry; image coding; image sensors; piecewise linear techniques; tiles; 2D depth images; 3D geometry representation; coding 3D points; cylindrical tiles; data acquisition; depth sensors; dynamic 3D geometry compression; dynamic programming algorithm; graph transform; image tiles; modified multiview image codec; multiview coding; noise inconsistency; piecewise smooth signals; planar tiles; rate reduction; temporal inconsistency; tractable piecewise linear approximation problem; Encoding; Geometry; Image coding; Linear approximation; Piecewise linear approximation; Three-dimensional displays; 3D geometry compression; multiview image coding;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854787