Title :
Real-Time and Temporal-Coherent Foreground Extraction With Commodity RGBD Camera
Author :
Mengyao Zhao ; Chi-Wing Fu ; Jianfei Cai ; Tat-Jen Cham
Author_Institution :
Sch. of Comput. Eng. & BeingThere Centre, Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Foreground extraction from video stream is an important component in many multimedia applications. By exploiting commodity RGBD cameras, we could further extract dynamic foreground objects with 3D information in real-time, thereby enabling new forms of multimedia applications such as 3D telepresence. However, one critical problem with existing methods for real-time foreground extraction is temporal coherency. They could exhibit severe flickering results for foreground objects such as human motion, thus affecting the visual quality as well as the image object analysis in the multimedia applications. This paper presents a new GPU-based real-time foreground extraction method with several novel techniques. First, we detect shadow and fill missing depth data accordingly in RGBD video, and then adaptively combine color and depth masks to form a trimap. After that, we formulate a novel closed-form matting model to improve the temporal coherency in foreground extraction while achieving real-time performance. Particularly, we propagate RGBD data across temporal domain to improve the visual coherence in the foreground object extraction, and take advantage of various CUDA strategies and spatial data structures to improve the speed. Experiments with a number of users on different scenarios show that, compared with state-of-the-art methods, our method can extract stabler foreground objects with higher visual quality as well as better temporal coherency, while still achieving real-time performance (experimentally, 30.3 frames per second on average).
Keywords :
cameras; feature extraction; graphics processing units; image colour analysis; parallel architectures; spatial data structures; video signal processing; 3D information; 3D telepresence; CUDA strategies; GPU-based real-time foreground extraction method; RGBD data propagation; RGBD video; closed-form matting model; color masks; commodity RGBD camera; depth masks; dynamic foreground object extraction; flickering; human motion; image object analysis; missing depth data filling; multimedia applications; real-time performance; real-time temporal-coherent foreground object extraction; shadow detection; spatial data structures; speed improvement; stable foreground object extraction; temporal coherency improvement; temporal domain; trimap; video stream; visual coherence improvement; visual quality; Data mining; Graphics processing units; Image color analysis; Kernel; Real-time systems; Streaming media; Three-dimensional displays; 3D telepresence; Foreground extraction; RGBD video; alpha matting; depth camera;
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
DOI :
10.1109/JSTSP.2014.2382476