DocumentCode :
2173062
Title :
Gaze manipulation for one-to-one teleconferencing
Author :
Criminisi, A. ; Shotton, J. ; Blake, A. ; Torr, P.H.S.
Author_Institution :
Microsoft Res. Ltd., Cambridge, UK
fYear :
2003
fDate :
13-16 Oct. 2003
Firstpage :
191
Abstract :
A new algorithm is proposed for novel view generation in one-to-one teleconferencing applications. Given the video streams acquired by two cameras placed on either side of a computer monitor, the proposed algorithm synthesizes images from a virtual camera in arbitrary position (typically located within the monitor) to facilitate eye contact. Our technique is based on an improved, dynamic-programming, stereo algorithm for efficient novel-view generation. The two main contributions are: i) a new type of three-plane graph for dense-stereo dynamic-programming, that encourages correct occlusion labeling; ii) a compact geometric derivation for novel-view synthesis by direct projection of the minimum-cost surface. Furthermore, we present a novel algorithm for the temporal maintenance of a background model to enhance the rendering of occlusions and reduce temporal artefacts (flicker); and a cost aggregation algorithm that acts directly on our three-dimensional matching cost space. Examples are given that demonstrate the robustness of the new algorithm to spatial and temporal artefacts for long stereo video streams. These include demonstrations of synthesis of cyclopean views of extended conversational sequences. We further demonstrate synthesis from a freely translating virtual camera.
Keywords :
computational geometry; dynamic programming; hidden feature removal; image matching; image sequences; rendering (computer graphics); teleconferencing; video cameras; video coding; background model temporal maintenance; conversational sequence; cost aggregation algorithm; cyclopean view synthesis; dense-stereo dynamic-programming; gaze manipulation; minimum-cost surface; novel-view generation; occlusion labeling; one-to-one teleconferencing; spatial artefact; temporal artefact; three-dimensional matching cost space; three-plane graph; video stream; virtual camera; Application software; Cameras; Computer displays; Computerized monitoring; Costs; Heuristic algorithms; Labeling; Robustness; Streaming media; Teleconferencing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
Conference_Location :
Nice, France
Print_ISBN :
0-7695-1950-4
Type :
conf
DOI :
10.1109/ICCV.2003.1238340
Filename :
1238340
Link To Document :
بازگشت