DocumentCode
2173062
Title
Gaze manipulation for one-to-one teleconferencing
Author
Criminisi, A. ; Shotton, J. ; Blake, A. ; Torr, P.H.S.
Author_Institution
Microsoft Res. Ltd., Cambridge, UK
fYear
2003
fDate
13-16 Oct. 2003
Firstpage
191
Abstract
A new algorithm is proposed for novel view generation in one-to-one teleconferencing applications. Given the video streams acquired by two cameras placed on either side of a computer monitor, the proposed algorithm synthesizes images from a virtual camera in arbitrary position (typically located within the monitor) to facilitate eye contact. Our technique is based on an improved, dynamic-programming, stereo algorithm for efficient novel-view generation. The two main contributions are: i) a new type of three-plane graph for dense-stereo dynamic-programming, that encourages correct occlusion labeling; ii) a compact geometric derivation for novel-view synthesis by direct projection of the minimum-cost surface. Furthermore, we present a novel algorithm for the temporal maintenance of a background model to enhance the rendering of occlusions and reduce temporal artefacts (flicker); and a cost aggregation algorithm that acts directly on our three-dimensional matching cost space. Examples are given that demonstrate the robustness of the new algorithm to spatial and temporal artefacts for long stereo video streams. These include demonstrations of synthesis of cyclopean views of extended conversational sequences. We further demonstrate synthesis from a freely translating virtual camera.
Keywords
computational geometry; dynamic programming; hidden feature removal; image matching; image sequences; rendering (computer graphics); teleconferencing; video cameras; video coding; background model temporal maintenance; conversational sequence; cost aggregation algorithm; cyclopean view synthesis; dense-stereo dynamic-programming; gaze manipulation; minimum-cost surface; novel-view generation; occlusion labeling; one-to-one teleconferencing; spatial artefact; temporal artefact; three-dimensional matching cost space; three-plane graph; video stream; virtual camera; Application software; Cameras; Computer displays; Computerized monitoring; Costs; Heuristic algorithms; Labeling; Robustness; Streaming media; Teleconferencing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
Conference_Location
Nice, France
Print_ISBN
0-7695-1950-4
Type
conf
DOI
10.1109/ICCV.2003.1238340
Filename
1238340
Link To Document