• DocumentCode
    2173062
  • Title

    Gaze manipulation for one-to-one teleconferencing

  • Author

    Criminisi, A. ; Shotton, J. ; Blake, A. ; Torr, P.H.S.

  • Author_Institution
    Microsoft Res. Ltd., Cambridge, UK
  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    191
  • Abstract
    A new algorithm is proposed for novel view generation in one-to-one teleconferencing applications. Given the video streams acquired by two cameras placed on either side of a computer monitor, the proposed algorithm synthesizes images from a virtual camera in arbitrary position (typically located within the monitor) to facilitate eye contact. Our technique is based on an improved, dynamic-programming, stereo algorithm for efficient novel-view generation. The two main contributions are: i) a new type of three-plane graph for dense-stereo dynamic-programming, that encourages correct occlusion labeling; ii) a compact geometric derivation for novel-view synthesis by direct projection of the minimum-cost surface. Furthermore, we present a novel algorithm for the temporal maintenance of a background model to enhance the rendering of occlusions and reduce temporal artefacts (flicker); and a cost aggregation algorithm that acts directly on our three-dimensional matching cost space. Examples are given that demonstrate the robustness of the new algorithm to spatial and temporal artefacts for long stereo video streams. These include demonstrations of synthesis of cyclopean views of extended conversational sequences. We further demonstrate synthesis from a freely translating virtual camera.
  • Keywords
    computational geometry; dynamic programming; hidden feature removal; image matching; image sequences; rendering (computer graphics); teleconferencing; video cameras; video coding; background model temporal maintenance; conversational sequence; cost aggregation algorithm; cyclopean view synthesis; dense-stereo dynamic-programming; gaze manipulation; minimum-cost surface; novel-view generation; occlusion labeling; one-to-one teleconferencing; spatial artefact; temporal artefact; three-dimensional matching cost space; three-plane graph; video stream; virtual camera; Application software; Cameras; Computer displays; Computerized monitoring; Costs; Heuristic algorithms; Labeling; Robustness; Streaming media; Teleconferencing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
  • Conference_Location
    Nice, France
  • Print_ISBN
    0-7695-1950-4
  • Type

    conf

  • DOI
    10.1109/ICCV.2003.1238340
  • Filename
    1238340