• DocumentCode
    2936625
  • Title
    Multimodal collaboration and human-computer interaction
  • Author
    Zhang, Zhengyou
  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    1596
  • Lastpage
    1599
  • Abstract
    The research effort at Microsoft Research on multimodal collaboration and human-computer interaction aims to develop tools that allow people at geographically distributed sites to collaborate interactively with an immersive experience. Our prototype systems consist of cameras, displays, speakers, microphones, computer-controllable lights, and/or input devices such as touch-sensitive surfaces, styluses, keyboards, and mice. They require real-time processing of a huge amount of data, including foreground-background subtraction, region-of-interest extraction, color estimation and correction, speaker detection, stereo matching, and 3D reconstruction and rendering, in addition to audio and video encoding and decoding, possibly involving multiple microphones and cameras. Some of the processing can be easily parallelized through general-purpose computation on graphics processing units (GPGPU) or on a multi-core processor machine, while other parts are not so trivial. In this extended summary, the author describes two projects: visual echo cancellation in a shared tele-collaborative space, and a distributed meeting capture and broadcasting system. During the talk, the author will also present two recent projects: a personal telepresence station and situated interaction.
  • Keywords
    audio coding; computer vision; feature extraction; groupware; human computer interaction; image colour analysis; image matching; image reconstruction; interactive devices; parallel processing; rendering (computer graphics); speaker recognition; stereo image processing; video coding; 3D reconstruction; audio encoding; broadcasting system; color estimation; computer vision; distributed meeting capture; foreground-background subtraction; graphics processing unit; human computer interaction; input device; microphone; multicore processor machine; multimodal collaboration; parallelizable computation; real-time processing; region-of-interest extraction; rendering; speaker detection; stereo matching; video encoding; visual echo cancellation; Auditory displays; Cameras; Collaboration; Collaborative tools; Computer displays; Control systems; Lighting control; Microphones; Prototypes; Surface reconstruction; Audio Processing; Computer Vision; Human-Computer Interaction; Immersive Communication; Multimodal Collaboration; Parallel Computing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type
    conf
  • DOI
    10.1109/ICME.2009.5202823
  • Filename
    5202823