DocumentCode :
1080732
Title :
Real-Time Vision and Speech Driven Avatars for Multimedia Applications
Author :
Schreer, Oliver ; Englert, Roman ; Eisert, Peter ; Tanger, Ralf
Author_Institution :
Fraunhofer Inst. for Telecommun., Berlin
Volume :
10
Issue :
3
fYear :
2008
fDate :
4/1/2008 12:00:00 AM
Firstpage :
352
Lastpage :
360
Abstract :
Recent progress in advanced video communication services and multimedia applications is grounded on novel human machine interfaces, improved usability, and user friendliness driven by user centric research and development. In this paper, we describe a complete system concept and algorithmic details of an example application within this area. The key features of the system are vision and speech based interfaces, which are used to animate an avatar for an audio-visual representation of a communication partner. The system is applied in two application scenarios, namely video chat and customer care services. Both applications are mass-market oriented and therefore careful design and development of robust and supporting user interfaces are required. The presented approach is integrated into a complete real-time prototype system, which is permanently demonstrated in the showcase at the head quarter of Deutsche Telekom, Bonn, Germany.
Keywords :
avatars; multimedia computing; speech-based user interfaces; audio-visual representation; human machine interfaces; multimedia applications; real-time vision avatars; speech based interfaces; speech driven avatars; vision based interfaces; Avatar; multimodality; real-time tracking; segmentation;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2008.917336
Filename :
4456693
Link To Document :
بازگشت