DocumentCode :
1065905
Title :
Real-Time Multimodal Human–Avatar Interaction
Author :
Fu, Yun ; Li, Renxiang ; Huang, Thomas S. ; Danielsen, Mike
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Univ. of Illinois at Urbana-Champaign, Urbana, IL
Volume :
18
Issue :
4
fYear :
2008
fDate :
4/1/2008
Firstpage :
467
Lastpage :
477
Abstract :
This paper presents a novel real-time multimodal human-avatar interaction (RTM-HAI) framework with vision-based remote animation control (RAC). The framework is designed for both mobile and desktop avatar-based human-machine or human-human visual communications in real-world scenarios. Using 3-D components stored in the Java mobile 3-D (M3G) file format, the avatar models can be flexibly constructed and customized on the fly on any mobile device or system that supports the M3G standard. For the RAC head tracker, we propose a 2-D real-time face detection/tracking strategy through an interactive loop, in which detection and tracking complement each other for efficient and reliable face localization, tolerating extreme user movement. With the face location robustly tracked, the RAC head tracker selects a main user and estimates the user's head rolling, tilting, yawing, scaling, horizontal, and vertical motion in order to generate avatar animation parameters. The animation parameters can be used either locally or remotely and can be transmitted through a socket over the network. In addition, the framework integrates audio-visual analysis and synthesis modules to realize multichannel and runtime animations, visual TTS, and real-time viseme detection and rendering. The framework is recognized as an effective design for future realistic industrial products for humanoid kiosks and human-to-human mobile communication.
Keywords :
avatars; face recognition; user interfaces; Java mobile 3-D file format; audio-visual analysis; face location; human-human visual communications; human-to-human mobile communication; humanoid kiosk; mobile devices; real-time multimodal human-avatar interaction; remote animation control; avatar; DAZ3D; M3G; RTM-HAI; TTS; head tracking; human–computer interaction; mobile 3-D (M3G); multimodal system; real-time multimodal human–avatar interaction (RTM-HAI); visual communication
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
IEEE
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2008.918441
Filename :
4449082