DocumentCode :
357106
Title :
Translingual visual speech synthesis
Author :
Faruquie, Tanveer A. ; Neti, Chalapathy ; Rajput, Nitendra ; Subramaniam, L. Venkata ; Verma, Ashish
Author_Institution :
IBM India Res. Lab., New Delhi, India
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
1089
Abstract :
Audio-driven facial animation is an interesting and evolving technique for human-computer interaction. Based on an incoming audio stream, a face image is animated with full lip synchronization. This requires a speech recognition system for the language of the audio in order to obtain the time alignment of the phonetic sequence of the audio signal. However, building a speech recognition system is data-intensive, tedious, and time-consuming. We present a novel scheme to implement a language-independent system for audio-driven facial animation given a speech recognition system for just one language, in our case, English. The method presented here can also be used for text-to-audio-visual speech synthesis.
Keywords :
computer animation; language translation; natural language interfaces; speech recognition; speech synthesis; English; audio signal; audio-driven facial animation; audio-visual speech synthesis; face image; full lip synchronization; human-computer interaction; incoming audio stream; language independent system; phonetic sequence; speech recognition system; time alignment; time consuming task; translingual visual speech synthesis; Facial animation; Multimedia communication; Natural languages; Rendering (computer graphics); Speech recognition; Speech synthesis; Streaming media; Synthesizers; Telephony; Teleworking;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7803-6536-4
Type :
conf
DOI :
10.1109/ICME.2000.871550
Filename :
871550