DocumentCode
2970622
Title
Multi-modal translation system and its evaluation
Author
Morishima, Shigeo ; Nakamura, Satoshi
Author_Institution
Fac. of Eng., Seikei Univ., Tokyo, Japan
fYear
2002
fDate
2002
Firstpage
241
Lastpage
246
Abstract
Speech-to-speech translation has been studied to realize natural human communication beyond language barriers. Toward further multi-modal natural communication, visual information such as face and lip movements will be necessary. We introduce a multi-modal English-to-Japanese and Japanese-to-English translation system that also translates the speaker´s speech motion while synchronizing it to the translated speech. To retain the speaker´s facial expression, we substitute only the speech organ´s image with the synthesized one, which is made by a three-dimensional wire-frame model that is adaptable to any speaker. Our approach enables image synthesis and translation with an extremely small database. We conduct subjective evaluation tests using the connected digit discrimination test using data with and without audio-visual lip-synchronization. The results confirm the significant quality of the proposed audio-visual translation system and the importance of lip-synchronization.
Keywords
computer animation; image motion analysis; language translation; speech recognition; speech synthesis; speech-based user interfaces; synchronisation; visual databases; English-to-Japanese translation; Japanese-to-English translation; audio-visual lip-synchronization; computer animation; connected digit discrimination test; face movements; facial expression; image database; image synthesis; lip movements; multimodal translation system; natural human communication; speech-to-speech translation; three-dimensional wire-frame model; visual information; Face detection; Head; Humans; Image databases; Image generation; Loudspeakers; Mouth; Natural languages; Speech synthesis; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
Print_ISBN
0-7695-1834-6
Type
conf
DOI
10.1109/ICMI.2002.1167000
Filename
1167000
Link To Document