DocumentCode :
2038551
Title :
Speech-to-image media conversion based on VQ and neural network
Author :
Morishima, Shigeo ; Harashima, Hiroshi
Author_Institution :
Fac. of Eng., Seikei Univ., Tokyo, Japan
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
2865
Abstract :
Automatic media conversion schemes from speech to a facial image and the construction of a real-time image synthesis system are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with synthesized human face images. A human face image is reconstructed on the display of a terminal using a 3-D surface model and a texture mapping technique. Facial motion images are synthesized by transformation of the 3-D model. With the motion driving method, which is based on vector quantization and a neural network, the synthesized head image can appear to speak given words and phrases naturally, in synchronization with voice signals from a speaker.
Keywords :
data compression; neural nets; picture processing; real-time systems; speech analysis and processing; user interfaces; 3-D surface model; automatic media conversion; facial image; intelligent communication system; intelligent human-machine interface; motion driving method; neural network; real-time image synthesis system; speech to image media conversion; synthesized head image; synthesized human face images; terminal display; texture mapping; vector quantization; voice signals; Face; Humans; Image converters; Image generation; Man machine systems; Network synthesis; Neural networks; Real time systems; Signal synthesis; Speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.151000
Filename :
151000