Title :
Speech-to-image media conversion based on VQ and neural network
Author :
Morishima, Shigeo ; Harashima, Hiroshi
Author_Institution :
Fac. of Eng., Seikei Univ., Tokyo, Japan
Abstract :
Automatic media conversion schemes from speech to facial images and the construction of a real-time image synthesis system are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with synthesized human face images. A human face image is reconstructed on the display of a terminal using a 3-D surface model and a texture mapping technique. Facial motion images are synthesized by transforming the 3-D model. With a motion driving method based on vector quantization and a neural network, the synthesized head image can appear to speak given words and phrases naturally, in synchronization with voice signals from a speaker.
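Illustrative sketch (not taken from the paper): the abstract names vector quantization as one basis of the motion driving method but gives no details. The Python below shows only the general idea of a VQ lookup, in which each speech-feature frame is matched to its nearest codebook entry and that entry supplies mouth-shape parameters for the 3-D model. The two-dimensional features, the codebook values, the parameter meanings, and the names `CODEBOOK`, `quantize`, and `drive_mouth` are all assumptions made for this example; the neural network stage is not covered.

```python
import math

# Hypothetical codebook (assumption, not from the paper): each entry pairs a
# speech-feature centroid with mouth-shape parameters, here imagined as
# (jaw opening, lip width). All numbers are illustrative only.
CODEBOOK = [
    {"centroid": (0.0, 0.0), "mouth": (0.0, 0.5)},
    {"centroid": (1.0, 0.2), "mouth": (0.8, 0.6)},
    {"centroid": (0.2, 1.0), "mouth": (0.2, 0.9)},
]


def quantize(frame):
    """Return the index of the codebook centroid nearest to the frame."""
    return min(
        range(len(CODEBOOK)),
        key=lambda i: math.dist(frame, CODEBOOK[i]["centroid"]),
    )


def drive_mouth(frames):
    """Map each speech-feature frame to mouth-shape parameters via VQ lookup."""
    return [CODEBOOK[quantize(frame)]["mouth"] for frame in frames]


if __name__ == "__main__":
    # Two made-up feature frames, one per time step of the voice signal.
    for params in drive_mouth([(0.1, 0.0), (0.1, 0.9)]):
        print(params)
```

In a real system the frames would be acoustic features extracted from the speaker's voice and the output parameters would deform the 3-D surface model frame by frame; both are simplified away here.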
Keywords :
data compression; neural nets; picture processing; real-time systems; speech analysis and processing; user interfaces; 3-D surface model; automatic media conversion; facial image; intelligent communication system; intelligent human-machine interface; motion driving method; neural network; real-time image synthesis system; speech to image media conversion; synthesized head image; synthesized human face images; terminal display; texture mapping; vector quantization; voice signals; Face; Humans; Image converters; Image generation; Man machine systems; Network synthesis; Neural networks; Real time systems; Signal synthesis; Speech synthesis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.151000