DocumentCode
3298892
Title
A speech driven talking head system based on a single face image
Author
Lin, I-Chen ; Hung, Cheng-Sheng ; Yang, Tzong-Jer ; Ouhyoung, Ming
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
1999
fDate
1999
Firstpage
43
Abstract
In this paper, a lifelike talking head system is proposed. The talking head, which is driven by speaker independent speech recognition, requires only one single face image to synthesize lifelike facial expression. The proposed system uses speech recognition engines to get utterances and corresponding time stamps in the speech data. Associated facial expressions can be fetched from an expression pool and the synthetic facial expression can then be synchronized with speech. When applied to Internet, our web-enabled talking head system can be a vivid merchandise narrator, and only requires 50 K bytes/minute with an additional face image (about 40 Kbytes in CIF format, 24 bit-color, JPEG compression). The system can synthesize facial animation more than 30 frames/sec on a Pentium II 266 MHz PC
Keywords
computer animation; speech recognition; JPEG compression; expression pool; lifelike facial expression; lifelike talking head system; merchandise narrator; single face image; speaker independent speech recognition; speech driven talking head system; speech recognition engines; synthetic facial expression; time stamps; web-enabled talking head system; Computer science; Facial animation; Head; Internet; Laboratories; Multimedia communication; Multimedia systems; Search engines; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Graphics and Applications, 1999. Proceedings. Seventh Pacific Conference on
Conference_Location
Seoul
Print_ISBN
0-7695-0293-8
Type
conf
DOI
10.1109/PCCGA.1999.803347
Filename
803347
Link To Document