DocumentCode
3157909
Title
A Natural Chinese Speech Driven Mouth Animation System
Author
Xu, Ming ; Ouyang, Jianjun ; Huang, Yunsen
Author_Institution
Inf. Center, Shenzhen Univ., Shenzhen
fYear
2007
fDate
22-24 Aug. 2007
Firstpage
642
Lastpage
646
Abstract
In contrast to phoneme-based research on human mouth animation, this paper presents a novel mouth animation approach driven by natural speech. To recognize the sequence of mouth shapes from continuous speech, a context-dependent viseme (triseme) modeling technique is employed to obtain trisemic HMMs. To estimate robust model parameters from limited training data, a state-tying procedure is introduced. To address compatibility and ambiguity issues, the visemic questions assigned to the leaf nodes of the decision tree are generated from the training data. With the trained HMM parameters, the Viterbi beam search algorithm is applied to time-align the trisemic sequences. The recognized trisemes are then mapped to the corresponding mouth shapes represented by MPEG-4 FAPs, and the speaking mouth is animated after a smoothing process. Experimental results show that the recognition accuracy is adequate for the application and that the recognition and alignment speed is acceptable for human visual perception.
Keywords
computer animation; hidden Markov models; image coding; image representation; image sequences; MPEG-4 FAP; Viterbi beam searching algorithm; context-dependent viseme modeling technique; mouth animation system; mouth shapes sequence; natural Chinese speech driven system; phoneme based human mouth animation; states tying procedure; triseme modeling technique; trisemic HMM; Animation; Context modeling; Hidden Markov models; Humans; Mouth; Natural languages; Robustness; Shape; Speech recognition; Training data; MPEG-4 FAPs; Speech driven; mouth animation; triseme; viseme;
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications and Networking in China, 2007. CHINACOM '07. Second International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-1008-8
Electronic_ISBN
978-1-4244-1009-5
Type
conf
DOI
10.1109/CHINACOM.2007.4469473
Filename
4469473