DocumentCode :
3121644
Title :
Triseme decision trees in the continuous speech recognition system for a talking head
Author :
Jiang, Dong-mei ; Xie, Lei ; Ravyse, I. ; Zhao, Rong-chun ; Sahli, Hichem ; Cornelis, Jan
Author_Institution :
Dept. Comput. Sci. & Eng., Northwestern Polytech. Univ., Xi´´an, China
Volume :
4
fYear :
2002
fDate :
4-5 Nov. 2002
Firstpage :
2097
Abstract :
In this paper, we present a viseme (the basic speech units In the visual domain) based continuous speech recognition system, which segments speech into viseme sequences with timing boundaries to drive a talking head. In the viseme Hidden Markov Model (HMM) training, the instances of a viseme with different contexts are formulated as trisemes. Based on the mouth shape parameters Liprounding and the defined viseme similarity weight (VSW) from the 3D viseme facial models, 166 questions concerning the viseme contexts are designed to build triseme decision trees to tie the states of the trisemes with similar contexts, so that they can share the same parameters. To evaluate the system performance, the image related measurements are also taken to evaluate the resulting viseme sequences, with ´jerky instances´ in Liprounding and VSW graphs evaluating their smoothness. Results show that compared to the phoneme based system, the tied-state triseme based speech recognition system gives talking head animation with smoother and more plausible mouth shapes.
Keywords :
computer animation; decision trees; hidden Markov models; speech recognition; Hidden Markov Model; VSW graphs; animation; decision tree; facial models; liprounding; mouth shape parameters; speech recognition; talking head; viseme; Animation; Context modeling; Decision trees; Head; Hidden Markov models; Mouth; Shape; Speech recognition; System performance; Timing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
Type :
conf
DOI :
10.1109/ICMLC.2002.1175408
Filename :
1175408
Link To Document :
بازگشت