Regional Information Center for Science and Technology (RICEST) - Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

DocumentCode :

2731727

Title :

Visualize speech: a continuous speech recognition system for facial animation using acoustic visemes

Author :

Lei, Xie ; Dongmei, Jiang ; Ravyse, Ilse ; Rongchun, Zhao ; Verhelst, Werner ; Sahli, Hichem ; Cornelis, J.

Author_Institution :

Dept. Comput. Sci. & Eng., Northwestern Polytech. Univ., Shaanxi, China

Volume :

fYear :

2003

fDate :

14-17 Dec. 2003

Firstpage :

872

Abstract :

This paper presents an acoustic-viseme-based continuous speech recognition system for speech-driven talking-face animation. The system is built on viseme HMMs and takes acoustic speech as its only input. Triseme HMMs are adopted to model mouth-shape context. Visual decision trees are introduced so that the triseme HMM parameters can be trained robustly from limited training data. During tree building, the visual questions are designed using methods based on lip rounding and on the similarity of viseme shapes. Objective and subjective evaluations show that talking-face animation driven by the proposed recognition system outperforms the conventional phoneme-based approach, and that visually relevant speech segmentation information can be obtained from the acoustic speech signal alone.
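
As an illustration of the triseme idea mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. The phoneme-to-viseme table, the viseme class names, and the helper functions `to_visemes` and `to_trisemes` are all hypothetical; the "left-centre+right" label format is borrowed from the HTK-style triphone notation as an assumption. It only shows how a context-independent viseme sequence might be expanded into context-dependent triseme labels, each of which would then be modelled by its own HMM.

```python
from typing import Dict, List

# Hypothetical, heavily simplified phoneme-to-viseme table.
# The classes and their members are illustrative only and are not taken from the paper.
PHONEME_TO_VISEME: Dict[str, str] = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "uw": "V_rounded", "ow": "V_rounded",
    "aa": "V_open", "ae": "V_open",
    "sil": "V_sil",
}


def to_visemes(phonemes: List[str]) -> List[str]:
    """Map each phoneme to its (hypothetical) viseme class."""
    return [PHONEME_TO_VISEME[ph] for ph in phonemes]


def to_trisemes(visemes: List[str]) -> List[str]:
    """Label each viseme with its left and right neighbours.

    Silence is assumed at both utterance boundaries.
    Labels use the form 'left-centre+right'.
    """
    padded = ["V_sil"] + visemes + ["V_sil"]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


if __name__ == "__main__":
    visemes = to_visemes(["b", "aa", "m"])
    for label in to_trisemes(visemes):
        print(label)
```

Because the number of distinct triseme labels grows roughly with the cube of the number of viseme classes, many labels will have few or no training examples; this is the data sparsity that the abstract's visual decision trees are meant to address by tying similar contexts together.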

Keywords :

computer animation; hidden Markov models; speech recognition; acoustic viseme based continuous speech recognition system; lip rounding; objective evaluations; speech driven talking face animation; speech segmentation; subjective evaluations; training data; triseme HMM; Buildings; Decision trees; Facial animation; Mouth; Robustness; Shape; Speech analysis; Speech recognition; Training data; Visualization

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Proceedings of the 2003 International Conference on Neural Networks and Signal Processing

Conference_Location :

Nanjing

Print_ISBN :

0-7803-7702-8

Type :

conf

DOI :

10.1109/ICNNSP.2003.1280738

Filename :

1280738

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2731727