DocumentCode
2698065
Title
Visual speech synthesis based on Chinese dynamic visemes
Author
Zhao, Hui ; Tang, Chaojing
Author_Institution
Coll. of Electron. Sci. & Eng., Nat. Univ. of Defence Technol., Changsha
fYear
2008
fDate
20-23 June 2008
Firstpage
139
Lastpage
143
Abstract
In order to realize realistic visual speech synthesis, a visual speech synthesis method based on Chinese dynamic visemes is proposed. With mouth feature parameters of Chinese static visemes, consonants and vowels are classified using clustering algorithm. According to Chinese pronunciation characters, we can get 40 basic dynamic visemes by combining consonant types and vowel types. With these dynamic visemes and corresponding phonemes, two-layer hidden Markov model (HMM) is built up and trained. Experimental results show that, for speeches which are chosen randomly, the synthetic visual speech is smooth and realistic.
Keywords
feature extraction; hidden Markov models; natural language processing; pattern classification; pattern clustering; speech synthesis; Chinese dynamic visemes; Chinese pronunciation characters; HMM; clustering algorithm; mouth feature parameters; two-layer hidden Markov model; visual speech synthesis; Automation; Band pass filters; Chaotic communication; Clustering algorithms; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Mouth; Speech recognition; Speech synthesis; Dynamic viseme; Hidden Markov model; Visual speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Automation, 2008. ICIA 2008. International Conference on
Conference_Location
Changsha
Print_ISBN
978-1-4244-2183-1
Electronic_ISBN
978-1-4244-2184-8
Type
conf
DOI
10.1109/ICINFA.2008.4607983
Filename
4607983
Link To Document