Visual speech synthesis based on Chinese dynamic visemes

Author

Zhao, Hui ; Tang, Chaojing

Author_Institution

Coll. of Electron. Sci. & Eng., Nat. Univ. of Defence Technol., Changsha

fYear

2008

fDate

20-23 June 2008

Firstpage

139

Lastpage

143

Abstract

In order to realize realistic visual speech synthesis, a visual speech synthesis method based on Chinese dynamic visemes is proposed. With mouth feature parameters of Chinese static visemes, consonants and vowels are classified using clustering algorithm. According to Chinese pronunciation characters, we can get 40 basic dynamic visemes by combining consonant types and vowel types. With these dynamic visemes and corresponding phonemes, two-layer hidden Markov model (HMM) is built up and trained. Experimental results show that, for speeches which are chosen randomly, the synthetic visual speech is smooth and realistic.

Keywords

feature extraction; hidden Markov models; natural language processing; pattern classification; pattern clustering; speech synthesis; Chinese dynamic visemes; Chinese pronunciation characters; HMM; clustering algorithm; mouth feature parameters; two-layer hidden Markov model; visual speech synthesis; Automation; Band pass filters; Chaotic communication; Clustering algorithms; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Mouth; Speech recognition; Speech synthesis; Dynamic viseme; Hidden Markov model; Visual speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Information and Automation, 2008. ICIA 2008. International Conference on

Conference_Location

Changsha

Print_ISBN

978-1-4244-2183-1

Electronic_ISBN

978-1-4244-2184-8

Type

conf

DOI

10.1109/ICINFA.2008.4607983

Filename

4607983