• DocumentCode
    463350
  • Title

    GA-Based Speaking Mouth Correlative Speech Feature Abstraction

  • Author

    Jia, Xibin ; Yin, Baocai ; Sun, Yanfeng ; Lin, Xianping

  • Author_Institution
    Multimedia & Intelligent Software Technol., Beijing Univ. of Technol.
  • Volume
    1
  • fYear
    2006
  • fDate
    17-19 July 2006
  • Firstpage
    114
  • Lastpage
    119
  • Abstract
    The image-based lip animation synthesis approach is one kind of promising method that synthesizes the believable talking head. This paper seeks to show an improvement in the accuracy of mouth prediction with the speech stimulus, as well as showing the method used to extract the speaking mouth correlative speech feature. Our lip animation synthesis system is based on the construction of a frame level audiovisual mapping model between the acoustic speech class and speaking mouth image class. Taking the mapping model as a basis, genetic algorithm is used to extract the speaking mouth correlative speech feature. The key step used in this study is: fitness and coding scheme designing. Experimental results show that the extracted speech feature has a better correlation with the corresponding speaking mouth, compared to the single or mixed LPCC and MFCC. More research will be done in this specialist field of study the multi-layer speaking mouth correlative speech feature abstraction structure, and will attempt to show that the speaking mouth correlative speech feature should have better results
  • Keywords
    computer animation; feature extraction; genetic algorithms; image processing; speech processing; acoustic speech class; audiovisual mapping model; coding scheme; feature extraction; fitness designing; genetic algorithm; image-based lip animation synthesis; mouth prediction; speaking mouth correlative speech feature abstraction; speaking mouth image class; speech processing; speech stimulus; Acoustic testing; Animation; Image databases; Magnetic heads; Mouth; Natural languages; Speech processing; Speech recognition; Speech synthesis; Visual databases; audiovisual mapping; coding scheme; fitness designing; speaking mouth correlative speech feature abstraction; speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics, 2006. ICCI 2006. 5th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0475-4
  • Type

    conf

  • DOI
    10.1109/COGINF.2006.365685
  • Filename
    4216400