مرکز منطقه ای اطلاع رساني علوم و فناوري - A Visual Speech Feature to Indentify the Speaking States from Video

DocumentCode :

2978774

Title :

A Visual Speech Feature to Indentify the Speaking States from Video

Author :

Xibin Jia ; Baocai Yin ; Yanfen Sun

Author_Institution :

Beijing Municipal Key Lab., Beijing Univ. of Technol., Beijing, China

fYear :

2010

fDate :

29-31 Oct. 2010

Firstpage :

Lastpage :

Abstract :

The paper proposes a kind of visual speech feature for the speaking mouth images from the video combining clues of the shape and local teeth texture. The geometric feature we proposed based on the computing the Euclidian distant between each the feature point around the inner and outer lip. The local texture with G and B components as baseline is employed to calculate the color moment to describe the visibility of teeth. The weighted fusion is used to combine the two features. The k-mean algorithm is utilized to analyze the feature performance according to evaluate the clustering results. The results show that with G and B color component to derive the local texture to model the teeth visibility are better than the others and our feature has higher ability to perceive the visemes than the PCA and geometric feature only.

Keywords :

feature extraction; image colour analysis; image texture; pattern clustering; video signal processing; Euclidian distance; color component; geometric feature; k-mean algorithm; local texture; speaking mouth image; speaking state identification; visual speech feature; Computational modeling; Feature extraction; Image color analysis; Speech; Teeth; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia Technology (ICMT), 2010 International Conference on

Conference_Location :

Ningbo

Print_ISBN :

978-1-4244-7871-2

Type :

conf

DOI :

10.1109/ICMULT.2010.5629829

Filename :

5629829

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2978774