Title :
A Visual Speech Feature to Indentify the Speaking States from Video
Author :
Xibin Jia ; Baocai Yin ; Yanfen Sun
Author_Institution :
Beijing Municipal Key Lab., Beijing Univ. of Technol., Beijing, China
Abstract :
The paper proposes a kind of visual speech feature for the speaking mouth images from the video combining clues of the shape and local teeth texture. The geometric feature we proposed based on the computing the Euclidian distant between each the feature point around the inner and outer lip. The local texture with G and B components as baseline is employed to calculate the color moment to describe the visibility of teeth. The weighted fusion is used to combine the two features. The k-mean algorithm is utilized to analyze the feature performance according to evaluate the clustering results. The results show that with G and B color component to derive the local texture to model the teeth visibility are better than the others and our feature has higher ability to perceive the visemes than the PCA and geometric feature only.
Keywords :
feature extraction; image colour analysis; image texture; pattern clustering; video signal processing; Euclidian distance; color component; geometric feature; k-mean algorithm; local texture; speaking mouth image; speaking state identification; visual speech feature; Computational modeling; Feature extraction; Image color analysis; Speech; Teeth; Visualization;
Conference_Titel :
Multimedia Technology (ICMT), 2010 International Conference on
Conference_Location :
Ningbo
Print_ISBN :
978-1-4244-7871-2
DOI :
10.1109/ICMULT.2010.5629829