DocumentCode :
2548773
Title :
Research on Visual Speech Feature Extraction
Author :
Jun, He ; Hua, Zhang
Author_Institution :
Jiangxi key Lab. of Robot & Welding, Inf. Eng. Coll., NanChang Univ. Nan Chang, Nanchang
Volume :
2
fYear :
2009
fDate :
22-24 Jan. 2009
Firstpage :
499
Lastpage :
502
Abstract :
To solve the problem of extracting visual feature in lipreading, a new method based on DCT+LDA is proposed in this paper. First, region of interest (ROI) is located based on the lip contour information, and then discrete cosine transformation (DCT) is performed on ROI. In order to extract the most discriminative feature vectors from the DCT coefficients and further reduce the feature dimensionality, linear discriminative analysis (LDA) is then introduced. Experiments were performed on speaker-dependent (SD) and speaker-independent (SI) bimodal database respectively, the experimental results showed that this algorithm achieved high recognition accuracy than traditional Zig-Zag DCT coefficients selection method and DCT+PCA algorithm. finally, this algorithm is also justified on our real-time lipreading platform.
Keywords :
discrete cosine transforms; edge detection; feature extraction; speaker recognition; discrete cosine transformation; discriminative feature vector extraction; feature dimensionality reduction; linear discriminative analysis; lip contour information; lip reading; recognition accuracy; region of interest; speaker-dependent bimodal database; speaker-independent bimodal database; visual speech feature extraction; Audio databases; Automatic speech recognition; Data mining; Discrete cosine transforms; Feature extraction; Image databases; Principal component analysis; Spatial databases; Vectors; Visual databases; DCT; LDA; feature extraction; lipreading;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering and Technology, 2009. ICCET '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-3334-6
Type :
conf
DOI :
10.1109/ICCET.2009.63
Filename :
4769653
Link To Document :
بازگشت