DocumentCode
1804187
Title
Discrimination comparison between audio and visual features
Author
Chao Sui ; Togneri, Roberto ; Haque, Showera ; Bennamoun, Mohammed
Author_Institution
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA, Australia
fYear
2012
fDate
4-7 Nov. 2012
Firstpage
1609
Lastpage
1612
Abstract
This paper aims at comparing the discrimination between audio, 2D-based visual and 3D-based visual features for the speech recognition purpose. The audio and visual feature extraction schemes and several feature selection techniques are described first in this paper. With the application of the described feature extraction and selection methods, several experiments are conducted to compare the discrimination of the audio features, the 2D visual features and the 3D visual features for the hVd words classification task. In our study, it is found that the 3D visual features have more separability than the 2D visual features, so that the 3D-based audio-visual speech recognition may achieve more desirable results than the traditional 2D-based counterpart.
Keywords
feature extraction; speech recognition; 2D-based visual feature extraction scheme; 3D-based audio-visual speech recognition; 3D-based visual feature extraction scheme; audio feature extraction scheme; feature selection technique; hVd words classification task;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
978-1-4673-5050-1
Type
conf
DOI
10.1109/ACSSC.2012.6489302
Filename
6489302
Link To Document