• DocumentCode
    1804187
  • Title

    Discrimination comparison between audio and visual features

  • Author

    Chao Sui ; Togneri, Roberto ; Haque, Showera ; Bennamoun, Mohammed

  • Author_Institution
    Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA, Australia
  • fYear
    2012
  • fDate
    4-7 Nov. 2012
  • Firstpage
    1609
  • Lastpage
    1612
  • Abstract
    This paper aims at comparing the discrimination between audio, 2D-based visual and 3D-based visual features for the speech recognition purpose. The audio and visual feature extraction schemes and several feature selection techniques are described first in this paper. With the application of the described feature extraction and selection methods, several experiments are conducted to compare the discrimination of the audio features, the 2D visual features and the 3D visual features for the hVd words classification task. In our study, it is found that the 3D visual features have more separability than the 2D visual features, so that the 3D-based audio-visual speech recognition may achieve more desirable results than the traditional 2D-based counterpart.
  • Keywords
    feature extraction; speech recognition; 2D-based visual feature extraction scheme; 3D-based audio-visual speech recognition; 3D-based visual feature extraction scheme; audio feature extraction scheme; feature selection technique; hVd words classification task;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA
  • ISSN
    1058-6393
  • Print_ISBN
    978-1-4673-5050-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2012.6489302
  • Filename
    6489302