• DocumentCode
    1882764
  • Title

    Gender identification using a general audio classifier

  • Author

    Harb, Hadi ; Chen, Liming

  • Author_Institution
    Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France
  • Volume
    2
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum´s statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.
  • Keywords
    audio signal processing; indexing; neural nets; spectral analysis; speech recognition; audio classifier; audio compression; audio-visual data; content-based multimedia indexing; gender identification; neural networks; spectrum statistics; speech signal; Audio compression; Automatic speech recognition; Context modeling; Indexing; Mel frequency cepstral coefficient; Neural networks; Robustness; Signal processing; Speech recognition; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1221721
  • Filename
    1221721