Gender identification using a general audio classifier

Author

Harb, Hadi ; Chen, Liming

Author_Institution

Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France

Volume

2

fYear

2003

fDate

6-9 July 2003

Abstract

In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum´s statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.

Keywords

audio signal processing; indexing; neural nets; spectral analysis; speech recognition; audio classifier; audio compression; audio-visual data; content-based multimedia indexing; gender identification; neural networks; spectrum statistics; speech signal; Audio compression; Automatic speech recognition; Context modeling; Indexing; Mel frequency cepstral coefficient; Neural networks; Robustness; Signal processing; Speech recognition; Statistics;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on

Print_ISBN

0-7803-7965-9

Type

conf

DOI

10.1109/ICME.2003.1221721

Filename

1221721