DocumentCode
1882764
Title
Gender identification using a general audio classifier
Author
Harb, Hadi ; Chen, Liming
Author_Institution
Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France
Volume
2
fYear
2003
fDate
6-9 July 2003
Abstract
In the context of content-based multimedia indexing, gender identification from the speech signal is an important task. Existing techniques depend on the quality of the speech signal, which makes them unsuitable for video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first-order statistics of the spectrum in 1 s windows and uses a set of neural networks as classifiers. The presented technique is robust to adverse audio compression and is language independent. We show how practical considerations about speech in audio-visual data, such as the continuity of speech, can further improve the classification results, which reach 92%.
Keywords
audio signal processing; indexing; neural nets; spectral analysis; speech recognition; audio classifier; audio compression; audio-visual data; content-based multimedia indexing; gender identification; neural networks; spectrum statistics; speech signal; Audio compression; Automatic speech recognition; Context modeling; Indexing; Mel frequency cepstral coefficient; Neural networks; Robustness; Signal processing; Speech recognition; Statistics
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN
0-7803-7965-9
Type
conf
DOI
10.1109/ICME.2003.1221721
Filename
1221721