DocumentCode
2713161
Title
Voice-based gender identification via multiresolution frame classification of spectro-temporal maps
Author
Abdollahi, M. ; Valavi, E. ; Noubari, H. Ahmadi
Author_Institution
Electr. & Comput. Eng. Dept., Univ. of Tehran, Tehran, Iran
fYear
2009
fDate
14-19 June 2009
Firstpage
1
Lastpage
4
Abstract
This paper presents a novel approach to gender identification based on adaptive multiresolution (MR) classification of spectro-temporal maps. The images of speech signals in this work are mainly provided by auditory inspired spectro-temporal representations: mel-spemelctrogram, cochleagram and auditory spectrogram. The 2-D representation of a segment of an utterance is used as the input to the system. The system adds MR decomposition in front of a generic classifier consisting of feature extraction and classification in each MR subspace, finally combined into a global decision using a weighting algorithm. It has been shown that the accuracy of the proposed method, by rising up to 99%, significantly outperforms the accuracy of most of other common algorithms which combine pitch and acoustical features for gender identification.
Keywords
feature extraction; speech recognition; auditory spectrogram; cochleagram; feature extraction; mel-spemelctrogram; multiresolution frame classification; spectro-temporal maps; speech signals; voice-based gender identification; weighting algorithm; Automatic speech recognition; Classification algorithms; Feature extraction; Image segmentation; Linear predictive coding; Neural networks; Signal processing algorithms; Signal resolution; Spectrogram; Uncertainty; gender classification; multiresolution (MR) techniques; spectro-temporal map; wavelet frame;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2009. IJCNN 2009. International Joint Conference on
Conference_Location
Atlanta, GA
ISSN
1098-7576
Print_ISBN
978-1-4244-3548-7
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2009.5178984
Filename
5178984
Link To Document