DocumentCode
3162679
Title
Spectro-temporal Gabor features for speaker recognition
Author
Lei, Howard ; Meyer, Bernd T. ; Mirghafori, Nikki
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
4241
Lastpage
4244
Abstract
In this work, we have investigated the performance of 2D Gabor features (known as spectro-temporal features) for speaker recognition. Gabor features have been used mainly for automatic speech recognition (ASR), where they have yielded improvements. We explored different Gabor feature implementations, along with different speaker recognition approaches, on ROSSI [1] and NIST SRE08 databases. Using the noisy ROSSI database, the Gabor features performed as well as the MFCC features standalone, and score-level combination of Gabor and MFCC features resulted in an 8% relative EER improvement over MFCC features standalone. These results demonstrated the value of both spectral and temporal information for feature extraction, and the complementarity of Gabor features to MFCC features.
Keywords
Gabor filters; feature extraction; speech recognition; 2D Gabor features; MFCC features standalone; NIST SRE08 database; automatic speech recognition; noisy ROSSI database; score-level combination; spectro-temporal Gabor features; Databases; Feature extraction; Frequency modulation; Mel frequency cepstral coefficient; NIST; Speaker recognition; Training; Gabor features; ROSSI database; Speaker recognition; spectral and temporal modulation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288855
Filename
6288855
Link To Document