• DocumentCode
    3162679
  • Title

    Spectro-temporal Gabor features for speaker recognition

  • Author

    Lei, Howard ; Meyer, Bernd T. ; Mirghafori, Nikki

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4241
  • Lastpage
    4244
  • Abstract
    In this work, we have investigated the performance of 2D Gabor features (known as spectro-temporal features) for speaker recognition. Gabor features have been used mainly for automatic speech recognition (ASR), where they have yielded improvements. We explored different Gabor feature implementations, along with different speaker recognition approaches, on ROSSI [1] and NIST SRE08 databases. Using the noisy ROSSI database, the Gabor features performed as well as the MFCC features standalone, and score-level combination of Gabor and MFCC features resulted in an 8% relative EER improvement over MFCC features standalone. These results demonstrated the value of both spectral and temporal information for feature extraction, and the complementarity of Gabor features to MFCC features.
  • Keywords
    Gabor filters; feature extraction; speech recognition; 2D Gabor features; MFCC features standalone; NIST SRE08 database; automatic speech recognition; noisy ROSSI database; score-level combination; spectro-temporal Gabor features; Databases; Feature extraction; Frequency modulation; Mel frequency cepstral coefficient; NIST; Speaker recognition; Training; Gabor features; ROSSI database; Speaker recognition; spectral and temporal modulation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288855
  • Filename
    6288855