• DocumentCode
    2772317
  • Title

    Comparison of features extracted using time-frequency and frequency-time analysis approach for text-independent speaker identification

  • Author

    Sen, Nirmalya ; Basu, Tapan ; Chakroborty, Sandipan

  • Author_Institution
    CET, Signal Process. Res. Group, IIT Kharagpur, Kharagpur, India
  • fYear
    2011
  • fDate
    28-30 Jan. 2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper compares feature sets extracted using the time-frequency analysis approach and the frequency-time analysis approach for text-independent speaker identification. The Mel-frequency cepstral coefficient (MFCC) and inverted Mel-frequency cepstral coefficient (IMFCC) feature sets are extracted using the time-frequency analysis approach. The temporal energy subband cepstral coefficient (TESBCC) feature set is extracted using the frequency-time analysis approach. The time-bandwidth products of the MFCC filter bank and the TESBCC filter bank are compared. The RV coefficient is used to calculate the correlation between the feature sets. Experimental evaluation was conducted on the POLYCOST database with 130 speakers using a Gaussian mixture speaker model. The TESBCC feature set has 9.5% higher average accuracy than the MFCC feature set. The feature set extracted using the time-frequency analysis approach is found to be practically uncorrelated with the feature set extracted using the frequency-time analysis approach. It is also demonstrated that the IMFCC feature set plays an important role in fusion.
  • Keywords
    cepstral analysis; channel bank filters; feature extraction; speaker recognition; time-frequency analysis; Gaussian mixture speaker model; MFCC filter bank; POLYCOST database; TESBCC filter bank; frequency-time analysis; inverted Mel-frequency cepstral coefficient; temporal energy subband cepstral coefficient; text-independent speaker identification; Accuracy; Filter banks; Finite impulse response filter; Mel frequency cepstral coefficient; Speech; GMM; Nyquist filter
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications (NCC), 2011 National Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-61284-090-1
  • Type

    conf

  • DOI
    10.1109/NCC.2011.5734720
  • Filename
    5734720