• DocumentCode
    2507760
  • Title

    New features extracted from Nyquist filter bank for text-independent speaker identification

  • Author

    Sen, Nirmalya ; Basu, T.K. ; Patil, Hemant A.

  • Author_Institution
    Signal Process. Res. Group, Indian Inst. of Technol., Kharagpur, India
  • fYear
    2010
  • fDate
    17-19 Dec. 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper introduces the use of a new method of feature extraction based on frequency-time analysis approach for text-independent speaker identification. The impetus for this new feature extraction technique comes from the filter bank summation method of STFT using Nyquist filter bank. The focus of this work is on applications which yield higher identification accuracy without increasing the computational effort. We have proposed this transform from speaker identification perspective. The proposed transformation can be used for both uniform width filter bank and non-uniform width filter bank representation. A complete experimental evaluation was conducted on a database of 61 speakers with Gaussian mixture speaker model. This new feature extraction technique has been compared with Mel-frequency cepstral coefficient (MFCC) feature. The average accuracy of MFCC feature set was 88.05%. The average accuracy of proposed feature set with uniform width filter bank and non uniform width filter bank was 90.24% and 90.42% respectively. The average accuracy was 92.26% after score level fusion of uniform width filter bank feature and non uniform width filter bank feature of the proposed transformation. The discrimination capability of the proposed feature sets have been evaluated statistically using F-ratio and J-measure. Experimental results show that the proposed feature sets have higher discrimination capability compared to MFCC feature set.
  • Keywords
    Gaussian processes; channel bank filters; feature extraction; speaker recognition; statistical analysis; F-ratio; Gaussian mixture speaker model; J-measure; MFCC feature; Mel-frequency cepstral coefficient feature; Nyquist filter bank summation method; STFT; feature extraction technique; frequency-time analysis approach; nonuniform width filter bank representation; score level fusion; text-independent speaker identification; uniform width filter bank; Accuracy; Conferences; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Speech; Speech recognition; Feature extraction; GMM; Nyquist filter; TESBCC; score level fusion; speaker identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2010 Annual IEEE
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-9072-1
  • Type

    conf

  • DOI
    10.1109/INDCON.2010.5712689
  • Filename
    5712689