• DocumentCode
    3542263
  • Title

    Efficient text independent speaker identification based on GFCC and CMN methods

  • Author

    Tazi, El Bachir ; Benabbou, Adil ; Harti, Mostafa

  • Author_Institution
    Fac. des Sci. Dhar El-Mehraz, Dept. d´´Inf., Univ. Sidi Mohamed Ben Abdallah, Fes, Morocco
  • fYear
    2012
  • fDate
    10-12 May 2012
  • Firstpage
    90
  • Lastpage
    95
  • Abstract
    The performance of automatic speaker identification systems degrade drastically in the presence of noise and other distortions, especially when there is a noise level mismatch between the training and testing environments. In this experimental research we have studied a recently robust front-end algorithm based on Gammatone Frequency Cepstral Coefficients GFCC combined to Cepstral Mean Normalization CMN technique. Our system using a Gaussian Mixture Models GMM classifier are implemented and tested under MATLAB®7 programming environment with multilevel White Gaussian Noise WGN applied to a test utterances using our proper database containing 51 Arabic speakers. Our aim is to study the performance of this suggested architecture and make a comparison with the conventional Mel Frequency Cepstral Coefficients MFCC method which we have successfully implemented and tested in the previous work. The obtained experimental results confirm the superior performance of the proposed method over MFCC and outperform it in different noisy environments. Our evaluations based on the recognition rate accuracy show that both MFCC and the proposed feature extractor have perfects performances in low-noise environments when Signal per Noise Ratio SNR is greater than 35 dB (practically 100% in all cases), but when the SNR of test signal changed from 0 to 40 dB, the average accuracy of the MFCCs methods is only 50.05%, while the proposed GFCCs extractors combined to CMN normalization still achieves an average accuracy of 55.43%.
  • Keywords
    Gaussian processes; cepstral analysis; speaker recognition; text analysis; Arabic speakers; CMN methods; GFCC methods; GMM classifier; Gaussian mixture models; MATLAB programming environment; MFCC method; Mel frequency cepstral coefficients; SNR; cepstral mean normalization technique; feature extractor; gammatone frequency cepstral coefficients; low-noise environments; multilevel white Gaussian noise; signal per noise ratio; text independent speaker identification; Discrete cosine transforms; MATLAB; Mathematical model; Mel frequency cepstral coefficient; Signal to noise ratio; Training; Cepstral Mean Normalisation CMN; Gammatone Frequency Cepstral Coefficients GFCC; Gaussian Mixture Models GMMs; Mel Frequency Cepstral Coefficients MFCC; Robust speaker identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Computing and Systems (ICMCS), 2012 International Conference on
  • Conference_Location
    Tangier
  • Print_ISBN
    978-1-4673-1518-0
  • Type

    conf

  • DOI
    10.1109/ICMCS.2012.6320152
  • Filename
    6320152