• DocumentCode
    126909
  • Title

    Mel frequency cepstral coefficients based text independent Automatic Speaker Recognition using matlab

  • Author

    Singh, A.K. ; Singh, Rajdeep ; Dwivedi, Atul

  • Author_Institution
    Dept. of Electr. Eng., Shiv Nadar Univ., Gautam Budh, India
  • fYear
    2014
  • fDate
    6-8 Feb. 2014
  • Firstpage
    524
  • Lastpage
    527
  • Abstract
    Speech feature extraction is the most significant step in any Automatic speaker recognition system. In the last 60 years a lot of research has gone into parametric representation of these speech features. Several techniques are currently being used for Automatic Speaker Recognition. Yet Automatic Speaker Recognition still remains a confront mainly due to variations in speaker´s vocal tract with time and health, varying environmental conditions, disparities in the behavior and quality of speech recorders etc. MFCC is a extensively used technique in Automatic speaker recognition. In this paper the performance of MFCC technique was evaluated in a quiet environment. A speaker database containing 30 male and 30 female speakers was created. Two separate experiments were conducted for the performance evaluation of MFCC technique when applied to K means clustering. In the first case the speech features were directly matched. In the second case a VQ codebook was created by clustering the training features of these 60 speakers. A distortion easure based on the minimum Euclidean distance was used for speaker recognition. The failure rate of speaker recognition in first ase was found to be was found to be 10% while in the second case as found to be 14%. Matlab-7.10.0 was used for this study.
  • Keywords
    feature extraction; pattern clustering; speaker recognition; K means clustering; MFCC technique; Matlab; Mel frequency cepstral coefficients; VQ codebook; distortion measure; minimum Euclidean distance; parametric representation; speech feature extraction; speech recorders; text independent automatic speaker recognition; Cepstrum; Indexes; Mel frequency cepstral coefficient; Speech; Vectors; K means; Mel Frequency Cepstral Coefficients (MFCC); Mel Window Design; Vector Quantization; feature vector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Optimization, Reliabilty, and Information Technology (ICROIT), 2014 International Conference on
  • Conference_Location
    Faridabad
  • Print_ISBN
    978-1-4799-3958-9
  • Type

    conf

  • DOI
    10.1109/ICROIT.2014.6798379
  • Filename
    6798379