Title :
Applications of MFCC and Vector Quantization in speaker recognition
Author :
Gupta, Arpan ; Gupta, H.
Author_Institution :
Dept. Of Electron. & Commun. Eng., Jaypee Inst. of Inf. Technol., Noida, India
Abstract :
In speaker recognition, most of the computation originates from the likelihood computations between feature vectors of the unknown speaker and the models in the database. In this paper, we concentrate on optimizing Mel Frequency Cepstral Coefficient (MFCC) for feature extraction and Vector Quantization (VQ) for feature modeling. We reduce the number of feature vectors by pre-quantizing the test sequence prior to matching, and number of speakers by ruling out unlikely speakers during recognition process. The two important parameters, Recognition rate and minimized Average Distance between the samples, depends on the codebook size and the number of cepstral coefficients. We find, that this approach yields significant performance when the changes are made in the number of mfcc´s and the codebook size. Recognition rate is found to reach upto 89% and the distortion reduced upto 69%.
Keywords :
cepstral analysis; feature extraction; pattern matching; sequences; speaker recognition; speech coding; vector quantisation; MFCC; average distance minimization; codebook size; feature extraction; feature modeling; feature vector; likelihood computation; mel frequency cepstral coefficient; recognition rate; speaker matching; speaker recognition; test sequence prequantization; vector quantization; Feature extraction; Mel frequency cepstral coefficient; Signal processing algorithms; Speech; Training; Vector quantization; Vectors; MFCC; VQ; cepstral coefficients; feature extraction; feature modeling; feature vector;
Conference_Titel :
Intelligent Systems and Signal Processing (ISSP), 2013 International Conference on
Conference_Location :
Gujarat
Print_ISBN :
978-1-4799-0316-0
DOI :
10.1109/ISSP.2013.6526896