Title :
Robust Speaker Verification Based on Multi Stage Vector Quantization of MFCC Parameters on Narrow Bandwidth Channels
Author :
Homayounpour, M. Mehdi ; Rezaian, Iman
Author_Institution :
Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran
Abstract :
This paper presents a robust, very low bit rate client-server speaker verification system based on MFCC parameters. Two aspects are proposed and assessed: very low bit rate transmission of test utterance feature vectors from client to server, and robust speaker verification when the noise conditions of the training and test environments, including noise types and SNRs, differ and are unknown to the speaker verification system. Very low bit rate transmission of feature vectors is achieved using multi-stage vector quantization (MSVQ), which is applied on the client side to the MFCC feature vectors extracted from the speaker's utterance. This reduces the bits per frame (bpf) needed to transmit 13-dimensional MFCC feature vectors from 416 bpf to 36 bpf, i.e. 3600 bps. Robust speaker verification is achieved by training several speaker models on a limited number of noise types at different SNRs, instead of training a single speaker model on clean data. This leads to very good performance even when the test environment noise types and SNRs differ from those of the training phase. The results of the conducted experiments confirm the effectiveness of the proposed methods.
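The bit budget quoted in the abstract can be illustrated with a minimal sketch of multi-stage vector quantization, in which each stage quantizes the residual left by the previous one and only the codeword indices are transmitted. The abstract gives only the totals (416 bpf and 36 bpf); the split into six 6-bit stages, the 32 bits per raw coefficient (inferred from 416 / 13), the 100 frames per second rate (inferred from 36 bpf giving 3600 bps), the plain k-means codebook training, and the random stand-in data are all assumptions made here for illustration and are not taken from the paper.

```python
import numpy as np

DIM = 13                      # MFCC dimension stated in the abstract
BITS_PER_STAGE = (6,) * 6     # assumed split; only the 36-bit total is from the abstract
FRAMES_PER_SECOND = 100       # inferred from 36 bpf -> 3600 bps


def nearest(x, codebook):
    """Index of the closest codeword (squared Euclidean) for each row of x."""
    dist = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dist.argmin(axis=1)


def kmeans(data, k, iters=10, seed=0):
    """Plain Lloyd k-means (an assumed training method); returns a (k, dim) codebook."""
    rng = np.random.default_rng(seed)
    codebook = data[rng.choice(len(data), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = nearest(data, codebook)
        for j in range(k):
            members = data[labels == j]
            if len(members):          # keep the old codeword if a cell is empty
                codebook[j] = members.mean(axis=0)
    return codebook


def train_msvq(data, bits_per_stage):
    """Train one codebook per stage, each on the residual of the previous stage."""
    codebooks, residual = [], data.copy()
    for stage, bits in enumerate(bits_per_stage):
        codebook = kmeans(residual, 2 ** bits, seed=stage)
        codebooks.append(codebook)
        residual = residual - codebook[nearest(residual, codebook)]
    return codebooks


def encode(frames, codebooks):
    """Client side: return one index per stage for every frame."""
    residual, indices = frames.copy(), []
    for codebook in codebooks:
        idx = nearest(residual, codebook)
        indices.append(idx)
        residual = residual - codebook[idx]
    return np.stack(indices, axis=1)


def decode(indices, codebooks):
    """Server side: sum the selected codewords of all stages."""
    return sum(cb[indices[:, s]] for s, cb in enumerate(codebooks))


# Random vectors stand in for real MFCC frames (hypothetical data).
frames = np.random.default_rng(42).normal(size=(2000, DIM))
codebooks = train_msvq(frames, BITS_PER_STAGE)
indices = encode(frames, codebooks)
restored = decode(indices, codebooks)

raw_bpf = DIM * 32                    # assumed 32-bit coefficients
msvq_bpf = sum(BITS_PER_STAGE)
print("raw bits per frame:", raw_bpf)
print("MSVQ bits per frame:", msvq_bpf)
print("MSVQ bits per second:", msvq_bpf * FRAMES_PER_SECOND)
print("reconstruction MSE:", float(((frames - restored) ** 2).mean()))
```

Only the `indices` array would cross the channel in this sketch; the server is assumed to hold the same codebooks and to rebuild approximate feature vectors with `decode` before scoring them against the speaker models.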
Keywords :
cepstral analysis; client-server systems; speaker recognition; speech coding; vector quantisation; MFCC parameters; MSVQ; bits per frame reduction; low-bit rate transmission; mel-frequency cepstral coefficients; multistage vector quantization; narrow bandwidth channels; robust client-server-based speaker verification system; speaker model training; utterance feature vectors; Acoustic noise; Bandwidth; Bit rate; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speech coding; Topology; Vector quantization; Working environment noise; codebook design; multi stage vector quantization; noisy conditions; robust speaker verification; speaker verification;
Conference_Titel :
Advanced Communication Technology, 2008. ICACT 2008. 10th International Conference on
Conference_Location :
Gangwon-Do
Print_ISBN :
978-89-5519-136-3
DOI :
10.1109/ICACT.2008.4493773