Robust Speaker Verification Based on Multi Stage Vector Quantization of MFCC Parameters on Narrow Bandwidth Channels

Author

Homayounpour, M. Mehdi ; Rezaian, Iman

Author_Institution

Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran

Volume

1

fYear

2008

fDate

17-20 Feb. 2008

Firstpage

336

Lastpage

340

Abstract

This paper presents a robust, very low bit rate client-server speaker verification system based on MFCC parameters. Two aspects are proposed and assessed: very low bit rate transmission of test-utterance feature vectors from client to server, and robust speaker verification when the noise conditions of the training and test environments, including noise types and SNRs, differ and are unknown to the system. Very low bit rate transmission of feature vectors is achieved using multi-stage vector quantization (MSVQ), which is applied on the client side to the MFCC feature vectors extracted from the speaker's utterance. This reduces the bits per frame (bpf) from 416 bpf for 13-dimensional MFCC feature vectors to 36 bpf, i.e. 3600 bps. Robust speaker verification is achieved by training several speaker models on a limited number of noise types at different SNRs, instead of training a single speaker model on clean data. This yields very good performance even when the test-environment noise types and SNRs differ from those of the training phase. The experimental results confirm the effectiveness of the proposed methods.
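
The bit-rate figures and the multi-stage quantization idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The codebooks below are hypothetical toy values, the function names are invented for illustration, and the 32 bits per coefficient and 100 frames per second are inferred from the abstract's own numbers (416 / 13 and 3600 / 36) rather than stated in it. Each stage quantizes the residual left by the previous stage, and only the codeword indices would be transmitted.

```python
# Toy multi-stage vector quantization (MSVQ) sketch in pure Python.
# All codebooks and names here are illustrative assumptions, not the paper's.

def nearest(codebook, vec):
    """Return the index of the codeword closest to vec (squared Euclidean distance)."""
    best_idx, best_dist = 0, float("inf")
    for idx, codeword in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(vec, codeword))
        if dist < best_dist:
            best_idx, best_dist = idx, dist
    return best_idx

def msvq_encode(stages, vec):
    """Quantize vec stage by stage; each stage encodes the previous stage's residual."""
    residual = list(vec)
    indices = []
    for codebook in stages:
        idx = nearest(codebook, residual)
        indices.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return indices

def msvq_decode(stages, indices):
    """Reconstruct the vector as the sum of the selected codewords."""
    out = [0.0] * len(stages[0][0])
    for codebook, idx in zip(stages, indices):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out

def bits_per_frame(stages):
    """Bits needed to index one codeword in every stage."""
    return sum((len(codebook) - 1).bit_length() for codebook in stages)

# Arithmetic behind the abstract's figures (per-coefficient width and frame
# rate are inferred assumptions, see the lead-in).
RAW_BPF = 13 * 32          # 13 MFCCs at an assumed 32 bits each = 416 bpf
MSVQ_BPS = 36 * 100        # 36 bpf at an assumed 100 frames/s = 3600 bps

# Hypothetical two-stage codebooks for 2-dimensional vectors.
TOY_STAGES = [
    [[0.0, 0.0], [4.0, 0.0], [0.0, 4.0], [4.0, 4.0]],   # coarse stage, 2 bits
    [[0.5, 0.5], [-0.5, -0.5]],                          # refinement stage, 1 bit
]

if __name__ == "__main__":
    codes = msvq_encode(TOY_STAGES, [4.4, 0.6])
    print(codes, msvq_decode(TOY_STAGES, codes), bits_per_frame(TOY_STAGES))
```

In this toy setup the client would send 3 bits per frame instead of the raw vector; how the 36 bits are split across stages in the paper is not given in the abstract.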

Keywords

cepstral analysis; client-server systems; speaker recognition; speech coding; vector quantisation; MFCC parameters; MSVQ; bits per frame reduction; low-bit rate transmission; mel-frequency cepstral coefficients; multistage vector quantization; narrow bandwidth channels; robust client-server-based speaker verification system; speaker model training; utterance feature vectors; Acoustic noise; Bandwidth; Bit rate; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speech coding; Topology; Vector quantization; Working environment noise; codebook design; multi stage vector quantization; noisy conditions; robust speaker verification; speaker verification;

fLanguage

English

Publisher

IEEE

Conference_Titel

Advanced Communication Technology, 2008. ICACT 2008. 10th International Conference on

Conference_Location

Gangwon-Do

ISSN

1738-9445

Print_ISBN

978-89-5519-136-3

Type

conf

DOI

10.1109/ICACT.2008.4493773

Filename

4493773