• DocumentCode
    3303739
  • Title

    Robust Speaker Verification Based on Multi Stage Vector Quantization of MFCC Parameters on Narrow Bandwidth Channels

  • Author

    Homayounpour, M. Mehdi ; Rezaian, Iman

  • Author_Institution
    Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran
  • Volume
    1
  • fYear
    2008
  • fDate
    17-20 Feb. 2008
  • Firstpage
    336
  • Lastpage
    340
  • Abstract
    This paper presents a very low bit rate and robust client-server-based speaker verification system using MFCC parameters. Two aspects are proposed and assessed including very low bit rate transmission of test utterance feature vectors from client to server, and robust speaker verification in situations where training and test environment noise conditions including noise types and SNRs are different and unknown for speaker verification system. Very low bit rate transmission of feature vectors are achieved using multi stage vector quantization technique (MSVQ). This technique is used for quantization of MFCC feature vectors obtained from speaker´s utterance in client side. This leads to significant bits per frame (bpf) reduction from 416 bpf for transmission of 13 dimensional MFCC feature vectors to 36 bpf i.e. 3600 bps. Robust speaker verification is achieved when instead of training only a speaker model using clean data, several speaker models are trained using a limited number of noises in different SNRs. This leads to very good performances even for conditions where test environment noise types and SNRs are different from those of training phase. The results of conducted experiments approve the effectiveness of the proposed methods.
  • Keywords
    cepstral analysis; client-server systems; speaker recognition; speech coding; vector quantisation; MFCC parameters; MSVQ; bits per frame reduction; low-bit rate transmission; mel-frequency cepstral coefficients; multistage vector quantization; narrow bandwidth channels; robust client-server-based speaker verification system; speaker model training; utterance feature vectors; Acoustic noise; Bandwidth; Bit rate; Loudspeakers; Mel frequency cepstral coefficient; Noise robustness; Speech coding; Topology; Vector quantization; Working environment noise; codebook design; multi stage vector quantization; noisy conditions; robust speaker verification; speaker verification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Communication Technology, 2008. ICACT 2008. 10th International Conference on
  • Conference_Location
    Gangwon-Do
  • ISSN
    1738-9445
  • Print_ISBN
    978-89-5519-136-3
  • Type

    conf

  • DOI
    10.1109/ICACT.2008.4493773
  • Filename
    4493773