DocumentCode
2629713
Title
Two-Stage Vector Quantization Based Multi-band Models for Speaker Identification
Author
Chen, Wan-Chen ; Hsieh, Ching-Tang ; Hsu, Chih-Hsu
Author_Institution
St. John´´s Univ., Taipei
fYear
2007
fDate
21-23 Nov. 2007
Firstpage
2336
Lastpage
2341
Abstract
This paper presents an effective method for speaker identification. Based on the wavelet transform, the input speech signal is decomposed into several frequency bands, and then the linear predictive cepstral coefficients (LPCC) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features in order to provide similar parameter statistics in all acoustic environments. We propose a multi-band 2-stage vector quantization (VQ) as the recognition model in which different 2-stage VQ classifiers are applied independently to each band and the errors of all 2-stage VQ classifiers are combined to yield total error and a global recognition decision. The experimental results show that the proposed method gives better performance than other recognition models proposed previously in both clean and noisy environments.
Keywords
speaker recognition; vector quantisation; wavelet transforms; cepstral mean normalization technique; input speech signal; linear predictive cepstral coefficients; multi-band models; speaker identification; two-stage vector quantization; wavelet transform; Acoustic noise; Cepstral analysis; Feature extraction; Information technology; Loudspeakers; Mel frequency cepstral coefficient; Speech coding; Speech enhancement; Vector quantization; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Convergence Information Technology, 2007. International Conference on
Conference_Location
Gyeongju
Print_ISBN
0-7695-3038-9
Type
conf
DOI
10.1109/ICCIT.2007.41
Filename
4420601
Link To Document