Title :
Temporal decomposition: a promising approach to VQ-based speaker identification
Author :
Nguyen, Phu Chien ; Akagi, Masato ; Ho, Tu Bao
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
Abstract :
This paper proposes a new set of features, referred to as "event targets", that has been found to improve the performance of automatic speaker identification systems. The features are derived from line spectral frequency (LSF) parameters using the temporal decomposition (TD) technique. The number of feature vectors required for both the training and testing phases is reduced by one-fifth compared to that of traditional mel-frequency cepstrum coefficient (MFCC) features, while the identification results obtained are comparable or even better. This work also adds speaker recognition to the existing applications of TD, namely speech coding, speech segmentation, and speech recognition, and shows that the event targets in TD can convey information about the identity of a speaker.
Keywords :
speaker recognition; speech coding; vector quantisation; event targets; line spectral frequency parameters; mel-frequency cepstrum coefficients; speech recognition; speech segmentation; temporal decomposition; vector quantization-based speaker identification; Cepstrum; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Spectral analysis; Speech processing; Testing
Conference_Title :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221387