Title :
Speaker recognition employing waveform based signal representation in nonorthogonal multiple transform domains
Author :
Mikhael, Wasfy B. ; Premakanthan, Pravinkumar
Author_Institution :
Dept. of Electr. Eng., Univ. of Central Florida, Orlando, FL, USA
Abstract :
Automatic speaker recognition (ASR) technique employing split vector quantized speech representation in multiple transform domains is presented. In this approach, a set of appropriate transform domains are selected and a vector quantized codebook is generated in each of these selected transform domains for the signal waveform. For each speaker, each signal vector is represented from the codebooks that yield the highest accuracy of representation. The algorithm is given and a performance measure is developed and used to evaluate the algorithm performance. Improved speech recognition accuracy was consistently obtained employing the proposed technique in comparison with vector quantization employing single transform VQ representations. Sample results for 10 speakers are presented to illustrate the considerable performance improvement for ASR
Keywords :
signal representation; speaker recognition; speech coding; transform coding; transforms; vector quantisation; ASR; VQ; algorithm performance; automatic speaker recognition; codebooks; nonorthogonal multiple transform domains; performance measure; representation accuracy; signal vector; signal waveform; single transform VQ representation; speech recognition accuracy; split vector quantized speech representation; vector quantization; vector quantized codebook; waveform based signal representation; Automatic speech recognition; Compaction; Computational complexity; Loudspeakers; Signal generators; Signal representations; Spatial databases; Speaker recognition; Speech recognition; Vector quantization;
Conference_Titel :
Circuits and Systems, 2002. ISCAS 2002. IEEE International Symposium on
Conference_Location :
Phoenix-Scottsdale, AZ
Print_ISBN :
0-7803-7448-7
DOI :
10.1109/ISCAS.2002.1011426