DocumentCode :
284780
Title :
Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs
Author :
Matsui, Tomoko ; Furui, Sadaoki
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
Volume :
2
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
157
Abstract :
A VQ (vector quantization)-distortion-based speaker recognition method and discrete/continuous ergodic HMM (hidden Markov model)-based ones are compared, especially from the viewpoint of robustness against utterance variations. It is shown that a continuous ergodic HMM is far superior to a discrete ergodic HMM. It is also shown that the information on transitions between different states is ineffective for text-independent speaker recognition. Therefore, the speaker identification rates using a continuous ergodic HMM are strongly correlated with the total number of mixtures irrespective of the number of states. It is also found that, for continuous ergodic HMM-based speaker recognition, the distortion-intersection measure (DIM), which was introduced as a VQ-distortion measure to increase the robustness against utterance variations, is effective
Keywords :
hidden Markov models; speech coding; speech recognition; vector quantisation; VQ distortion method; continuous ergodic HMM; discrete ergodic HMM; distortion-intersection measure; hidden Markov model; speaker identification rates; text-independent speaker recognition; utterance variations; vector quantization; Books; Cepstral analysis; Databases; Hidden Markov models; Humans; Laboratories; Robustness; Speaker recognition; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.226096
Filename :
226096
Link To Document :
بازگشت