Title :
A model distance measure for talker clustering and identification
Author :
Foote, J.T. ; Silverman, H.F.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper describes methods of talker clustering and identification based on a “distance” metric between discrete HMM output probabilities. Output probabilities are derived on a tree-based MMI partition of the feature space, rather than the usual vector quantization. The information divergence (relative entropy) between speaker-dependent models is used as a quantitative measure of how much a given talker differs from another talker. An immediate application is talker identification: an unknown speaker may be identified by finding the closest speaker-dependent reference model to a model trained on the unknown speaker´s data. Another application is to cluster similar talkers into a group; these may be used to train a HMM model that represents that talker better than a more general model. It is shown that using the model “nearest” a novel talker enhances the performance of a talker-independent speech recognition system
Keywords :
entropy; hidden Markov models; probability; speaker recognition; speech coding; vector quantisation; HMM model; discrete HMM output probabilities; feature space; information divergence; model distance measure; mutual information; relative entropy; speaker-dependent reference model; talker clustering; talker identification; talker-independent speech recognition system; tree based vector quantization; tree-based MMI partition; Acoustic emission; Decision trees; Entropy; Hidden Markov models; Parameter estimation; Probability distribution; Q measurement; Speech recognition; Vector quantization; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389292