Title :
Discriminative training for speaker identification based on maximum model distance algorithm
Author :
Hong, Q.Y. ; Kwong, S.
Author_Institution :
Dept of Comput. Sci., City Univ. of Hong Kong, China
Abstract :
In this paper we apply the maximum model distance (MMD) training to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.
Keywords :
maximum likelihood estimation; speaker recognition; ML method; MMD training; TIMIT corpus; competitive speakers; discriminative training; dissimilarities; identification performance; limited training data; maximum model distance algorithm; selection strategy; speaker identification; Computer science; Feature extraction; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Optimization methods; Speech recognition; Stochastic processes; Training data; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325913