DocumentCode :
417104
Title :
Discriminative training for speaker identification based on maximum model distance algorithm
Author :
Hong, Q.Y. ; Kwong, S.
Author_Institution :
Dept of Comput. Sci., City Univ. of Hong Kong, China
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In this paper we apply the maximum model distance (MMD) training to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.
Keywords :
maximum likelihood estimation; speaker recognition; ML method; MMD training; TIMIT corpus; competitive speakers; discriminative training; dissimilarities; identification performance; limited training data; maximum model distance algorithm; selection strategy; speaker identification; Computer science; Feature extraction; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Optimization methods; Speech recognition; Stochastic processes; Training data; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325913
Filename :
1325913
Link To Document :
بازگشت