DocumentCode
417104
Title
Discriminative training for speaker identification based on maximum model distance algorithm
Author
Hong, Q.Y. ; Kwong, S.
Author_Institution
Dept of Comput. Sci., City Univ. of Hong Kong, China
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
In this paper we apply the maximum model distance (MMD) training to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.
Keywords
maximum likelihood estimation; speaker recognition; ML method; MMD training; TIMIT corpus; competitive speakers; discriminative training; dissimilarities; identification performance; limited training data; maximum model distance algorithm; selection strategy; speaker identification; Computer science; Feature extraction; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Optimization methods; Speech recognition; Stochastic processes; Training data; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325913
Filename
1325913
Link To Document