مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminative training for speaker identification based on maximum model distance algorithm

DocumentCode :

417104

Title :

Discriminative training for speaker identification based on maximum model distance algorithm

Author :

Hong, Q.Y. ; Kwong, S.

Author_Institution :

Dept of Comput. Sci., City Univ. of Hong Kong, China

Volume :

fYear :

2004

fDate :

17-21 May 2004

Abstract :

In this paper we apply the maximum model distance (MMD) training to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.

Keywords :

maximum likelihood estimation; speaker recognition; ML method; MMD training; TIMIT corpus; competitive speakers; discriminative training; dissimilarities; identification performance; limited training data; maximum model distance algorithm; selection strategy; speaker identification; Computer science; Feature extraction; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Optimization methods; Speech recognition; Stochastic processes; Training data; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-8484-9

Type :

conf

DOI :

10.1109/ICASSP.2004.1325913

Filename :

1325913

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=417104