Title :
Experiments in Speaker Adaptation for Factor Analysis Based Speaker Verification
Author :
Yin, Shou-Chun ; Kenny, Patrick ; Rose, Richard
Author_Institution :
Centre de Recherche Informatoque de Montreal, Que.
Abstract :
This paper presents methods for supervised and unsupervised speaker adaptation of Gaussian mixture speaker models in text-independent speaker verification. The methods are based on an approach which is able to decompose speaker and channel variability so that progressive updating of speaker models can be performed while minimizing the influence of the channel variability associated with the adaptation utterances. This approach relies on a joint factor analysis model of intrinsic speaker variability and session variability where inter-session variation is assumed to result primarily from the effects of the channel. These adaptation methods have been evaluated under the adaptation paradigm defined under the NIST 2005 speaker recognition evaluation plan which is based on conversational telephone speech. It was found that when both target speaker model training and speaker verification trials were performed using a five minute excerpt from a single conversation, an equal error rate (EER) of 4.5% and minimum detection cost function (DCF) of 0.013 were obtained when performing unsupervised speaker adaptation during evaluation. It will be shown that this performance is comparable to that obtained by state of the art speaker verification systems that rely on a larger set of features and are trained from as many as eight conversations from the target speaker
Keywords :
Gaussian processes; error statistics; speaker recognition; DCF; EER; GMM; Gaussian mixture model; NIST 2005 speaker recognition evaluation; channel variability; conversational telephone speech; equal error rate; factor analysis; minimum detection cost function; speaker adaptation; text-independent speaker verification; Adaptation model; Aging; Cost function; Error analysis; NIST; Performance evaluation; Speaker recognition; Speech analysis; Telephony; Testing;
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
DOI :
10.1109/ODYSSEY.2006.248130