DocumentCode
3327155
Title
Identifying in-set and out-of-set speakers using neighborhood information
Author
Angkititrakul, Pongtep ; Hansen, John H L
Author_Institution
Robust Speech Process. Group, Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We study the problem of identifying in-set and out-of-set speakers. The goal is to identify whether an unknown input speaker belongs to either a group of in-set speakers or an unseen out-of-set group. A state-of-the-art GMM classifier, with universal background model (UBM) and standard likelihood ratio test, is used as our baseline system. We propose an alternative hypothesis testing method that employs neighborhood information with respect to each in-set speaker model in the model space based on the Kullback-Leibier divergence. The Bayes factor is used in the verification stage (accept/reject hypothesis). We evaluate the proposed procedure on a clean CORPUS 1 set, and a noisy CORPUS 2 set which contains session-to-session variability. Experiments show an improvement in equal error rate for the system even when in-set speaker models are acoustically close in the model space, and as the size of the in-set speaker group increases.
Keywords
Bayes methods; Gaussian processes; error statistics; speaker recognition; Bayes factor; GMM classifier; Kullback-Leibier divergence; equal error rate; hypothesis testing method; in-set speaker identification; neighborhood information; out-of-set speaker identification; standard likelihood ratio test; universal background model; Bayesian methods; Error analysis; Forensics; Loudspeakers; Natural languages; Robustness; Speaker recognition; Speech processing; System testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326005
Filename
1326005
Link To Document