Title :
Utilization of unlabeled development data for speaker verification
Author :
Gang Liu ; Chengzhu Yu ; Shokouhi, Navid ; Misra, Abhinav ; Hua Xing ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
State-of-the-art speaker verification systems model speaker identity by mapping i-Vectors onto a probabilistic linear discriminant analysis (PLDA) space. Compared to other modeling approaches (such as cosine distance scoring), PLDA provides a more efficient mechanism to separate speaker information from other sources of undesired variabilities and offers superior speaker verification performance. Unfortunately, this efficiency is obtained at the cost of a required large corpus of labeled development data, which is too expensive/unrealistic in many cases. This study investigates a potential solution to resolve this challenge by effectively utilizing unlabeled development data with universal imposter clustering. The proposed method offers +21.9% and +34.6% relative gains versus the baseline system on two public available corpora, respectively. This significant improvement proves the effectiveness of the proposed method.
Keywords :
probability; speaker recognition; PLDA; baseline system; i-Vectors; labeled development data corpus; probabilistic linear discriminant analysis; public available corpora; relative gains; speaker identity modelling; speaker information; speaker verification systems; undesired variability sources; universal imposter clustering; unlabeled development data utilization; Clustering algorithms; Clustering methods; Data models; NIST; Speaker recognition; Speech; Vectors; Clustering; PLDA; Speaker verification; Universal imposter clustering; i-Vector;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078611