DocumentCode :
134353
Title :
An iterative framework for unsupervised learning in the PLDA based speaker verification
Author :
Wenbo Liu ; Zhiding Yu ; Ming Li
Author_Institution :
SYSU-CMU Joint Inst. of Eng., Sun Yat-Sen Univ., Guangzhou, China
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
78
Lastpage :
82
Abstract :
We present an iterative, unsupervised learning approach for the speaker verification task. In conventional speaker verification, Probabilistic Linear Discriminant Analysis (PLDA) is widely used as a supervised backend. However, PLDA requires fully labeled training data, which is often difficult to obtain in practice. To automatically recover the speaker labels of unlabeled training data, we propose using Affinity Propagation (AP), a clustering method that takes pairwise data similarities as input, to generate the labels for PLDA modeling. We further propose an iterative refinement strategy that incrementally updates the similarity input of the AP clustering with the previous iteration's PLDA scoring outputs. Moreover, we evaluate the performance of different PLDA scoring methods on the multiple-enrollment task and show that generalized hypothesis testing achieves the best results. Experiments were conducted on the NIST SRE 2010 and the 2014 i-vector challenge databases. The results show that the proposed iterative, unsupervised PLDA model learning approach outperforms the cosine similarity baseline by a relative 35%.
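The loop described in the abstract (cluster with AP, train a backend on the cluster labels, rescore, recluster) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: PLDA is replaced by a much simpler stand-in (whitening by the pooled within-cluster covariance followed by cosine scoring), the AP preference is set to the median off-diagonal similarity, and the function names, `n_rounds`, `damping`, and `eps` are all hypothetical choices made for this example.

```python
import numpy as np


def cosine_scores(X):
    """Pairwise cosine similarity between the rows of X."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    return Xn @ Xn.T


def affinity_propagation(S, damping=0.5, max_iter=200):
    """Plain Affinity Propagation on a precomputed similarity matrix S."""
    n = S.shape[0]
    S = S.copy()
    # Assumption: preference = median of the off-diagonal similarities.
    np.fill_diagonal(S, np.median(S[~np.eye(n, dtype=bool)]))
    R = np.zeros((n, n))  # responsibilities
    A = np.zeros((n, n))  # availabilities
    rows = np.arange(n)
    for _ in range(max_iter):
        # r(i,k) = s(i,k) - max_{k' != k} [a(i,k') + s(i,k')]
        AS = A + S
        idx = AS.argmax(axis=1)
        first = AS[rows, idx]
        AS[rows, idx] = -np.inf
        second = AS.max(axis=1)
        R_new = S - first[:, None]
        R_new[rows, idx] = S[rows, idx] - second
        R = damping * R + (1 - damping) * R_new
        # a(i,k) = min(0, r(k,k) + sum_{i' not in {i,k}} max(0, r(i',k)))
        # a(k,k) = sum_{i' != k} max(0, r(i',k))
        Rp = np.maximum(R, 0)
        np.fill_diagonal(Rp, R.diagonal())
        A_new = Rp.sum(axis=0)[None, :] - Rp
        diag = A_new.diagonal().copy()
        A_new = np.minimum(A_new, 0)
        np.fill_diagonal(A_new, diag)
        A = damping * A + (1 - damping) * A_new
    exemplars = np.flatnonzero((A + R).diagonal() > 0)
    if exemplars.size == 0:
        # No exemplar emerged: fall back to one cluster per point.
        return np.arange(n)
    labels = S[:, exemplars].argmax(axis=1)
    labels[exemplars] = np.arange(exemplars.size)
    return labels


def whiten_within(X, labels, eps=1e-6):
    """Stand-in for PLDA training: whiten by the pooled within-cluster covariance."""
    d = X.shape[1]
    W = np.zeros((d, d))
    for c in np.unique(labels):
        D = X[labels == c] - X[labels == c].mean(axis=0)
        W += D.T @ D
    W = W / len(X) + eps * np.eye(d)  # eps keeps W invertible
    B = np.linalg.cholesky(np.linalg.inv(W))  # B @ B.T == inv(W)
    return X @ B


def iterative_labels(X, n_rounds=3):
    """Alternate AP clustering and backend rescoring for n_rounds refinements."""
    labels = affinity_propagation(cosine_scores(X))
    for _ in range(n_rounds):
        # Scores from the backend trained on the previous labels feed the next AP run.
        S = cosine_scores(whiten_within(X, labels))
        labels = affinity_propagation(S)
    return labels
```

Calling `iterative_labels(X)` on an array of i-vectors (one per row) returns one integer cluster label per row. In the setting the abstract describes, the rescoring step would use the trained PLDA model's scores rather than the whitened cosine used here.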
Keywords :
iterative methods; pattern clustering; probability; speaker recognition; unsupervised learning; PLDA scoring method; affinity propagation; clustering method; iterative refinement strategy; pairwise data similarity; probabilistic linear discriminant analysis; speaker verification; unsupervised learning; Data models; Databases; NIST; Probabilistic logic; Speaker recognition; Testing; Vectors; Affinity Propagation; I-Vector; Probabilistic Linear Discriminant Analysis; Speaker Verification;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936726
Filename :
6936726