• DocumentCode
    134353
  • Title

    An iterative framework for unsupervised learning in the PLDA based speaker verification

  • Author

    Wenbo Liu ; Zhiding Yu ; Ming Li

  • Author_Institution
    SYSU-CMU Joint Inst. of Eng., Sun Yat-Sen Univ., Guangzhou, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    78
  • Lastpage
    82
  • Abstract
    We present an iterative and unsupervised learning approach for the speaker verification task. In conventional speaker verification, Probabilistic Linear Discriminant Analysis (PLDA) has been widely used as a supervised backend. However, PLDA requires fully labeled training data, which is often difficult to obtain in reality. To automatically retrieve the speaker labels of unlabeled training data, we propose to use the Affinity Propagation (AP) - a clustering method that takes pairwise data similarity as input - to generate the labels for the PLDA modeling. We further propose an iterative refinement strategy that incrementally updates the similarity input of the AP clustering with the previous iteration´s PLDA scoring outputs. Moreover, we evaluate the performance of different PLDA scoring methods for the multiple enrollment task and show that the generalized hypothesis testing achieves the best results. Experiments were conducted on the NIST SRE 2010 and the 2014 i-vector challenge database. The results show that our proposed iterative and unsupervised PLDA model learning approach outperformed the cosine similarity baseline by 35% relatively.
  • Keywords
    iterative methods; pattern clustering; probability; speaker recognition; unsupervised learning; PLDA scoring method; affinity propagation; clustering method; iterative refinement strategy; pairwise data similarity; probabilistic linear discriminant analysis; speaker verification; unsupervised learning; Data models; Databases; NIST; Probabilistic logic; Speaker recognition; Testing; Vectors; Affinity Propagation; I-Vector; Probabilistic Linear Discriminant Analysis; Speaker Verification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936726
  • Filename
    6936726