• DocumentCode
    1691658
  • Title

    Diffusion maps for PLDA-based speaker verification

  • Author

    Barkan, Oren ; Aronowitz, Hagai

  • Author_Institution
    IBM Res., Haifa, Israel
  • fYear
    2013
  • Firstpage
    7639
  • Lastpage
    7643
  • Abstract
    During the last few years, i-vectors have become an important component in most state-of-the-art speaker recognition systems. I-vector extraction is based on an assumption that GMM supervectors reside on a low dimensional space, which is modeled using Factor Analysis. In this paper we replace the above assumption with an assumption that the GMM supervectors reside on a low dimensional manifold and propose to use Diffusion Maps to learn that manifold. The learnt manifold implies a mapping of spoken sessions into a modified i-vector space which we call d-vector space. D-vectors can further be processed using standard techniques such as LDA, WCCN, cosine distance scoring or Probabilistic Linear Discriminant Analysis (PLDA). We demonstrate the usefulness of our approach on the telephone core conditions of NIST 2010, and obtain significant error reduction.
  • Keywords
    Gaussian processes; speaker recognition; vectors; GMM supervectors; NIST 2010; PLDA-based speaker verification; WCCN; cosine distance scoring; d-vector space; diffusion maps; error reduction; factor analysis; i-vector extraction; low dimensional space; probabilistic linear discriminant analysis; speaker recognition systems; spoken sessions; standard techniques; telephone core conditions; Diffusion processes; Feature extraction; Harmonic analysis; Manifolds; NIST; Speaker recognition; Vectors; Diffusion Maps; Non-linear dimensionality reduction; Pattern recognition; Speaker verification; ivectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639149
  • Filename
    6639149