• DocumentCode
    638548
  • Title

    Using the conformal embedding analysis to compensate the channel effect in the I-vector based speaker verification system

  • Author

    Boulkenafet, Z. ; Bengherabi, Messaoud ; Nouali, Omar ; Cheriet, Mohamed

  • Author_Institution
    Centre de Dev. des Technol. Av. (CDTA) Algeria, Algeria
  • fYear
    2013
  • fDate
    5-6 Sept. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The I-vector approach to speaker recognition has become the prevalent paradigm over the past 2 years, showing top performance in NIST evaluations. This success is due mainly to the capability of the I-vector to capture and compress the speaker characteristics at low dimension and the subsequent channel compensation techniques that minimize channel variability. The Linear Discriminative Analysis (LDA) followed by Within-Class Covariance Normalization (WCCN) and Cosine Similarity Scoring (CSS) represents the best compromise between performance and computational complexity. In this paper, we propose to use Conformal Embedding Analysis (CEA); a recently proposed manifold leaning technique; to tackle the main limitations of LDA which are: the Gaussian assumption on the classes distribution, the inability to preserve the local geometric relationships of the data-space and its reliance on the Euclidean distance for characterizing the relationships between feature vectors. Experimental results on the challenging MOBIO-voice database show that CEA+WCCN outperforms LDA+WCCN for both male and female speakers at all operating points.
  • Keywords
    computational complexity; geometry; speaker recognition; CEA+WCCN; CSS; Euclidean distance; Gaussian assumption; LDA+WCCN; MOBIO-voice database; NIST evaluations; channel effect; channel variability; classes distribution; computational complexity; conformal embedding analysis; cosine similarity scoring; i-vector based speaker verification system; linear discriminative analysis; local geometric relationships; speaker recognition; subsequent channel compensation techniques; within-class covariance normalization; Databases; Euclidean distance; Manifolds; Speaker recognition; Speech; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Biometrics Special Interest Group (BIOSIG), 2013 International Conference of the
  • Conference_Location
    Darmstadt
  • Type

    conf

  • Filename
    6617162