DocumentCode :
638548
Title :
Using the conformal embedding analysis to compensate the channel effect in the I-vector based speaker verification system
Author :
Boulkenafet, Z. ; Bengherabi, Messaoud ; Nouali, Omar ; Cheriet, Mohamed
Author_Institution :
Centre de Dev. des Technol. Av. (CDTA) Algeria, Algeria
fYear :
2013
fDate :
5-6 Sept. 2013
Firstpage :
1
Lastpage :
8
Abstract :
The I-vector approach to speaker recognition has become the prevalent paradigm over the past 2 years, showing top performance in NIST evaluations. This success is due mainly to the capability of the I-vector to capture and compress the speaker characteristics at low dimension and the subsequent channel compensation techniques that minimize channel variability. The Linear Discriminative Analysis (LDA) followed by Within-Class Covariance Normalization (WCCN) and Cosine Similarity Scoring (CSS) represents the best compromise between performance and computational complexity. In this paper, we propose to use Conformal Embedding Analysis (CEA); a recently proposed manifold leaning technique; to tackle the main limitations of LDA which are: the Gaussian assumption on the classes distribution, the inability to preserve the local geometric relationships of the data-space and its reliance on the Euclidean distance for characterizing the relationships between feature vectors. Experimental results on the challenging MOBIO-voice database show that CEA+WCCN outperforms LDA+WCCN for both male and female speakers at all operating points.
Keywords :
computational complexity; geometry; speaker recognition; CEA+WCCN; CSS; Euclidean distance; Gaussian assumption; LDA+WCCN; MOBIO-voice database; NIST evaluations; channel effect; channel variability; classes distribution; computational complexity; conformal embedding analysis; cosine similarity scoring; i-vector based speaker verification system; linear discriminative analysis; local geometric relationships; speaker recognition; subsequent channel compensation techniques; within-class covariance normalization; Databases; Euclidean distance; Manifolds; Speaker recognition; Speech; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biometrics Special Interest Group (BIOSIG), 2013 International Conference of the
Conference_Location :
Darmstadt
Type :
conf
Filename :
6617162
Link To Document :
بازگشت