DocumentCode
638548
Title
Using the conformal embedding analysis to compensate the channel effect in the I-vector based speaker verification system
Author
Boulkenafet, Z. ; Bengherabi, Messaoud ; Nouali, Omar ; Cheriet, Mohamed
Author_Institution
Centre de Dev. des Technol. Av. (CDTA) Algeria, Algeria
fYear
2013
fDate
5-6 Sept. 2013
Firstpage
1
Lastpage
8
Abstract
The I-vector approach to speaker recognition has become the prevalent paradigm over the past 2 years, showing top performance in NIST evaluations. This success is due mainly to the capability of the I-vector to capture and compress the speaker characteristics at low dimension and the subsequent channel compensation techniques that minimize channel variability. The Linear Discriminative Analysis (LDA) followed by Within-Class Covariance Normalization (WCCN) and Cosine Similarity Scoring (CSS) represents the best compromise between performance and computational complexity. In this paper, we propose to use Conformal Embedding Analysis (CEA); a recently proposed manifold leaning technique; to tackle the main limitations of LDA which are: the Gaussian assumption on the classes distribution, the inability to preserve the local geometric relationships of the data-space and its reliance on the Euclidean distance for characterizing the relationships between feature vectors. Experimental results on the challenging MOBIO-voice database show that CEA+WCCN outperforms LDA+WCCN for both male and female speakers at all operating points.
Keywords
computational complexity; geometry; speaker recognition; CEA+WCCN; CSS; Euclidean distance; Gaussian assumption; LDA+WCCN; MOBIO-voice database; NIST evaluations; channel effect; channel variability; classes distribution; computational complexity; conformal embedding analysis; cosine similarity scoring; i-vector based speaker verification system; linear discriminative analysis; local geometric relationships; speaker recognition; subsequent channel compensation techniques; within-class covariance normalization; Databases; Euclidean distance; Manifolds; Speaker recognition; Speech; Training; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Biometrics Special Interest Group (BIOSIG), 2013 International Conference of the
Conference_Location
Darmstadt
Type
conf
Filename
6617162
Link To Document