Title :
I-matrix for text-independent speaker recognition
Author :
Liang He ; Jia Liu
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
This paper proposes an i-matrix for text-independent speaker recognition. The framework of the proposed i-matrix is similar to an i-vector. However, the presented method takes short-time cepstral feature matrices as inputs to explore both cepstral feature distribution and temporal information for the recognition task in the phase of statistical modeling. In the i-matrix, the variability of an utterance is constrained by two subspaces U and V, which are estimated by an iterative method on a large database. When U and V are well built, each utterance is represented by an i-matrix. Decision function is a cosine kernel. Experiments were carried out on the tel-tel-English condition of NIST SRE 2008 core task. Compared with an i-vector-LDA, the average EER and MDCF of an i-matrix-LDA showed a relative decrease of 4.82% and 5.12% respectively.
Keywords :
cepstral analysis; iterative methods; matrix algebra; speaker recognition; statistical analysis; EER; MDCF; NIST SRE 2008 core task; cepstral feature distribution; cosine kernel; i-matrix-LDA; i-vector; iterative method; short-time cepstral feature matrices; statistical modeling; tel-tel-English condition; temporal information; text-independent speaker recognition; Cepstral analysis; Covariance matrices; Equations; Feature extraction; NIST; Speaker recognition; Vectors; Gaussian mixturemodels; I-vector; i-matrix; text-independent speaker recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639059