DocumentCode :
2009532
Title :
Factor analysis based spatial correlation modeling for speaker verification
Author :
Wang, Er-yu ; Guo, Wu ; Dai, Li-Rong ; Lee, Kong-Aik ; Ma, Bin ; Li, Hai-Zhou
Author_Institution :
iFly Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
166
Lastpage :
170
Abstract :
Gaussian mixture models (GMMs) are commonly used in text-independent speaker verification for modeling the spectral distribution of speech. Recent studies have shown the effectiveness of characterizing speaker information using the mean super-vector obtained by concatenating the mean vectors of the GMM. This paper proposes to use the spatial correlation captured by the covariance matrix of the mean super-vector for speaker verification. Factor analysis method is adopted to estimate the covariance of the super-vector. For measuring the similarity between speech utterances in terms of the spatial correlation, we propose two kernel metrics, namely, log-Euclidean inner product and Frobenius angle. For computational simplicity, we introduce an inner product classifier (IPC) with equivalent performance compared to the commonly used support vector machine (SVM). Experiments conducted on the 2006 NIST speaker recognition evaluation (SRE) dataset confirm the efficacy of the proposed factor analysis based spatial modeling technique.
Keywords :
Gaussian processes; covariance matrices; pattern classification; speaker recognition; Frobenius angle; Gaussian mixture model; covariance matrix; factor analysis; log Euclidean inner product classifier; mean super vector; spatial correlation modeling; speaker recognition evaluation; speech spectral distribution; support vector machine; text independent speaker verification; Correlation; Covariance matrix; Kernel; Loading; Measurement; Speech; Support vector machines; Frobenius angle; factor analysis; inner product classifier; log-Euclidean distance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684490
Filename :
5684490
Link To Document :
بازگشت