Title :
NAP and WCCN: Comparison of Approaches using MLLR-SVM Speaker Verification System
Author :
Kajarekar, S.S. ; Stolche, A.
Author_Institution :
SRI Int., Menlo Park, CA, USA
Abstract :
We compare two recently proposed techniques, within class covariance normalization (WCCN) (A. Hatch et al., 2006) and nuisance attribute projection (NAP) (A. Solomonoff et al., 2005), for intersession variability compensation in speaker verification. The comparison is performed using an MLLR-SVM speaker verification system. Both techniques model intersession variability using a within-speaker covariance matrix (WSCM). However, they manipulate eigenvectors of this matrix differently. We compare them on the 2005 and 2006 NIST speaker recognition evaluation (SRE) task. Results show that WCCN is more sensitive to the choice of background speakers and NAP is more sensitive to the choice of data for WSCM estimation. WCCN gives the best performance on 2005 SRE. On 2006 SRE, both techniques give similar performance under matched conditions. Further experiments with a simple combination of these techniques show slight improvements in the best performance of either technique. Overall results show that an MLLR-SVM system with either NAP or WCCN performs comparably to the best single systems in the 2006 NIST SRE.
Keywords :
covariance matrices; speaker recognition; MLLR-SVM speaker verification system; NAP; WCCN; nuisance attribute projection; speaker recognition evaluation; within class covariance normalization; within-speaker covariance matrix; Computer science; Cost function; Covariance matrix; Maximum likelihood linear regression; NIST; Speaker recognition; Statistics; Strontium; Support vector machines; Testing; Intersession variability; MLLR transforms; SVM; Speaker recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366896