Title :
Exploiting prosodic information for Speaker Recognition
Author :
Long, Yanhua ; Ma, Bin ; Li, Haizhou ; Guo, Wu ; Chng, Eng Siong ; Dai, Lirong
Author_Institution :
iFly Speech Lab., Univ. of Sci. & Technol. of China, Hefei
Abstract :
In this paper, we study speaker characterization using prosodic supervectors with negative within-class covariance normalization (NWCCN) projection and speaker modeling with support vector regression (SVR). We also propose a segmental weight fusion (SWF) technique that combines acoustic and prosodic subsystems effectively, despite the big performance gap between the subsystems. We validate the effectiveness of our proposed techniques on the NIST 2006 Speaker Recognition Evaluation (SRE) in comparison with other prominent solutions. The experiments have reported competitive results of 17.72% Equal Error Rate for the prosodic subsystem alone and 4.50% for the fusion system on NIST 2006 SRE core test condition.
Keywords :
regression analysis; speaker recognition; support vector machines; negative within-class covariance normalization; segmental weight fusion; speaker modeling; speaker recognition; support vector regression; Covariance matrix; Feature extraction; Information analysis; Kernel; Loudspeakers; NIST; Speaker recognition; Speech analysis; Training data; Vectors; Negative within-class covariance normalization; Segmental weight fusion; Support vector regression;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960561