Title :
An investigation on back-end for speaker recognition in multi-session enrollment
Author :
Liu, Gang ; Hasan, T. ; Boril, Hynek ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Systems (CRSS), University of Texas at Dallas, Richardson, TX, USA
Abstract :
This study explores various back-end classifiers for robust speaker recognition in multi-session enrollment, with emphasis on optimal utilization and organization of the speaker information present in the development data. Our objective is to construct a highly discriminative back-end framework by fusing several back-ends within an i-vector system framework. We demonstrate that, by using different information/data configurations and modeling schemes, the performance of the fused system can be significantly improved compared to an individual system using a single front-end and back-end. Averaged across both genders, we obtain relative improvements of 56.5% in EER and 49.4% in minDCF. The consistent performance gains obtained with the proposed strategy validate its effectiveness. This system is part of the CRSS NIST SRE 2012 submission system.
Keywords :
pattern classification; speaker recognition; CRSS NIST SRE 2012 submission system; EER improvement; back-end classifiers; data configuration; highly discriminative back-end framework; i-vector system framework; information configuration; minDCF improvement; modeling schemes; multisession enrollment; optimal speaker information organization; optimal speaker information utilization; performance gains; performance improvement; robust speaker recognition; Abstracts; Cepstral analysis; Educational institutions; Indexes; Integrated optics; Optical design; Training; GCDS; PLDA; Universal Background Support; classification algorithms; speaker recognition;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639173