Title :
Significance of utterance partitioning in GMM-SVM based speaker verification in varying background environment
Author :
Sarkar, Santonu ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
Abstract :
This paper explores the GMM-SVM combined approach for text-independent speaker verification in rapidly varying environmental noise. For mitigating the effect of mismatched training and test utterance length and countering the impact of data-imbalance in SVM scoring, we partition each full-length enrollment utterance into a number of sub-utterances and derive a GMM supervector from each of them prior to SVM training. Experiments conducted on the NIST-SRE-2003 database demonstrate that the GMM-SVM system with partitioned utterances outperform the conventional GMM-SVM based speaker verification system. The noisy background is simulated by degrading non-overlapping segments of each training and test utterance of the NIST-SRE-2003 by additive noises (car, factory, pink & white) collected from the NOISEX-92 database, at OdB, 5dB, 7dB & 10dB SNRs respectively. An average performance improvement of 6.41% and 10.56% EER across all SNRs is observed in comparison to the traditional GMM-SVM and GMM-UBM based systems, respectively.
Keywords :
Gaussian processes; mixture models; noise (working environment); speaker recognition; support vector machines; GMM-SVM; GMM-UBM based systems; NIST-SRE-2003 database; data imbalance; environmental noise; speaker verification; test utterance length; utterance partitioning; varying background environment; Acoustics; Kernel; Noise measurement; Speech; Support vector machines; Training; Vectors; Kernel; Maximum aPosteriori Estimation; Supervector Gaussian Mixture Model; Support Vector Machine;
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
DOI :
10.1109/ICSDA.2013.6709859