مرکز منطقه ای اطلاع رساني علوم و فناوري - Significance of utterance partitioning in GMM-SVM based speaker verification in varying background environment

DocumentCode :

672834

Title :

Significance of utterance partitioning in GMM-SVM based speaker verification in varying background environment

Author :

Sarkar, Santonu ; Rao, K. Sreenivasa

Author_Institution :

Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India

fYear :

2013

fDate :

25-27 Nov. 2013

Firstpage :

Lastpage :

Abstract :

This paper explores the GMM-SVM combined approach for text-independent speaker verification in rapidly varying environmental noise. For mitigating the effect of mismatched training and test utterance length and countering the impact of data-imbalance in SVM scoring, we partition each full-length enrollment utterance into a number of sub-utterances and derive a GMM supervector from each of them prior to SVM training. Experiments conducted on the NIST-SRE-2003 database demonstrate that the GMM-SVM system with partitioned utterances outperform the conventional GMM-SVM based speaker verification system. The noisy background is simulated by degrading non-overlapping segments of each training and test utterance of the NIST-SRE-2003 by additive noises (car, factory, pink & white) collected from the NOISEX-92 database, at OdB, 5dB, 7dB & 10dB SNRs respectively. An average performance improvement of 6.41% and 10.56% EER across all SNRs is observed in comparison to the traditional GMM-SVM and GMM-UBM based systems, respectively.

Keywords :

Gaussian processes; mixture models; noise (working environment); speaker recognition; support vector machines; GMM-SVM; GMM-UBM based systems; NIST-SRE-2003 database; data imbalance; environmental noise; speaker verification; test utterance length; utterance partitioning; varying background environment; Acoustics; Kernel; Noise measurement; Speech; Support vector machines; Training; Vectors; Kernel; Maximum aPosteriori Estimation; Supervector Gaussian Mixture Model; Support Vector Machine;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference

Conference_Location :

Gurgaon

Type :

conf

DOI :

10.1109/ICSDA.2013.6709859

Filename :

6709859

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=672834