DocumentCode :
178920
Title :
Type-2 Fuzzy GMMs for Robust Text-Independent Speaker Verification in Noisy Environments
Author :
Pinheiro, H.N.B. ; Tsang Ing Ren ; Cavalcanti, G.D.C. ; Tsang Ing Jyh ; Sijbers, J.
Author_Institution :
Centro de Inf. (CIn), Univ. Fed. de Pernambuco (UFPE), Recife, Brazil
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
4531
Lastpage :
4536
Abstract :
This paper proposes the use of the type-2 fuzzy GMM (T2FGMM) framework in order to improve the verification rates of the standard GMM-UBM text-independent speaker verification system in noisy environments. Based on type-2 fuzzy sets, the T2FGMM framework describes GMMs with uncertain parameters and provides likelihood intervals for them. The proposed method (T2F-GMM-UBM) estimates the parameter intervals using the noisy speeches from the speakers and the Bayesian estimation used in the standard GMM-UBM system. The proposed method was evaluated using the MIT Device Speaker Verification Corpus (MITDSVC) which contains speeches from 48 speakers recorded in three different locations: a quiet office, a mildly noisy lobby, and a busy street intersection. The Equal Error Rate (EER) was computed for each speaker and the mean and standard deviation were analyzed. Although the proposed method did not achieve better performance in the office location, significant improvements were achieved in both lobby and street intersection locations. The improvement in the lobby was 14.21% while in the street intersection location was 10.47%. The left tailed paired Rank Sign Wilcox on Test was also performed in both locations and the p-values found were 0.0127 and 0.0230, respectively. The proposed method proved to have better performance in noisy environments compared to the standard GMM-UBM system.
Keywords :
Gaussian processes; mixture models; speaker recognition; text analysis; Bayesian estimation; EER; MIT device speaker verification corpus; MITDSVC; Rank Sign Wilcox; T2FGMM framework; equal error rate; noisy environments; noisy speech; robust text-independent speaker verification; street intersection location; type-2 fuzzy GMM; Adaptation models; Gaussian distribution; Mathematical model; Noise measurement; Speech; Standards; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.775
Filename :
6977488
Link To Document :
بازگشت