Title :
Uncertainty propagation in front end factor analysis for noise robust speaker recognition
Author :
Chengzhu Yu ; Gang Liu ; Seongjun Hahm ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
In this study, we explore the propagation of uncertainty in the state-of-the-art speaker recognition system. Specifically, we incorporate the uncertainty associated with observation features into the i-Vector extraction framework. To prove the concept, both the oracle and practically estimated uncertainty are used for evaluation. The oracle uncertainty is calculated assuming the knowledge of clean speech features, while the estimated uncertainties are obtained using SPLICE and joint-GMM based methods. We evaluate the proposed framework on both YOHO and NIST 2010 Speaker Recognition Evaluation (SRE) corpora by artificially introducing noise at different SNRs. In the speaker verification experiments, we confirmed that the proposed uncertainty based i-Vector extraction framework shows significant robustness against noise.
Keywords :
Gaussian noise; acoustic noise; feature extraction; mixture models; speaker recognition; Gaussian mixture model; NIST 2010; SPLICE; SRE corpora; Speaker Recognition Evaluation; YOHO; front end factor analysis; i-Vector extraction framework; joint-GMM based methods; noise robust speaker recognition; observation features; oracle uncertainty; uncertainty propagation; Estimation; Feature extraction; NIST; Noise; Speaker recognition; Speech; Uncertainty; i-Vector; robust speaker recognition; uncertainty propagation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854356