DocumentCode :
1691820
Title :
Duration mismatch compensation for i-vector based speaker recognition systems
Author :
Hasan, T. ; Saeidi, Rahim ; Hansen, John H. L. ; van Leeuwen, David A.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Dallas, TX, USA
fYear :
2013
Firstpage :
7663
Lastpage :
7667
Abstract :
Speaker recognition systems trained on long-duration utterances are known to perform significantly worse when short test segments are encountered. To address this mismatch, we analyze the effect of duration variability on the phoneme distributions of speech utterances and on i-vector length. We demonstrate that, as utterance duration decreases, the number of detected unique phonemes and the i-vector length approach zero in a logarithmic and a non-linear fashion, respectively. Modeling duration variability as additive noise in the i-vector space, we propose three strategies for its compensation: i) multi-duration training of the Probabilistic Linear Discriminant Analysis (PLDA) model, ii) score calibration using log duration as a Quality Measure Function (QMF), and iii) multi-duration PLDA training with synthesized short-duration i-vectors. Experiments are designed around the 2012 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) protocol with varying test utterance duration. The results demonstrate the effectiveness of the proposed schemes on short-duration test conditions, especially the QMF calibration approach.
Keywords :
protocols; speaker recognition; duration mismatch compensation; i-vector; long duration utterances; multiduration training; probabilistic linear discriminant analysis; quality measure function; short test segments; speaker recognition evaluation protocol; speaker recognition systems; Acoustics; Calibration; NIST; Speaker recognition; Speech; Training; Vectors; Speaker verification; i-vector; quality measure fusion (QMF); short utterance;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639154
Filename :
6639154