DocumentCode
179022
Title
Supervised domain adaptation for I-vector based speaker recognition
Author
Garcia-Romero, Daniel ; McCree, Alan
Author_Institution
Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
4047
Lastpage
4051
Abstract
In this paper, we present a comprehensive study on supervised domain adaptation of PLDA based i-vector speaker recognition systems. After describing the system parameters subject to adaptation, we study the impact of their adaptation on recognition performance. Using the recently designed domain adaptation challenge, we observe that the adaptation of the PLDA parameters (i.e. across-class and within-class co variances) produces the largest gains. Nonetheless, length-normalization is also important; whereas using an indomani UBM and T matrix is not crucial. For the PLDA adaptation, we compare four approaches. Three of them are proposed in this work, and a fourth one was previously published. Overall, the four techniques are successful at leveraging varying amounts of labeled in-domain data and their performance is quite similar. However, our approaches are less involved, and two of them are applicable to a larger class of models (low-rank across-class).
Keywords
matrix algebra; speaker recognition; statistical analysis; vectors; PLDA based i-vector speaker recognition systems; in-domain T matrix; in-domain UBM matrix; labeled in-domain data; length-normalization; probabilistic linear discriminant analysis; supervised domain adaptation; Adaptation models; Approximation methods; Bayes methods; Computational modeling; Speaker recognition; Speech; Training; PLDA; i-vectors; speaker recognition; supervised domain adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854362
Filename
6854362
Link To Document