Title :
UBM data selection for effective speaker modeling
Author :
Huang, Chien-Lin ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., A*Star, Singapore, Singapore
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
This paper presents a UBM data selection method for robust training. We know that there is no promise that more training data guarantee better results. Therefore, the way of sub-sampling and effective training become important. The proposed method uses the feature vector selection with the maximum-entropy criterion. The maximum-entropy shows the diverse characters of speaker and minimum redundant information as well. The UBM training data is investigated on three datasets to compare the proposed method with the conventional sub-sampling approaches. We conducted experiments on the 2008 NIST Speaker Recognition Evaluation corpus shows that the proposed method outperforms the conventional one in speaker recognition.
Keywords :
entropy; speaker recognition; vectors; UBM; data selection; feature vector selection; maximum entropy criterion; minimum redundant information; speaker recognition; universal background model; Adaptation model; Entropy; NIST; Speaker recognition; Speech; Training; Training data; UBM training; data selection; maximum-extropy; speaker recognition; sub-sampling;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684493