Title :
A novel feature sub-sampling method for efficient universal background model training in speaker verification
Author :
Hasan, Taufiq ; Lei, Yun ; Chandrasekaran, Aravind ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst., Univ. of Texas at Dallas, Dallas, TX, USA
Abstract :
Speaker recognition/verification systems require an extensive universal background model (UBM), which typically requires extensive resources, especially if new channel domains are considered. In this study we propose an effective and computationally efficient algorithm for training the UBM for speaker verification. A novel method based on Euclidean distance between features is developed for effective sub-sampling of potential training feature vectors. Using only about 1.5 seconds of data from each development utterance, the proposed UBM training method drastically reduces the computation time, while improving, or at least retaining original speaker verification system performance. While methods such as factor analysis can mitigate some of the issues associated with channel/microphone/environmental mismatch, the proposed rapid UBM training scheme offers a viable alternative for rapid environment dependent UBMs.
Keywords :
speaker recognition; Euclidean distance; channel-microphone-environmental mismatch; computation time; factor analysis; feature subsampling method; speaker recognition; speaker verification; time 1.5 s; universal background model training; Computer science; Euclidean distance; Loudspeakers; Microphones; NIST; Robustness; Speaker recognition; Speech; Support vector machines; System performance; Speaker verification; universal background model;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495601