Title :
Compensation of extrinsic variability in speaker verification systems on simulated Skype and HF channel data
Author :
Riedhammer, Korbinian ; Bocklet, Tobias ; Nöth, Elmar
Author_Institution :
Lehrstuhl für Mustererkennung, Univ. Erlangen-Nürnberg, Erlangen, Germany
Abstract :
In this work we focus on speaker verification over channels of varying quality, namely Skype and high frequency (HF) radio. In our setup, we assume that telephone recordings of the speakers are available for training, whereas the test recordings come from different channels with varying (lower) signal quality. Starting from a Gaussian mixture model / support vector machine (GMM/SVM) baseline, we evaluate multi-condition training (MCT), an ideal channel classification (ICC) approach, and nuisance attribute projection (NAP) to compensate for the loss of information due to the transmission. In an evaluation on Switchboard-2 data using Skype and HF channel simulators, we show that, for good signal quality, NAP reduces the equal error rate (EER) of the baseline system from 5% to 3.33% (for both Skype and HF). For strongly distorted data, MCT or, if adequate, ICC turns out to be the method of choice.
Keywords :
Gaussian processes; speaker recognition; support vector machines; EER; GMM-SVM baseline; Gaussian mixture-support vector machine baseline; HF channel data; ICC; MCT; NAP; Switchboard-2 data; channel classification approach; extrinsic variability compensation; multicondition training; nuisance attribute projection; simulated Skype; speaker verification systems; telephone recordings; Adaptation models; Hafnium; Robustness; Signal to noise ratio; Support vector machines; System performance; Training; channel compensation; speaker verification
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947439