Title :
SVM Based Speaker Recognition Using Maximum a posteriori Linear Regression
Author :
Zhang, Xiang ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution :
Inst. of Acoust., Chinese Acad. of Sci., Beijing
Abstract :
Maximum likelihood linear regression (MLLR) is a widely used technique for speaker adaptation in large vocabulary speech recognition system. Recently, using MLLR transforms as features for SVM based speaker recognition tasks has been proposed, achieving performance comparable to that obtained with cepstral features. In this paper, we focus on calculating the transforms based on a GMM universal background model (UBM). Rather than estimating the transforms using maximum likelihood criterion, this paper describes a new feature extraction technique for speaker recognition based on maximum a posteriori linear regression (MAPLR), which uses maximum a posteriori (MAP) as estimation criterion. We perform experiments on a NIST SRE 2008 corpus. Experimental results show that the system based on MAPLR technique outperforms MLLR in the task of speaker recognition.
Keywords :
Gaussian processes; feature extraction; maximum likelihood estimation; regression analysis; speaker recognition; support vector machines; transforms; GMM; MAP estimation; SVM; cepstral analysis; feature extraction; large vocabulary speech recognition system; maximum a posteriori linear regression; maximum likelihood linear regression transform; speaker adaptation; speaker recognition; universal background model; Cepstral analysis; Feature extraction; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; NIST; Speaker recognition; Speech recognition; Support vector machines; Vocabulary;
Conference_Titel :
Electronic Computer Technology, 2009 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-0-7695-3559-3
DOI :
10.1109/ICECT.2009.82