DocumentCode :
2235585
Title :
Using GMM with Embedded TDNN to Speaker Identification
Author :
Chen, Cunbao ; Zhao, Li ; Zhao, Yan
Author_Institution :
Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2009
fDate :
26-28 Dec. 2009
Firstpage :
3713
Lastpage :
3716
Abstract :
This paper proposes a modified Gaussian Mixed Model (GMM) with an embedded Time Delay Neural Network (TDNN). It integrates the merits of GMM which is a generative model and TDNN which is a discriminative model. TDNN digests the timing information of the feature sequences, and through the transformation of the feature vectors it makes the hypothesis of variable independence that maximum likelihood needed more reasonable. GMM and TDNN are trained as a whole by means of maximum likelihood probability. In the process of training, the parameters of GMM and TDNN are updated alternately. Experiments show that the proposed model improves accuracy rate against baseline GMM at all SNR, maximum to 21%.
Keywords :
Gaussian processes; maximum likelihood estimation; neural nets; speaker recognition; GMM; embedded TDNN; feature sequences; maximum likelihood probability; speaker identification; time delay neural network; Delay effects; Information science; Multi-layer neural network; Neural networks; Probability density function; Signal processing algorithms; Speaker recognition; Speech recognition; Timing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Engineering (ICISE), 2009 1st International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4909-5
Type :
conf
DOI :
10.1109/ICISE.2009.1329
Filename :
5455648
Link To Document :
بازگشت