Title :
Using GMM with Embedded TDNN to Speaker Identification
Author :
Chen, Cunbao ; Zhao, Li ; Zhao, Yan
Author_Institution :
Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
Abstract :
This paper proposes a modified Gaussian Mixed Model (GMM) with an embedded Time Delay Neural Network (TDNN). It integrates the merits of GMM which is a generative model and TDNN which is a discriminative model. TDNN digests the timing information of the feature sequences, and through the transformation of the feature vectors it makes the hypothesis of variable independence that maximum likelihood needed more reasonable. GMM and TDNN are trained as a whole by means of maximum likelihood probability. In the process of training, the parameters of GMM and TDNN are updated alternately. Experiments show that the proposed model improves accuracy rate against baseline GMM at all SNR, maximum to 21%.
Keywords :
Gaussian processes; maximum likelihood estimation; neural nets; speaker recognition; GMM; embedded TDNN; feature sequences; maximum likelihood probability; speaker identification; time delay neural network; Delay effects; Information science; Multi-layer neural network; Neural networks; Probability density function; Signal processing algorithms; Speaker recognition; Speech recognition; Timing; Training data;
Conference_Titel :
Information Science and Engineering (ICISE), 2009 1st International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4909-5
DOI :
10.1109/ICISE.2009.1329