مرکز منطقه ای اطلاع رساني علوم و فناوري - Using GMM with Embedded TDNN to Speaker Identification

DocumentCode :

2235585

Title :

Using GMM with Embedded TDNN to Speaker Identification

Author :

Chen, Cunbao ; Zhao, Li ; Zhao, Yan

Author_Institution :

Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China

fYear :

2009

fDate :

26-28 Dec. 2009

Firstpage :

3713

Lastpage :

3716

Abstract :

This paper proposes a modified Gaussian Mixed Model (GMM) with an embedded Time Delay Neural Network (TDNN). It integrates the merits of GMM which is a generative model and TDNN which is a discriminative model. TDNN digests the timing information of the feature sequences, and through the transformation of the feature vectors it makes the hypothesis of variable independence that maximum likelihood needed more reasonable. GMM and TDNN are trained as a whole by means of maximum likelihood probability. In the process of training, the parameters of GMM and TDNN are updated alternately. Experiments show that the proposed model improves accuracy rate against baseline GMM at all SNR, maximum to 21%.

Keywords :

Gaussian processes; maximum likelihood estimation; neural nets; speaker recognition; GMM; embedded TDNN; feature sequences; maximum likelihood probability; speaker identification; time delay neural network; Delay effects; Information science; Multi-layer neural network; Neural networks; Probability density function; Signal processing algorithms; Speaker recognition; Speech recognition; Timing; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Science and Engineering (ICISE), 2009 1st International Conference on

Conference_Location :

Nanjing

Print_ISBN :

978-1-4244-4909-5

Type :

conf

DOI :

10.1109/ICISE.2009.1329

Filename :

5455648

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2235585