مرکز منطقه ای اطلاع رساني علوم و فناوري - Fast training of Large Margin diagonal Gaussian mixture models for speaker identification

DocumentCode :

2019852

Title :

Fast training of Large Margin diagonal Gaussian mixture models for speaker identification

Author :

Jourani, Reda ; Daoudi, Khalid ; André-Obrecht, Régine ; Aboutajdine, Driss

Author_Institution :

SAMoVA Group, Univ. Paul Sabatier, Toulouse, France

fYear :

2011

fDate :

18-21 May 2011

Firstpage :

Lastpage :

Abstract :

Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decades. They are generally trained using the generative criterion of maximum likelihood estimation. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we present a new version of this algorithm which has the major advantage of being computationally highly efficient. The resulting algorithm is thus well suited to handle large scale databases. We carry out experiments on a speaker identification task using NIST-SRE´2006 data and compare our new algorithm to the baseline generative GMM using different GMM sizes. The results show that our system significantly outperforms the baseline GMM in all configurations, and with high computational efficiency.

Keywords :

Gaussian processes; maximum likelihood estimation; speaker recognition; large margin diagonal Gaussian mixture models; maximum likelihood estimation; speaker identification; speaker recognition; Adaptation model; Algorithm design and analysis; Hidden Markov models; Speaker recognition; Speech; Speech recognition; Training; Gaussian mixture models; discriminative learning; large margin training; speaker identification; speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on

Conference_Location :

Brasov

Print_ISBN :

978-1-4577-0440-6

Type :

conf

DOI :

10.1109/SPED.2011.5940738

Filename :

5940738

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2019852