مرکز منطقه ای اطلاع رساني علوم و فناوري - Dimension-Decoupled Gaussian Mixture Model for Short Utterance Speaker Recognition

DocumentCode :

2499286

Title :

Dimension-Decoupled Gaussian Mixture Model for Short Utterance Speaker Recognition

Author :

Stadelmann, Thilo ; Freisleben, Bernd

Author_Institution :

Dept. of Math. & Comput. Sci., Univ. of Marburg, Marburg, Germany

fYear :

2010

fDate :

23-26 Aug. 2010

Firstpage :

1602

Lastpage :

1605

Abstract :

The Gaussian Mixture Model (GMM) is often used in conjunction with Mel-frequency cepstral coefficient (MFCC) feature vectors for speaker recognition. A great challenge is to use these techniques in situations where only small sets of training and evaluation data are available, which typically results in poor statistical estimates and, finally, recognition scores. Based on the observation of marginal MFCC probability densities, we suggest to greatly reduce the number of free parameters in the GMM by modeling the single dimensions separately after proper preprocessing. Saving about 90% of the free parameters as compared to an already optimized GMM and thus making the estimates more stable, this approach considerably improves recognition accuracy over the baseline as the utterances get shorter and saves a huge amount of computing time both in training and evaluation, enabling real-time performance. The approach is easy to implement and to combine with other short-utterance approaches, and applicable to other features as well.

Keywords :

Gaussian processes; probability; speaker recognition; vectors; MFCC probability densities; Mel-frequency cepstral coefficient feature vectors; dimension-decoupled Gaussian mixture model; free parameters; recognition accuracy; short utterance speaker recognition; Accuracy; Computational modeling; Data models; Mel frequency cepstral coefficient; Speaker recognition; Training; Training data; GMM; MFCC; short data; speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Pattern Recognition (ICPR), 2010 20th International Conference on

Conference_Location :

Istanbul

ISSN :

1051-4651

Print_ISBN :

978-1-4244-7542-1

Type :

conf

DOI :

10.1109/ICPR.2010.396

Filename :

5596995

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2499286