مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards better GMM-based acoustic modeling for spoken language identification

DocumentCode :

547982

Title :

Towards better GMM-based acoustic modeling for spoken language identification

Author :

Ghasemian, F. ; Homayounpour, Mohammad Mehdi

Author_Institution :

Amirkabir Univ. of Technol., Tehran, Iran

fYear :

2011

fDate :

17-19 May 2011

Firstpage :

Lastpage :

Abstract :

Gaussian Mixture Model (GMM) is a widely used, simple and effective modeling approach for spoken language identification. Traditionally EM algorithm is used to train this model. In this paper we propose a new method named WA-GMM (Weight Adapted GMM) for estimating the weights of GMM Gaussian components using bag-of-unigram and Support Vector Machine (SVM): SVM weights which are trained on bag-of-unigram vectors, are used as new weights for GMM Gaussian components. These new weights act better than the weights resulted by EM algorithm. Our experiments on 3 different LID systems on 4 languages from OGI-TS multi-language corpus prove our claim.

Keywords :

linguistics; speech recognition; support vector machines; GMM based acoustic modeling; Gaussian mixture model; SVM; WA-GMM; bag of unigram vector; spoken language identification; support vector machine; weight adapted GMM; GMM; SVM; Tokenizer; bag-of unigram; language identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Electrical Engineering (ICEE), 2011 19th Iranian Conference on

Conference_Location :

Tehran

Print_ISBN :

978-1-4577-0730-8

Type :

conf

Filename :

5955872

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=547982