DocumentCode :
547657
Title :
Towards better GMM-based acoustic modeling for spoken language identification
Author :
Ghasemian, Fahime ; Homayounpour, Mohammad Mahdi
Author_Institution :
Amirkabir University of technology, Tehran, Iran
fYear :
2011
fDate :
17-19 May 2011
Firstpage :
1
Lastpage :
4
Abstract :
Gaussian Mixture Model (GMM) is a widely used, simple and effective modeling approach for spoken language identification. Traditionally EM algorithm is used to train this model. In this paper we propose a new method named WA-GMM (Weight Adapted GMM) for estimating the weights of GMM Gaussian components using bag-of-unigram and Support Vector Machine (SVM): SVM weights which are trained on bag-of-unigram vectors, are used as new weights for GMM Gaussian components. These new weights act better than the weights resulted by EM algorithm. Our experiments on 3 different LID systems on 4 languages from OGI-TS multi-language corpus prove our claim.
Keywords :
Accuracy; Adaptation models; Computational modeling; Speech; Speech recognition; Support vector machines; Training; GMM; SVM; Tokenizer; bag-of-unigram; language identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Engineering (ICEE), 2011 19th Iranian Conference on
Conference_Location :
Tehran, Iran
Print_ISBN :
978-1-4577-0730-8
Electronic_ISBN :
978-964-463-428-4
Type :
conf
Filename :
5955545
Link To Document :
بازگشت