DocumentCode
3334789
Title
Initialization in speaker model training based on expectation maximization
Author
Yihong Wang
Author_Institution
Coll. of Energy & Electr. Eng., Hohai Univ., Nanjing, China
Volume
03
fYear
2013
fDate
16-18 Dec. 2013
Firstpage
1309
Lastpage
1313
Abstract
The optimized speaker model is trained by many time iterative algorithm based on expectation maximization (Abbr. EM). In the process, the choice of speaker model initial value has great influence on the final recognition effect. The most common algorithms which are used to choose the initial value are K-means algorithm and LBG algorithm at present, but the two algorithms belong to a sort of local clustering arithmetic, therefore, it is difficult for them to provide the optimal initial value. For this reason, the ant colony algorithm combined with genetic arithmetic is proposed in the paper. The comparative experiment between this algorithm and K-means algorithm has been done, and the experimental results have been obtained to verify that this algorithm can bring better recognition rate than K-means algorithm.
Keywords
ant colony optimisation; expectation-maximisation algorithm; iterative methods; pattern clustering; speaker recognition; K-means algorithm; LBG algorithm; ant colony algorithm; expectation maximization algorithm; genetic arithmetic; iterative algorithm; local clustering arithmetic; speaker model training; Clustering algorithms; Genetic algorithms; Signal processing algorithms; Speech; Statistics; Training; Vectors; Colony Algorithm; Gaussian Mixture Model; Genetic Algorithms; Model Parameters; voiceprint recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2013 6th International Congress on
Conference_Location
Hangzhou
Print_ISBN
978-1-4799-2763-0
Type
conf
DOI
10.1109/CISP.2013.6743875
Filename
6743875
Link To Document