DocumentCode
3131871
Title
Dialect identification based on VQ codebook design with GA-LBG algorithm
Author
He, Yan ; Yu, Feng Qin
Author_Institution
Sch. of Internet of Things Eng., Jiangnan Univ., Wuxi, China
Volume
2
fYear
2011
fDate
20-21 Aug. 2011
Firstpage
94
Lastpage
97
Abstract
In order to solve the problem of GA´s slow convergence in the VQ codebook design due to the strong global search ability and the complex computation, a hybrid algorithm based on the GA-LBG algorithm is adopted because the LBG algorithm as an iterative algorithm based on the nearest neighbor rule and the centroid rule owns the advantage of fast convergence. In the simulation experiment, MFCC extracted from Mandarin, Shanghainese, Cantonese and Hokkien are employed as the feature vectors to establish codebook models with GA-LBG for the dialect identification, and the recognition performances on different size of the VQ codebooks are studied. And simulation results demonstrate that the running time with GA-LBG reduces to 1066.4s, less than that with GA alone.
Keywords
feature extraction; genetic algorithms; iterative methods; natural language processing; pattern clustering; search problems; vector quantisation; Cantonese; GA-LBG algorithm; Hokkien; MFCC extraction; Mandarin; Mel-frequency cepstrum coefficients; Shanghainese; VQ codebook design; centroid rule; complex computation; dialect identification; feature vector; genetic algorithm; global search ability; hybrid algorithm; iterative algorithm; nearest neighbor rule; vector quantization; Algorithm design and analysis; Clustering algorithms; Genetic algorithms; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; GA-LBG algorithm; MFCC; dialect identification; vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Control and Industrial Engineering (CCIE), 2011 IEEE 2nd International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-9599-3
Type
conf
DOI
10.1109/CCIENG.2011.6008075
Filename
6008075
Link To Document