• DocumentCode
    3131871
  • Title

    Dialect identification based on VQ codebook design with GA-LBG algorithm

  • Author

    He, Yan ; Yu, Feng Qin

  • Author_Institution
    Sch. of Internet of Things Eng., Jiangnan Univ., Wuxi, China
  • Volume
    2
  • fYear
    2011
  • fDate
    20-21 Aug. 2011
  • Firstpage
    94
  • Lastpage
    97
  • Abstract
    In order to solve the problem of GA´s slow convergence in the VQ codebook design due to the strong global search ability and the complex computation, a hybrid algorithm based on the GA-LBG algorithm is adopted because the LBG algorithm as an iterative algorithm based on the nearest neighbor rule and the centroid rule owns the advantage of fast convergence. In the simulation experiment, MFCC extracted from Mandarin, Shanghainese, Cantonese and Hokkien are employed as the feature vectors to establish codebook models with GA-LBG for the dialect identification, and the recognition performances on different size of the VQ codebooks are studied. And simulation results demonstrate that the running time with GA-LBG reduces to 1066.4s, less than that with GA alone.
  • Keywords
    feature extraction; genetic algorithms; iterative methods; natural language processing; pattern clustering; search problems; vector quantisation; Cantonese; GA-LBG algorithm; Hokkien; MFCC extraction; Mandarin; Mel-frequency cepstrum coefficients; Shanghainese; VQ codebook design; centroid rule; complex computation; dialect identification; feature vector; genetic algorithm; global search ability; hybrid algorithm; iterative algorithm; nearest neighbor rule; vector quantization; Algorithm design and analysis; Clustering algorithms; Genetic algorithms; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; GA-LBG algorithm; MFCC; dialect identification; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing, Control and Industrial Engineering (CCIE), 2011 IEEE 2nd International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-9599-3
  • Type

    conf

  • DOI
    10.1109/CCIENG.2011.6008075
  • Filename
    6008075