• DocumentCode
    134260
  • Title

    Multilayer structure based lexicon optimization for agglutinative languages

  • Author

    Ablimit, Mijit ; Pattar, Akbar ; Hamdulla, Askar

  • Author_Institution
    Postdoctoral Res. Station of Comput. Sci. & Technol., Xinjiang Univ., Urumqi, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    411
  • Lastpage
    411
  • Abstract
    Summary form only given. For large vocabulary continuous speech recognition (LVCSR), selection of appropriate lexical unit is the first important step. When the word unit is selected as the lexicon, word boundary detection problem can be avoided. But selection of lexicon is not clear for the derivative morphological structure (e.g. agglutinative languages), and there is no word boundary for many languages (Chinese, Japanese, etc.). This paper, based on the Uyghur LVCSR system, analyze multi-layered lexicon based automatic speech recognition (ASR) systems, compare the ASR results of various linguistic layers, propose a new method which can balance the advantages of two layers of lexicons. By aligning and comparing the ASR results of two layers, we analyze error patterns, extract samples as the training data for the alternative selection method. Experimental results show that the proposed method effectively improved the ASR accuracy while maintaining small lexicon size.
  • Keywords
    natural language processing; speech recognition; ASR systems; Uyghur LVCSR system; agglutinative languages; derivative morphological structure; large vocabulary continuous speech recognition; lexical unit; linguistic layers; multilayer structure based lexicon optimization; multilayered lexicon based automatic speech recognition; word boundary detection problem; Computer science; Educational institutions; Information science; Nonhomogeneous media; Optimization; Software; Speech recognition; Agglutinative Language; Lexicon optimization; Multilayer structure; Speech recognition; Uyghur;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936653
  • Filename
    6936653