DocumentCode
134260
Title
Multilayer structure based lexicon optimization for agglutinative languages
Author
Ablimit, Mijit ; Pattar, Akbar ; Hamdulla, Askar
Author_Institution
Postdoctoral Res. Station of Comput. Sci. & Technol., Xinjiang Univ., Urumqi, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
411
Lastpage
411
Abstract
Summary form only given. For large vocabulary continuous speech recognition (LVCSR), selection of appropriate lexical unit is the first important step. When the word unit is selected as the lexicon, word boundary detection problem can be avoided. But selection of lexicon is not clear for the derivative morphological structure (e.g. agglutinative languages), and there is no word boundary for many languages (Chinese, Japanese, etc.). This paper, based on the Uyghur LVCSR system, analyze multi-layered lexicon based automatic speech recognition (ASR) systems, compare the ASR results of various linguistic layers, propose a new method which can balance the advantages of two layers of lexicons. By aligning and comparing the ASR results of two layers, we analyze error patterns, extract samples as the training data for the alternative selection method. Experimental results show that the proposed method effectively improved the ASR accuracy while maintaining small lexicon size.
Keywords
natural language processing; speech recognition; ASR systems; Uyghur LVCSR system; agglutinative languages; derivative morphological structure; large vocabulary continuous speech recognition; lexical unit; linguistic layers; multilayer structure based lexicon optimization; multilayered lexicon based automatic speech recognition; word boundary detection problem; Computer science; Educational institutions; Information science; Nonhomogeneous media; Optimization; Software; Speech recognition; Agglutinative Language; Lexicon optimization; Multilayer structure; Speech recognition; Uyghur;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936653
Filename
6936653
Link To Document