Title :
Segmentation-based Mongolian LVCSR approach
Author :
Feilong Bao ; Guanglai Gao ; Xueliang Yan ; Weihua Wang
Author_Institution :
Coll. of Comput. Sci., Inner Mongolia Univ., Hohhot, China
Abstract :
Mongolian is an agglutinative language. Each root can be followed by several suffixes to formulate new words. This special word formation characteristic results in probably millions of Mongolian words, which is far beyond the coverage of the pronunciation dictionary of any current Mongolian speech recognition system. Moreover, even if the pronunciation dictionary is large enough to cover all of the Mongolian words, the recognition system still cannot perform well due to the problem of sample sparseness. In this paper, we propose a segmentation-based Mongolian Large Vocabulary Continuous Speech Recognition (LVCSR) approach and rebuild the corresponding acoustic model and language model. Experimental results show that, by converting most of these words into their corresponding In-Vocabulary form, the proposed approach effectively recognizes most of the Mongolian words and greatly improves the sample sparseness problem in the language model.
Keywords :
natural language processing; speech recognition; vocabulary; Mongolian speech recognition system; acoustic model; agglutinative language; in-vocabulary form; language model; large vocabulary continuous speech recognition; segmentation based Mongolian LVCSR approach; special word formation; Acoustics; Dictionaries; Educational institutions; Hidden Markov models; Speech recognition; Testing; Training; LVCSR; Mongolian; ending suffix; segmentation; stem;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639250