DocumentCode :
1954279
Title :
A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis
Author :
Zhao, Lili ; Men, Jia ; Zhang, Congpin ; Liu, Qun ; Jiang, Wenbin ; Wu, Jinxin ; Chang, Qing
Author_Institution :
Coll. of Comput. & Inf. Technol., Henan Normal Univ., Xinxiang, China
fYear :
2010
fDate :
28-30 Dec. 2010
Firstpage :
7
Lastpage :
10
Abstract :
Mongolian lexical analysis is the first step in Mongolian information processing such as Chinese-Mongolian machine translation. In this paper, we introduce a statistic and rule based approach to solving the Mongolian word segmentation & POS tagging all at once. In this method, we use tree frame as basic statistical model. And then we combine the model with some rules to improve the lexical analysis system accuracy. The experiment results show that the word-level accuracy of joint segmentation and POS tagging is 95.2%, stem / postfix-level accuracy is 94.6%.
Keywords :
identification technology; knowledge based systems; natural language processing; statistical analysis; word processing; Chinese-Mongolian machine translation; Mongolian information processing; Mongolian lexical analysis; Mongolian word segmentation; POS tagging; rule based approach; statistical approach; tree frame; word level accuracy; Accuracy; Analytical models; Dictionaries; Joints; Pragmatics; Probability; Tagging; Joint segmentation and POS tagging; Mongolian Part of Speech Tagging; Mongolian Word Segment; Mongolian information processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9063-9
Type :
conf
DOI :
10.1109/IALP.2010.79
Filename :
5681555
Link To Document :
بازگشت