DocumentCode
690365
Title
A Hybrid Approach Using Maximum Entropy Model and Rules to Identify Tibetan Person Names
Author
Yangji Jia ; Jing Jiang ; Hongzhi Yu
Author_Institution
China Inst. of Minorities Inf. Technol., Northwest Univ. for Nat., Lanzhou, China
fYear
2013
fDate
14-15 Dec. 2013
Firstpage
377
Lastpage
380
Abstract
Tibetan person name recognition is one of the most difficult tasks in Tibetan information processing. Its accuracy directly affects the precision of Tibetan word segmentation and the performance of related application systems, including Tibetan-Chinese machine translation, Tibetan information search, and text categorization. Based on an analysis of the wording rules and features of Tibetan names, this paper proposes a method that combines a maximum entropy model with rules to identify Tibetan person names. The experiment shows that the approach performs well, reaching an F1-measure of 95.92%.
Keywords
maximum entropy methods; natural language processing; text analysis; word processing; F1-measure value; Tibetan information processing; Tibetan information search; Tibetan name feature analysis; Tibetan person name identification rules; Tibetan person name recognition; Tibetan word segmentation; Tibetan-Chinese machine translation; hybrid approach; maximum entropy model; text categorization; wording rule analysis; Character recognition; Dictionaries; Educational institutions; Entropy; Natural language processing; Text recognition; Training; Tibetan name recognition; maximum entropy; rule-based approaches;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Sciences and Applications (CSA), 2013 International Conference on
Conference_Location
Wuhan
Type
conf
DOI
10.1109/CSA.2013.95
Filename
6835622