• DocumentCode
    690365
  • Title
    A Hybrid Approach Using Maximum Entropy Model and Rules to Identify Tibetan Person Names
  • Author
    Yangji Jia ; Jing Jiang ; Hongzhi Yu
  • Author_Institution
    China Inst. of Minorities Inf. Technol., Northwest Univ. for Nat., Lanzhou, China
  • fYear
    2013
  • fDate
    14-15 Dec. 2013
  • Firstpage
    377
  • Lastpage
    380
  • Abstract
    Tibetan person name recognition is one of the most difficult tasks in Tibetan information processing. Its accuracy directly affects the precision of Tibetan word segmentation and the performance of related application systems, including Tibetan-Chinese machine translation, Tibetan information search, and text categorization. Based on an analysis of the wording rules and features of Tibetan names, this paper proposes a method that combines a maximum entropy model with rules to identify Tibetan person names. The experiment shows that the approach performs well, reaching an F1-measure of 95.92%.
  • Keywords
    maximum entropy methods; natural language processing; text analysis; word processing; F1-measure value; Tibetan information processing; Tibetan information search; Tibetan name feature analysis; Tibetan person name identification rules; Tibetan person name recognition; Tibetan word segmentation; Tibetan-Chinese machine translation; hybrid approach; maximum entropy model; text categorization; wording rule analysis; Character recognition; Dictionaries; Educational institutions; Entropy; Natural language processing; Text recognition; Training; Tibetan name recognition; maximum entropy; rule-based approaches;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Sciences and Applications (CSA), 2013 International Conference on
  • Conference_Location
    Wuhan
  • Type
    conf
  • DOI
    10.1109/CSA.2013.95
  • Filename
    6835622