  • DocumentCode
    476200
  • Title
    Chinese chunking algorithm based on conditional random fields
  • Author
    Sun, Guang-lu ; Liu, Bing-quan ; Wang, Xiao-long ; Liu, Yuan-Chao
  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
  • Volume
    5
  • fYear
    2008
  • fDate
    12-15 July 2008
  • Firstpage
    2509
  • Lastpage
    2513
  • Abstract
    A new Chinese chunking algorithm based on conditional random fields (CRFs) and semantic features is proposed. Following an analysis of the Chinese chunking task and its sequential characteristics, CRFs that combine various kinds of features are applied, and semantic features are used to further improve chunking performance. Experimental results on the Chinese chunking corpus of Microsoft Research Asia show that the algorithm achieves an F-score of 92.52%.
  • Keywords
    natural language processing; random processes; Chinese chunking algorithm; conditional random fields; semantic feature; Asia; Computer science; Cybernetics; Entropy; Hidden Markov models; Machine learning; Machine learning algorithms; Sun; Support vector machines; Tagging; Chinese chunking; Conditional random fields; Semantic features
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2008 International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2095-7
  • Electronic_ISBN
    978-1-4244-2096-4
  • Type
    conf
  • DOI
    10.1109/ICMLC.2008.4620830
  • Filename
    4620830