DocumentCode
476200
Title
Chinese chunking algorithm based on conditional random fields
Author
Sun, Guang-lu ; Liu, Bing-quan ; Wang, Xiao-long ; Liu, Yuan-Chao
Author_Institution
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
Volume
5
fYear
2008
fDate
12-15 July 2008
Firstpage
2509
Lastpage
2513
Abstract
A new Chinese chunking algorithm based on conditional random fields and semantic features is proposed. Following an analysis of the Chinese chunking task and its sequential characteristics, conditional random fields that combine various kinds of features were applied. Semantic features were then used to further improve chunking performance. Experimental results on the Chinese chunking corpus of Microsoft Research Asia show that the algorithm achieves an F-score of 92.52%.
Keywords
natural language processing; random processes; Chinese chunking algorithm; conditional random fields; semantic feature; Asia; Computer science; Cybernetics; Entropy; Hidden Markov models; Machine learning; Machine learning algorithms; Sun; Support vector machines; Tagging; Chinese chunking; Conditional random fields; Semantic features
fLanguage
English
Publisher
IEEE
Conference_Titel
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2095-7
Electronic_ISBN
978-1-4244-2096-4
Type
conf
DOI
10.1109/ICMLC.2008.4620830
Filename
4620830