DocumentCode :
476200
Title :
Chinese chunking algorithm based on conditional random fields
Author :
Sun, Guang-lu ; Liu, Bing-quan ; Wang, Xiao-long ; Liu, Yuan-Chao
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
Volume :
5
fYear :
2008
fDate :
12-15 July 2008
Firstpage :
2509
Lastpage :
2513
Abstract :
A new Chinese chunking algorithm based on conditional random fields and semantic features is proposed. Following an analysis of the Chinese chunking task and its sequential characteristics, conditional random fields that combine various kinds of features are applied. Semantic features are used to further improve chunking performance. Experimental results on the Chinese chunking corpus of Microsoft Research Asia show that the algorithm achieves an F-score of 92.52%.
Keywords :
natural language processing; random processes; Chinese chunking algorithm; conditional random fields; semantic feature; Asia; Computer science; Cybernetics; Entropy; Hidden Markov models; Machine learning; Machine learning algorithms; Sun; Support vector machines; Tagging; Chinese chunking; Conditional random fields; Semantic features;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
Type :
conf
DOI :
10.1109/ICMLC.2008.4620830
Filename :
4620830