Title :
Chinese chunking algorithm based on conditional random fields
Author :
Sun, Guang-lu ; Liu, Bing-quan ; Wang, Xiao-long ; Liu, Yuan-Chao
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
Abstract :
A new Chinese chunking algorithm is proposed based on conditional random fields and semantic features. Through the analysis of Chinese chunking task and its sequential characteristics, conditional random fields that combine various kinds of features were applied. Semantic features were utilized to further improve the chunking performance. Experimental results on the Chinese chunking corpus of Microsoft Research Asia show that the algorithm achieves impressive accuracy of 92.52% in terms of the F-score.
Keywords :
natural language processing; random processes; Chinese chunking algorithm; conditional random fields; semantic feature; Asia; Computer science; Cybernetics; Entropy; Hidden Markov models; Machine learning; Machine learning algorithms; Sun; Support vector machines; Tagging; Chinese chunking; Conditional random fields; Semantic features;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4620830