DocumentCode :
2426667
Title :
Chinese chunking and its application on similarity computation
Author :
Sun, Guanglu ; Liu, Bingquan ; Wang, Xiaolong ; Liu, Yuanchao
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
fYear :
2008
fDate :
7-9 July 2008
Firstpage :
1194
Lastpage :
1198
Abstract :
This paper presents a new Chinese chunking algorithm based on conditional random fields. Conditional random fields overcome the label bias problem, model the labeling sequence and utilize many types of features. Furthermore, an algorithm of chunk similarity computation is proposed based on the systematic similarity method and semantic dictionary. The experimental results show that this approach achieves impressive accuracy in terms of the F-score: 92.00%. And the similarity computation algorithm performs well.
Keywords :
dictionaries; natural language processing; Chinese chunking; chunk similarity computation; conditional random fields; label bias problem; semantic dictionary; systematic similarity method; Application software; Computer applications; Dictionaries; Entropy; Feature extraction; Hidden Markov models; Machine learning algorithms; Solid modeling; Tagging; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
Type :
conf
DOI :
10.1109/ICALIP.2008.4590216
Filename :
4590216
Link To Document :
بازگشت