DocumentCode :
2348634
Title :
A new cascade algorithm based on CRFs for recognizing Chinese verb-object collocation
Author :
Zhang, Guiping ; Liu, Zhichao ; Zhou, Qiaoli ; Cai, Dongfeng ; Cheng, Jiao
Author_Institution :
Knowledge Eng. ResearchCenter, Shenyang Aerosp. Univ., Shenyang, China
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
7
Abstract :
This paper proposes a new cascade algorithm based on conditional random fields. The algorithm is applied to automatic recognition of Chinese verb-object collocation, and combined with a new sequence labeling of “ONIY”. Experiments compare identified results under two segmentations and part-of-speech tag sets. The comprehensive experimental results show that the best performance is 90.65% in F-score over Tsinghua Treebank, and 82.00% in F-score over the segmentation and part-of-speech tagging scheme of Peking University. Our experiments show that the proposed algorithm can greatly improve recognition accuracy of multi-nested collocation, and play a positive role on long distance collocation.
Keywords :
natural language processing; probability; Chinese verb object collocation recognition; F-score; ONIY; Peking university; Tsinghua treebank; cascade algorithm; conditional random fields; long distance collocation; multinested collocation; part-of-speech tag sets; sequence labeling; Verb-object collocation; conditional random fields; long distance; multi-nested; new cascade algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587828
Filename :
5587828
Link To Document :
بازگشت