DocumentCode :
2567952
Title :
Feature-Based Approach to Chinese Term Relation Extraction
Author :
Xia, Sun ; Lehong, Dong
Author_Institution :
Dept. of Comput. Sci. & Technol., Northwest Univ., Xi´´an, China
fYear :
2009
fDate :
15-17 May 2009
Firstpage :
410
Lastpage :
414
Abstract :
In this paper, we propose a feature-based Chinese term relation extraction approach that combined the advantages of both naive bayes algorithm and perceptron algorithm. A subset of the features was estimated in training data; another subset of the features was trained by discriminative function. The results demonstrate that the proposed hybrid algorithm almost always outperforms the naive bayes algorithms and perceptron algorithms whether the training set is small or not. On the other hand, a novel feature representation was proposed, which included term sequence feature, term appearance features and context information features. Comparing the previous method, long-range dependence was considered in the proposed feature representation, which add the position of feature into vector space model (VSM) and promotes the capability of feature representation. Further, punctuation feature is the important character for terms relation extraction.
Keywords :
Bayes methods; classification; data analysis; feature extraction; learning (artificial intelligence); perceptrons; context information feature; discriminative function; feature-based Chinese term relation extraction; naive bayes algorithm; perceptron algorithm; term appearance feature; term sequence feature; vector space model; Computer science; Data mining; Feature extraction; Kernel; Natural languages; Signal processing algorithms; Space exploration; Space technology; Sun; Tree graphs; classification algorithm; feature representation; term relation extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
2009 International Conference on Signal Processing Systems
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3654-5
Type :
conf
DOI :
10.1109/ICSPS.2009.79
Filename :
5166819
Link To Document :
بازگشت