Title :
A Classification Approach to Identify Definitions in Aviation Domain
Author :
Pan, Xu ; Gu, Hongbin ; Sun, Chanjuan
Author_Institution :
Coll. of Civil Aviation, Nanjing Univ. of Aeronaut. & Astronaut., Nanjing, China
Abstract :
In this paper, we introduce a classification approach to identify definitions of all terms from a aviation professional corpus. The corpora of aviation domain are firstly segmented by LTP platform from HIT. Then four feature selection methods and two classifiers are applied to extract definitions. First of all, we summarize the correct proportion of feature subset used in classification of term definitions, and secondly argue that the naive Bayes classifier combined with CHI or ODDS for feature selection achieve the best score in the Fl-measure and F2-measure. In the end, we recognize that the use of SVM classifier with linear kernel could achieve very high precision, but the worst recall.
Keywords :
Bayes methods; aerospace computing; pattern classification; support vector machines; LTP platform; SVM classifier; aviation domain corpora; classification approach; feature selection method; naive bayes classifier; Data mining; Educational institutions; Electronic mail; Information analysis; Kernel; Mutual information; Sun; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
DOI :
10.1109/CCPR.2009.5344021