DocumentCode :
1571934
Title :
An Extensive Empirical Study of Feature Selection for Text Categorization
Author :
Qiu, Li-Qing ; Zhao, Ru-Yi ; Zhou, Gang ; Yi, Sheng-Wei
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beihang
fYear :
2008
Firstpage :
312
Lastpage :
315
Abstract :
We present a novel feature selection (FS) approach for text categorization. It first constructs a local feature set for each category by selecting a set of features based on three different schemes: DF, TF and TFIDF, and then constructs a global feature set utilizing well-known CHI method based on the local feature set. The experimental comparison is carried out between our method and CHI method. Results from the experiments are summarized. The results show that our proposed method based on DF scheme can perform comparatively well with CHI methods.
Keywords :
learning (artificial intelligence); pattern classification; text analysis; CHI methods; feature selection; local feature set; text categorization; Character generation; Computational efficiency; Frequency measurement; Gain measurement; Information analysis; Information science; Performance gain; Programming; Space technology; Text categorization; Text Categorization; feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Science, 2008. ICIS 08. Seventh IEEE/ACIS International Conference on
Conference_Location :
Portland, OR
Print_ISBN :
978-0-7695-3131-1
Type :
conf
DOI :
10.1109/ICIS.2008.49
Filename :
4529838
Link To Document :
بازگشت