DocumentCode :
3730432
Title :
Feature selection for text classification based on part of speech filter and synonym merge
Author :
Sijun Qin; Jia Song;Pengzhou Zhang; Yue Tan
Author_Institution :
New Media Institute, Communication University of China, Beijing, China
fYear :
2015
Firstpage :
681
Lastpage :
685
Abstract :
In recent years, text categorization based on machine learning is a widely used technology in the field of natural language processing and text mining and has gained many advances. Feature selection is one of the key problems in text categorization. The chief obstacles to feature selection are noise and sparseness. In this paper, we propose an approach of Chinese text feature selection based on CV (contribution value), POS (part of speech) filter and synonym merge. We carry out experiments over corpus-TanCorpV1.0 and find that the proposed method performs better than traditional ones.
Keywords :
"Text categorization","Semantics","Merging","Thesauri","Tagging","Training","Frequency selective surfaces"
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type :
conf
DOI :
10.1109/FSKD.2015.7382024
Filename :
7382024
Link To Document :
بازگشت