DocumentCode :
3367163
Title :
Improving Short Text Classification through Better Feature Space Selection
Author :
Meng Wang ; Lanfen Lin ; Feng Wang
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear :
2013
fDate :
14-15 Dec. 2013
Firstpage :
120
Lastpage :
124
Abstract :
Nowadays people are overwhelmed by more and more short information from lots of different applications, especially with the rapid development of mobile systems. One way to alleviate this issue is an automatic classification of the short texts before they are delivered to users. Several methods have been proposed to classify the short texts, and they are largely based on expanding the short texts to longer ones with external resources to solve the sparseness problem. Different from these studies, we tackle the sparseness problem by selecting a better feature space in which the feature vectors of the short texts are denser, and our method needs no external resources at all. The experimental results on an open dataset show that this method can significantly improve the short text classification accuracy comparing with the baseline, especially when the dimension of the feature space is low.
Keywords :
classification; text analysis; vectors; feature space selection; feature vector; short text classification; sparseness problem; Accuracy; Encyclopedias; Presses; Search engines; Support vector machine classification; Text categorization; feature space; feature words; short text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4799-2548-3
Type :
conf
DOI :
10.1109/CIS.2013.32
Filename :
6746368
Link To Document :
بازگشت