DocumentCode
182984
Title
A novel feature voting model for text classification
Author
Sen Jia ; Jinquan Liang ; Yao Xie ; Lin Deng
Author_Institution
Key Lab. of Spatial Inf. Intell. Perception & Services, Shenzhen Univ., Shenzhen, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
306
Lastpage
311
Abstract
Along with the information explosion in the Internet era, the traditional classification methods, such as KNN (k-nearest neighbor), Naive Bayes (NB), encounter bottlenecks due to the endless stream of new words. In this paper, through comparing with the Rocchio and Bayesian algorithms, it has been found that centroid-based algorithms are insufficient for text classification. Therefore, a novel feature voting model is proposed, which gives rise to a bag-of-words based feature voting algorithm for text classification. This algorithm assigns categories for each document according to the ranking of weighted sum of feature values. Experimental results have shown the efficiency of the proposed method over the other state-of-the-art methods.
Keywords
Bayes methods; pattern classification; text analysis; Bayesian algorithm; Internet; KNN; NB; Rocchio algorithm; bag-of-words based feature voting algorithm; centroid-based algorithms; information explosion; k-nearest neighbor; naive Bayes; text classification; Accuracy; Classification algorithms; Equations; Internet; Mathematical model; Training; Vectors; Naive Bayes; feature voting; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980851
Filename
6980851
Link To Document