DocumentCode :
3773587
Title :
Feature Weighting Method Based on Real-Coded Genetic Algorithm in Text Categorization
Author :
Junwei Li;Xiangqian Li
Author_Institution :
Sch. of Comput. &
Volume :
2
fYear :
2015
Firstpage :
91
Lastpage :
94
Abstract :
Feature weighting technique can improve the accuracy of text categorization, TF-IDF is a generally used feature weighting method. Currently, some improved methods based TF-IDF have been proposed, but there is not a method that is able to comprehensive each algorithm´s advantages. So, feature weighting method based on real-coded genetic algorithm (GA) is proposed in this paper, the real-coded GA is used to calculate the feature weights. Concrete steps as follows: firstly, use information gain to reduce dimension. Secondly, use real-coded GA to calculate each feature weights. Lastly, classify text according to the weighted cosine distance. Experiments proved that the real-coded GA method is superior to the traditional TF-IDF method.
Keywords :
"Genetic algorithms","Sociology","Statistics","Biological cells","Text categorization","Classification algorithms","Training"
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Design (ISCID), 2015 8th International Symposium on
Print_ISBN :
978-1-4673-9586-1
Type :
conf
DOI :
10.1109/ISCID.2015.131
Filename :
7469088
Link To Document :
بازگشت