DocumentCode :
3431732
Title :
Improvement of Mutual Information based on TF-CA-CI algorithm
Author :
Chai, Jiajia ; Zhang, Dexian ; Geng, Ruihuan
fYear :
2012
fDate :
11-13 Aug. 2012
Firstpage :
699
Lastpage :
702
Abstract :
Mutual Information algorithm for text feature selection usually tends to select the rare terms. In allusion to this limitation, this paper makes use of the term frequency, the coupling factor among classes and the cohesion degree inside a class to the MI algorithm, and proposes an improved MI approach based on TFCA-CI algorithm. The experimental result shows that the improved method can effectively control the randomness of the MI method appeared in the process of feature selection when the dimension is low, and achieve a better classified results. So the effectiveness and feasibility of the improved method is achieved.
Keywords :
Accuracy; Classification algorithms; Feature selection; Mutual information; Term frequency; Text category;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2012 IEEE International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4673-2310-9
Type :
conf
DOI :
10.1109/GrC.2012.6468636
Filename :
6468636
Link To Document :
بازگشت