Title :
Improvement of Mutual Information based on TF-CA-CI algorithm
Author :
Chai, Jiajia ; Zhang, Dexian ; Geng, Ruihuan
Abstract :
Mutual Information algorithm for text feature selection usually tends to select the rare terms. In allusion to this limitation, this paper makes use of the term frequency, the coupling factor among classes and the cohesion degree inside a class to the MI algorithm, and proposes an improved MI approach based on TFCA-CI algorithm. The experimental result shows that the improved method can effectively control the randomness of the MI method appeared in the process of feature selection when the dimension is low, and achieve a better classified results. So the effectiveness and feasibility of the improved method is achieved.
Keywords :
Accuracy; Classification algorithms; Feature selection; Mutual information; Term frequency; Text category;
Conference_Titel :
Granular Computing (GrC), 2012 IEEE International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4673-2310-9
DOI :
10.1109/GrC.2012.6468636