DocumentCode
2124516
Title
Research of improved IF-IDF Weighting algorithm
Author
Jie, Gan ; Li-chao, Chen
Author_Institution
Institute of Computer Science and Technology, Taiyuan University of Science and Technology, Shanxi, China
fYear
2010
fDate
4-6 Dec. 2010
Firstpage
2304
Lastpage
2307
Abstract
It does not consider how similar words are distributed in the text that the traditional algorithm of the VSM characteristic weighs - TF-IDF. For solving the problem, from the semantic view and combined optimization techniques, a improved IF-IDF Weighting algorithm is proposed. This algorithm can effectually reduce the subjective factors of faceted classification, and further improve the effect of current most text clustering algorithm that based on Vector Space Model (VSM). By experiments, the algorithm is feasible and effective, and to some extent, the precision ratio and recall ratio of text clustering is enhanced.
Keywords
Algorithm design and analysis; Classification algorithms; Clustering algorithms; Computational modeling; Semantics; Software; Time frequency analysis; HowNet; clustering; semantic similarity; term weighting algorithm; text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location
Hangzhou, China
Print_ISBN
978-1-4244-7616-9
Type
conf
DOI
10.1109/ICISE.2010.5690286
Filename
5690286
Link To Document