DocumentCode :
1657499
Title :
Automatic text categorization based on Jensen-Shannon Divergence
Author :
Li, Xiangdong ; Sun, Denghui ; Wuhan, Li Huang
Author_Institution :
The School of Information Management, Wuhan University, Wuhan 430072, China
fYear :
2011
Firstpage :
1
Lastpage :
4
Abstract :
This paper studies the principle of text categorization in which Jensen-Shannon Divergence is used to calculate text similarity, comparing its accuracy of classification and time taking to the traditional Cosine Similarity algorithm. Experimental research shows that Jensen-Shannon Divergence algorithm will reach better results when test materials remain unchanged.
Keywords :
Classification algorithms; Information retrieval; Libraries; Publishing; Research and development; Sun; Text categorization; Jensen-Shannon divergence; KNN algorithm; cosine similarity; text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
E -Business and E -Government (ICEE), 2011 International Conference on
Conference_Location :
Shanghai, China
Print_ISBN :
978-1-4244-8691-5
Type :
conf
DOI :
10.1109/ICEBEG.2011.5882549
Filename :
5882549
Link To Document :
بازگشت