DocumentCode :
3278121
Title :
Fast text categorization based on collaborative work in the semantic and class spaces
Author :
Zheng, Wen-bin ; Zhang, Hua ; Qian, Yun-tao
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Volume :
4
fYear :
2011
fDate :
10-13 July 2011
Firstpage :
1497
Lastpage :
1502
Abstract :
The blooming of the Internet information has made fast text categorization very essential. Generally, in order to accelerate the classification process, the classifier needs to be simplified as much as possible; however, the accuracy might descend drastically in that case, This paper proposes a novel approach to achieve a suitable tradeoff between the speed and accuracy. With category information fusion and basis orthogonality non-negative matrix factorization, the documents can be mapped from the term space to a semantic or class s-pace, and a simple and fast classification method in the class space is proposed. Furthermore a criterion for re-classifying in the semantic space is discussed. Finally, the collaborative work framework in the semantic and class spaces is implemented. Experiments in two benchmarks are presented, and the results are encouraging.
Keywords :
Internet; groupware; matrix decomposition; pattern classification; semantic networks; sensor fusion; text analysis; Internet information; basis orthogonality nonnegative matrix factorization; category information fusion; class spaces; classification process; collaborative work; document mapping; semantic spaces; term space; text categorization; Support vector machine classification; Collaborative work; Fast; Map; Non-negative matrix factorization; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
ISSN :
2160-133X
Print_ISBN :
978-1-4577-0305-8
Type :
conf
DOI :
10.1109/ICMLC.2011.6016976
Filename :
6016976
Link To Document :
بازگشت