Title :
A method of Chinese text categorization based on proximal support vector machine
Author :
Zhou, Jian-Guo ; Wang, Kai ; Wu, Jing ; Yan, Pu-Liu ; Wu, Ming
Author_Institution :
Sch. of Electron. Inf., Wuhan Univ., China
Abstract :
A Chinese text categorization method based on proximal support vector machine and similarity of words is studied in the paper. Firstly feature vectors are extracted, and then the text feature subset based on similarity of words is obtained, finally the text is categorized based on proximal support vector machine. The tests on the large-scale text show that the recall is comparatively low and the precision is comparatively high.
Keywords :
classification; support vector machines; text analysis; Chinese text; feature vectors; proximal support vector machine; text categorization; text feature subset; Data mining; Feature extraction; Frequency; Large-scale systems; Machine learning; Mutual information; Support vector machine classification; Support vector machines; Testing; Text categorization; Precision; Proximal support vector machine (PSVM); Recall; Text categorization;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527203