Title :
A Hybrid Documents Classification Based on SVM and Rough Sets
Author :
Rujiang, Bai ; Junhua, Liao
Author_Institution :
Shandong Univ. of Technol. Libr., Zibo, China
Abstract :
Standard machine learning techniques like support vector machines (SVM) and related large margin methods have been successfully applied for text classification. Unfortunately, the high dimensionality of input feature vectors impacts on the classification speed. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy. The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed. In order to improve classification speed we spent rough sets theory to reduce the feature vector space. We present a genetic algorithm approach for feature selection and parameters optimization to improve classification accuracy. Experimental results indicate our method is more effective than traditional SVM methods and other traditional methods.
Keywords :
classification; genetic algorithms; rough set theory; support vector machines; feature selection; genetic algorithms; hybrid documents classification; kernel parameters; machine learning; rough sets; support vector machines; Electronic mail; Genetic algorithms; Kernel; Libraries; Organizing; Rough sets; Space technology; Support vector machine classification; Support vector machines; Text categorization; Document Classification; Genetic Algorithms; Rough Sets; Support Vector Machine;
Conference_Titel :
Advanced Science and Technology, 2009. AST '09. International e-Conference on
Conference_Location :
Dajeon
Print_ISBN :
978-0-7695-3672-9
DOI :
10.1109/AST.2009.14