Title :
The algorithm of text classification based on rough set and support vector machine
Author :
Zhuo, Wang ; Lili, Chu
Author_Institution :
Coll. of Bus. Adm., Liaoning Tech. Univ., Huludao, China
Abstract :
Support Vector Machine (SVM) is a new technology of classification in data mining, which is a small sample of statistical learning theory based on structural risk minimization principle and VC theory. It has simple structure and good classification ability, but its processing speed is slow when we deal with large amount of data, affecting classification performance. In order to overcome the shortcoming that SVM is better adaptability, combining rough sets of attribute reduction algorithm with SVM method of classification, the paper presents a new algorithm of text classification based on rough set and support vector machine. In a certain extent of support vector machines (SVM) to improve the ability of processing large-scale data of support vector machine, and through the simulation experiments to verify the superiority and adaptability of algorithm.
Keywords :
classification; data mining; learning (artificial intelligence); rough set theory; support vector machines; text analysis; VC theory; attribute reduction algorithm; data mining; large-scale data; processing speed; rough set; statistical learning theory; structural risk minimization principle; support vector machine; text classification; Classification algorithms; Data mining; Large-scale systems; Risk management; Rough sets; Statistical learning; Support vector machine classification; Support vector machines; Text categorization; Virtual colonoscopy; classification; rough set; support vector machine;
Conference_Titel :
Future Computer and Communication (ICFCC), 2010 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5821-9
DOI :
10.1109/ICFCC.2010.5497769