DocumentCode :
3297554
Title :
A Web Text Classification Rules Extraction Algorithm
Author :
Liu, He ; Liu, Dayou ; Shi, Xiaohu
Author_Institution :
Key Lab. for Symbolic Comput. & Knowledge Eng. of Minist. of Educ., Jilin Univ., Changchun
Volume :
1
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
693
Lastpage :
697
Abstract :
Text classification is a very important technique for gathering Web information. A novel approach based on multi-population collaborative optimization is proposed for the extraction of Web text classification rules. The information entropy was applied for the initialization of the populations and the multi-population collaborative optimization was applied for the evolution of the populations. The proposed method was applied to three benchmark test sets to examine its effectiveness. Results show that the precision of the proposed method is higher to those of three existing methods, and the cost of computation is less than those of three methods. Furthermore, the classification rules obtained by the proposed method are simple compared with those of three methods.
Keywords :
Internet; classification; data mining; entropy; optimisation; text analysis; Web text classification rule extraction algorithm; information entropy; multipopulation collaborative optimization; Benchmark testing; Classification algorithms; Collaboration; Computational efficiency; Data mining; Helium; Information entropy; Knowledge engineering; Text categorization; Text mining; collaborative optimization; information entropy; rule extraction; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
Type :
conf
DOI :
10.1109/ICNC.2008.231
Filename :
4666933
Link To Document :
بازگشت