DocumentCode :
3312644
Title :
A novel Voting Algorithm of multi-class SVM for web page classification
Author :
Thamrongrat, Pornpon ; Preechaveerakul, Ladda ; Wettayaprasit, Wiphada
Author_Institution :
Comput. Sci. Dept., Prince of Songkla Univ., Songkla, Thailand
fYear :
2009
fDate :
8-11 Aug. 2009
Firstpage :
327
Lastpage :
331
Abstract :
The increasing numbers of Web pages on the cyber world result to the less effectiveness of document retrieval that matches the need of users. The classification of Web pages is one of the solutions to solve this problem. This paper proposes VAMSVM_WPC model which is a novel voting algorithm for classifying the Web pages, which uses a multi-class SVM method. First, feature is generated from text and title, and then reduces the number of features by two feature selection techniques. Use these two types of features to give input to multi-class SVM. Finally, on the output of SVM, a voting algorithm is used to determine the category of the Web pages. Results on CMU benchmark dataset show that using text and title feature with 1vsAll_Voting Algorithm gives the highest F-measure value.
Keywords :
Internet; information retrieval; pattern classification; support vector machines; text analysis; Web page classification; document retrieval; feature selection; multiclass support vector machine; text analysis; Artificial intelligence; Computer science; Equations; Laboratories; Performance gain; Support vector machine classification; Support vector machines; Testing; Voting; Web pages; feature selection; support vector machine; web page classification voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4519-6
Electronic_ISBN :
978-1-4244-4520-2
Type :
conf
DOI :
10.1109/ICCSIT.2009.5234603
Filename :
5234603
Link To Document :
بازگشت