DocumentCode :
389304
Title :
Web categorization using hybrid algorithms
Author :
Ye, Wei-guo ; Lu, Zheng-Ding
Author_Institution :
Dept. of Comput. Sci., Huazhong Univ. of Sci. & Technol., Wuhan, China
Volume :
2
fYear :
2002
fDate :
2002
Firstpage :
978
Abstract :
Obtaining information from the Web is becoming an increasingly important issue. Traditional text categorization algorithms are not sufficient for Web categorization. In this paper we discuss the Web categorization process and propose a new information gain measure for feature selection and term weighting. We also discuss three linear classifiers and then propose a novel hyperlink-based classifier that uses the characteristics of the Web graph. Experimental comparisons of these algorithms show that our approach is more appropriate than traditional information retrieval methods for Web categorization.
Keywords :
Web sites; classification; feature extraction; information retrieval; learning (artificial intelligence); Web categorization; Web graph; feature selection; hyperlink classifier; information gain measure; information retrieval; learning methods; term weighting; Artificial intelligence; Computer science; Information resources; Information retrieval; Machine learning; Resumes; Spatial databases; Text categorization; Web pages; Web sites;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
Type :
conf
DOI :
10.1109/ICMLC.2002.1174529
Filename :
1174529