DocumentCode
389304
Title
Web categorization using hybrid algorithms
Author
Ye, Wei-guo ; Lu, Zheng-Ding
Author_Institution
Dept. of Comput. Sci., Huazhong Univ. of Sci. & Technol., Wuhan, China
Volume
2
fYear
2002
fDate
2002
Firstpage
978
Abstract
Obtaining information from the Web is becoming a very much important issue nowadays. The traditional text categorization algorithm is not sufficient for web categorization. In this paper we discuss the process in Web categorization, and proposed a new information gain measure for feature selections and term weighting. We also discussed three linear classifiers. Then we propose a novel hyperlink based classifier. It uses the characteristics of the Web graph. Experimental comparisons of these algorithms show that our approach is more appropriate than traditional information retrieval methods in Web categorization.
Keywords
Web sites; classification; feature extraction; information retrieval; learning (artificial intelligence); Web categorization; Web graph; feature selection; hyperlink classifier; information gain measure; information retrieval; learning methods; term weighting; Artificial intelligence; Computer science; Information resources; Information retrieval; Machine learning; Resumes; Spatial databases; Text categorization; Web pages; Web sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN
0-7803-7508-4
Type
conf
DOI
10.1109/ICMLC.2002.1174529
Filename
1174529
Link To Document