DocumentCode :
465981
Title :
Learning Web Categorization with Controlled Generation of Context Features
Author :
Wong, Alex K S ; Lee, John W T ; Yeung, Daniel S.
Author_Institution :
Hong Kong Polytech Univ., Kowloon
Volume :
4
fYear :
2006
fDate :
8-11 Oct. 2006
Firstpage :
2960
Lastpage :
2964
Abstract :
Automatic categorization of Web pages is an important area of study due to the rapidly growing amount of Web data. Efficient and accurate classification would greatly facilitate finding what one needs in the sea of information. Context-sensitive techniques have been proven to be effective in the classification task. However, the feature space for context feature that one can explore in these techniques is enormous. To consider these features comprehensively often become prohibitive in terms of resource requirements. In this paper, we propose an approach to intelligently control generating context features for the classification learning process. We present our investigation of this approach in the context of Web page categorization using the sleeping-experts technique.
Keywords :
Internet; classification; learning (artificial intelligence); text analysis; Web pages categorization; classification learning process; context-sensitive techniques; sleeping-experts technique; text categorization; Automatic control; Automatic generation control; Control systems; Cybernetics; HTML; Intelligent control; Internet; Tagging; Text categorization; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
Type :
conf
DOI :
10.1109/ICSMC.2006.384568
Filename :
4274332
Link To Document :
بازگشت