DocumentCode
2500670
Title
Enhancing Web Page Classification via Local Co-training
Author
Du, Youtian ; Guan, Xiaohong ; Cai, Zhongmin
Author_Institution
MOE Key Lab. for Intell. Networks & Network Security, Xi ´´an Jiaotong Univ., Xi´´an, China
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
2905
Lastpage
2908
Abstract
In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training(LCT). The proposed algorithm employs a set of local models with vector outputs to model the relations among examples in a local region on each view, and iteratively refines the dominant local models (i.e. the local models related to the unlabeled examples chosen for enriching the training set) using unlabeled examples by the co-training process. Compared with previous co-training style algorithms, local co-training has two advantages: firstly, it has higher classification precision by introducing local learning; secondly, only the dominant local models need to be updated, which significantly decreases the computational load. Experiments on WebKB and Cora datasets demonstrate that LCT algorithm can effectively exploit unlabeled data to improve the performance of web page classification.
Keywords
Internet; learning (artificial intelligence); pattern classification; Cora datasets; Web page classification; WebKB datasets; local co-training; machine learning; multiview semi-supervised learning algorithm; Computational modeling; Error analysis; Machine learning; Support vector machines; Training; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.712
Filename
5597065
Link To Document