• DocumentCode
    2500670
  • Title

    Enhancing Web Page Classification via Local Co-training

  • Author

    Du, Youtian ; Guan, Xiaohong ; Cai, Zhongmin

  • Author_Institution
    MOE Key Lab. for Intell. Networks & Network Security, Xi ´´an Jiaotong Univ., Xi´´an, China
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    2905
  • Lastpage
    2908
  • Abstract
    In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training(LCT). The proposed algorithm employs a set of local models with vector outputs to model the relations among examples in a local region on each view, and iteratively refines the dominant local models (i.e. the local models related to the unlabeled examples chosen for enriching the training set) using unlabeled examples by the co-training process. Compared with previous co-training style algorithms, local co-training has two advantages: firstly, it has higher classification precision by introducing local learning; secondly, only the dominant local models need to be updated, which significantly decreases the computational load. Experiments on WebKB and Cora datasets demonstrate that LCT algorithm can effectively exploit unlabeled data to improve the performance of web page classification.
  • Keywords
    Internet; learning (artificial intelligence); pattern classification; Cora datasets; Web page classification; WebKB datasets; local co-training; machine learning; multiview semi-supervised learning algorithm; Computational modeling; Error analysis; Machine learning; Support vector machines; Training; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.712
  • Filename
    5597065