• DocumentCode
    2860272
  • Title

    GE-CKO: A Method to Optimize Composite Kernels for Web Page Classification

  • Author

    Sun, Jian-Tao ; Zhang, Ben-Yu ; Chen, Zheng ; Lu, Yu-Chang ; Shi, Chun-yi ; Ma, Wei-Ying

  • Author_Institution
    TsingHua University, China
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    299
  • Lastpage
    305
  • Abstract
    Most of current researches on Web page classification focus on leveraging heterogeneous features such as plain text, hyperlinks and anchor texts in an effective and efficient way. Composite kernel method is one topic of interest among them. It first selects a bunch of initial kernels, each of which is determined separately by a certain type of features. Then a classifier is trained based on a linear combination of these kernels. In this paper, we propose an effective way to optimize the linear combination of kernels. We proved that this problem is equivalent to solving a generalized eigenvalue problem. And the weight vector of the kernels is the eigenvector associated with the largest eigen-value. A support vector machine (SVM) classifier is then trained based on this optimized combination of kernels. Our experiment on the WebKB dataset has shown the effectiveness of our proposed method.
  • Keywords
    Asia; Computer science; Eigenvalues and eigenfunctions; Information retrieval; Kernel; Optimization methods; Sun; Support vector machine classification; Support vector machines; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10029
  • Filename
    1410818