• DocumentCode
    3110125
  • Title

    A Voting Method for the Classification of Web Pages

  • Author

    Fang, Rui ; Mikroyannidis, Alexander ; Theodoulidis, Babis

  • Author_Institution
    Sch. of Informatics, Manchester Univ.
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    610
  • Lastpage
    613
  • Abstract
    This paper discusses Web page classification using hypertext features such as the text included in the Web page, the title, headings, URL, and anchor text. Five different classification approaches based on SVM that use individual features or combinations are investigated on the LookSmart dataset. The initial experimental results have shown that combining the features improves the performance of the classifier and that some features such as title and headings can be very useful for certain tasks. On the basis of this analysis, we propose a voting method that further improves the performance compared with the individual classifiers
  • Keywords
    Internet; classification; support vector machines; LookSmart dataset; URL; Web pages classification; hypertext features; support vector machines; voting method; Citation analysis; Classification algorithms; Informatics; Kernel; Performance analysis; Support vector machine classification; Support vector machines; Uniform resource locators; Voting; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology Workshops, 2006. WI-IAT 2006 Workshops. 2006 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    0-7695-2749-3
  • Type

    conf

  • DOI
    10.1109/WI-IATW.2006.23
  • Filename
    4053325