• DocumentCode
    3052901
  • Title

    Research on web filtering technology based on the dual feature selection

  • Author

    Bin Zhang ; Miao Xu ; Minli Wu

  • Author_Institution
    Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2012
  • fDate
    21-23 Sept. 2012
  • Firstpage
    675
  • Lastpage
    679
  • Abstract
    In the topic search system, some of web pages got by crawling are inconsistent with user demands. For this situation, this paper had a research on content-based web filtering technology. This paper proposed a dual feature selection method based on the CHI statistical method and N-gram, and then made binary text classification by SVM in order to achieve Web Filtering. The experiments showed that the proposed web filtering method has better results.
  • Keywords
    Internet; content-based retrieval; information filtering; pattern classification; query formulation; support vector machines; text analysis; SVM; Web pages; binary text classification; content-based Web filtering technology; dual feature selection; support vector machines; topic search system; Feature extraction; Filtering; Procurement; Statistical analysis; Support vector machines; Text categorization; Web pages; CHI statistical method; Feature selection; TF-IDF; Web filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-2201-0
  • Type

    conf

  • DOI
    10.1109/ICNIDC.2012.6418841
  • Filename
    6418841