DocumentCode
3052901
Title
Research on web filtering technology based on the dual feature selection
Author
Bin Zhang ; Miao Xu ; Minli Wu
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2012
fDate
21-23 Sept. 2012
Firstpage
675
Lastpage
679
Abstract
In the topic search system, some of web pages got by crawling are inconsistent with user demands. For this situation, this paper had a research on content-based web filtering technology. This paper proposed a dual feature selection method based on the CHI statistical method and N-gram, and then made binary text classification by SVM in order to achieve Web Filtering. The experiments showed that the proposed web filtering method has better results.
Keywords
Internet; content-based retrieval; information filtering; pattern classification; query formulation; support vector machines; text analysis; SVM; Web pages; binary text classification; content-based Web filtering technology; dual feature selection; support vector machines; topic search system; Feature extraction; Filtering; Procurement; Statistical analysis; Support vector machines; Text categorization; Web pages; CHI statistical method; Feature selection; TF-IDF; Web filtering;
fLanguage
English
Publisher
ieee
Conference_Titel
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4673-2201-0
Type
conf
DOI
10.1109/ICNIDC.2012.6418841
Filename
6418841
Link To Document